Comment by Sébastien Portebois on "How we can divert traffic in same replica of backend module ?"

It really depends on the state of your backend, and where your bottleneck is.

By state, I first think about stateful vs stateless: the common best practice as you develop your backend is to build is stateless (as per the famous 12factor.net mantra), then your solution becomes as simple as a load-balancing problem (think NGinx or HAProxy, or look at newer service meshes, and but since you used the #kubernetes tags then it'S even simpler since Kubernetes as load-balancing built-in and it's just of matter of increasing your desired replicas, in fact k8s comes with a pod autoscaler)

But the issue might come from a database capacity, then it'S other problem and other solutions.

Search Hashnode