Modern applications often consist of many small(er) services, which talk with each other using APIs. To make a good use of such architectures, the different services need to be able to scale individually. The service, which is under load should run in multiple instances. However, other services might not need to scale. In such situation you need some mechanism how to load balance a traffic between
{{#tags}}- {{label}}
{{/tags}}