System Design (6 Part Series)
A Load Balancers is a critical component in designing any distributed systems. Load Balancer does its magic by diving the traffic among a group of server thus resulting in improved response and availability of a website or application.
In a traditional client-server architecture, we used to have multiple clients trying to connect to one server. The problem with such architecture was if the traffic volume or number of requests increased then it increased the load on the server hence affecting the response time of the server. Also, there was only one server hence it was prone to failure i.e, it the server goes down, the entire application/website will be impacted.
To overcome the problem with the traditional Client-Server architecture, Load Balancers are used. As discussed above, the load balancer works by distributing the high volume of incoming traffic among the group of available servers, hence making the distributed system fault-tolerant(i.e, by removing the single point of failure).
Load Balancers. Source: educative.io
Load Balancers reside between the client and server, hence when a client makes a request to the server, the request is first received by the Load Balancers which then handles the request by routing it to a backend server. The server then responds directly to the client.
Client =====> Load Balancer =====> Server
Client <============================== Server
Least Connection Method — The load balancers will choose the server handling the least number of active connections. This algorithm is particularly useful when the load is unevenly distributed.
The Least Response Time Method — The server with the lowest average response time will be chosen. Some load balancers will consider the server with the lowest average response time and the least number of active connections.
Round Robin Method — This algorithm will work as follows: the first request is sent to the first server, then the next to the second, and so on to the last. Then it is started again, assigning the next request to the first server, and so on.
Weighted Round Robin Method — Every server is assigned an integer value know as weight based on their processing capacity. While serving a new request, the server will be chosen on the basis of weights(the server with higher weight will be preferred over the server with less weight).
Least Bandwidth Method — The algorithm will choose the server currently serving the least amount of traffic, measured in megabits per second (Mbps).
Hash — Hash of client IP address is used to redirect the request to the server.
- Fasters response time and increased throughput.
- A fault-tolerant system as it removes the problem of a single point of failure.
- The system becomes highly scalable.
- Higher availability and negligible downtime.
- Application of Load Balancers
- Load Balancer uses Horizontal Scaling
Originally published at nlogn.in on March 11, 2020.
Posted on Mar 21 by: