Scaling the Frontend
Architecture
- GeoDNS for load balancing between DCs.
- CDN
- Load Balancer
- Remove State from the App
- sessions => in-memory store (Redis, Memcached); see the sketch after this list
- locks => in-memory store (Redis, Memcached)
- uploaded files => External File Storage
- External File Storage
- S3
- SAN or FTP
- ideally the app doesn’t know it’s using a network file system (the mount looks like a local disk)
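A minimal sketch of keeping sessions out of app-server memory, assuming the redis-py client and a local Redis; the key names and TTL are illustrative, not from the notes:

```python
import json
import uuid

import redis  # redis-py client: pip install redis

r = redis.Redis(host="localhost", port=6379, decode_responses=True)
SESSION_TTL = 30 * 60  # 30 minutes, chosen arbitrarily for the example

def create_session(user_id: int) -> str:
    """Store the session in Redis, not in app-server memory,
    so any app server behind the LB can handle the next request."""
    session_id = str(uuid.uuid4())
    r.setex(f"session:{session_id}", SESSION_TTL, json.dumps({"user_id": user_id}))
    return session_id

def load_session(session_id: str) -> dict | None:
    data = r.get(f"session:{session_id}")
    return json.loads(data) if data else None
```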
Load Balancing
- Benefits
- Spread the load across different servers
- Rolling updates and maintenance
- Implementation
- Smart Clients - not a good idea: any infrastructure change requires changing every client.
- DNS (see the resolution sketch after this list)
- - many caches on the path, some of which ignore TTLs; as a result it’s hard to do a rolling update, replace servers, add capacity, and so on
- - uneven distribution when some clients are more active than others
- - limited number of servers per response (classic UDP DNS responses are capped at 512 bytes)
- Hosted Solution (ELB)
- Software Based (Nginx, HAProxy)
- Hardware Based (F5)
- very good performance
- big upfront cost
- big cost to operate (engineers with the required skills are scarce)
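To see what DNS-based balancing actually hands the client, here’s a quick check of how one name can resolve to multiple addresses (example.com is just a placeholder host):

```python
import socket

# A name used for DNS load balancing resolves to several A records;
# the client (or its resolver and the caches in between) picks one.
for family, _, _, _, sockaddr in socket.getaddrinfo(
        "example.com", 80, type=socket.SOCK_STREAM):
    print(family, sockaddr)  # each entry is one candidate server address
```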
Load Balancing Methods
- NAT - rewrite the source IP to the LB’s IP and the destination IP to the backend’s IP; keep this mapping in memory to rewrite the IPs in response packets (see the toy sketch after this list)
- - consumes a lot of CPU and memory on the LB under high load
- IP Tunneling
- an IP tunnel is set up in advance between the LB and every backend
- the LB encapsulates the packet, setting the destination IP to the chosen backend’s IP
- the backend strips the tunnel header, processes the packet, and sends the response directly to the client
- + no per-packet rewriting, unlike NAT
- - backends must know about (and be configured for) IP tunneling
- Direct routing
- the LB rewrites the destination MAC address to the MAC address of the chosen backend
- - the LB and backends must be on the same L2 network
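A toy sketch (plain Python, no real packet handling) of the per-flow state a NAT-mode LB keeps, which is where its CPU/memory cost comes from; the backend IPs are made up:

```python
import itertools

BACKENDS = ["10.0.0.11", "10.0.0.12"]         # hypothetical backend IPs
backend_cycle = itertools.cycle(BACKENDS)

# One entry per live flow: (client_ip, client_port) -> backend_ip.
# This table is the memory cost of NAT mode, and rewriting every packet
# in both directions is the CPU cost; tunneling and direct routing avoid
# the return-path rewrite by letting backends answer clients directly.
nat_table: dict[tuple[str, int], str] = {}

def pick_backend(client_ip: str, client_port: int) -> str:
    key = (client_ip, client_port)
    if key not in nat_table:
        nat_table[key] = next(backend_cycle)  # choose a backend once per flow
    return nat_table[key]                     # destination IP gets rewritten to this

print(pick_backend("203.0.113.7", 54321))     # -> 10.0.0.11
```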
Load Balancer Tasks
- Send health-check probes to backends to make sure they are healthy.
- Flow control
- count pending requests per backend and cap them at some number (e.g. a backend with 100 pending requests is likely overloaded and struggling, so it’s better not to send it more; see the sketch after this list)
- Subsetting - each load balancer connects to only a subset of backends (so no single LB has to keep connections to every backend in memory)
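A minimal sketch of the pending-request cap described above; the backend addresses and the send_to stub are hypothetical:

```python
MAX_PENDING = 100                              # the threshold from the note above

pending: dict[str, int] = {"10.0.0.11": 0, "10.0.0.12": 0}  # hypothetical backends

def send_to(backend: str, request: str) -> str:
    return f"{backend} handled {request}"      # stub standing in for a real network call

def dispatch(request: str, backend: str) -> str:
    # Flow control: refuse to pile more work onto a struggling backend.
    if pending[backend] >= MAX_PENDING:
        raise RuntimeError(f"{backend} is overloaded, pick another backend")
    pending[backend] += 1                      # request is now in flight
    try:
        return send_to(backend, request)
    finally:
        pending[backend] -= 1                  # done (or failed): no longer pending

print(dispatch("GET /", "10.0.0.11"))
```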
Load Balancing Algorithms
- Random
- - uneven traffic distribution, especially with a small number of backends
- Round Robin
- + simple to implement
- - varying query cost
- - varying machine hardware (a big DC has a wide variety of CPU generations)
- - unpredictable events
- noisy neighbors
- task restarts
- Least Loaded Round Robin (least connections)
- + spreads requests based on backend load
- - if a backend is unhealthy it can end up taking close to 100% of requests, because errors are served very fast (so its connection count stays the lowest). As a fix, count recent errors as active connections.
- - no adjustment based on CPU power
- - the connection count for a backend does not include requests from other load balancers (each LB only sees its own)
- Weighted Round Robin - backends track the number of served requests and resource utilization (mostly CPU) and report them to the load balancer, so the LB can choose the best backend.
- Least Bandwidth - pick the backend currently serving the least traffic
- IP Hash - a hash of the client IP picks the backend, so a given client consistently lands on the same one (see the sketch after this list)
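A sketch of three of the policies above over an in-memory backend list (addresses and counters are illustrative only):

```python
import hashlib
import itertools

BACKENDS = ["10.0.0.11", "10.0.0.12", "10.0.0.13"]  # hypothetical pool

# Round Robin: rotate through the list regardless of load.
_rr = itertools.cycle(BACKENDS)
def round_robin() -> str:
    return next(_rr)

# Least connections: pick the backend with the fewest in-flight requests
# *as seen by this LB* (it cannot see other LBs' connections, see above).
active = {b: 0 for b in BACKENDS}
def least_connections() -> str:
    return min(BACKENDS, key=lambda b: active[b])

# IP hash: the same client IP always maps to the same backend,
# giving stickiness without shared session state.
def ip_hash(client_ip: str) -> str:
    digest = hashlib.md5(client_ip.encode()).hexdigest()
    return BACKENDS[int(digest, 16) % len(BACKENDS)]

print(round_robin(), least_connections(), ip_hash("203.0.113.7"))
```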
The Role of a Reverse Proxy
- Handling user connections, working with slow clients, keep-alive
- Nginx uses an async model of handling requests: a worker process waits on many sockets at once in the epoll syscall; when some sockets have data, the process wakes up and handles them (see the event-loop sketch at the end of this section)
- The async model performs very well: there is only one thread per worker (excluding helper threads for reading files), so no time is spent on context switches (which are expensive because of saving/restoring registers and flushing the TLB)
- That’s why a reverse proxy is so good at handling incoming connections: it doesn’t create a new thread/process per connection and therefore doesn’t burn memory/CPU time on them
- SSL termination
- Serving static files - after a while all the static content ends up in memory (Linux keeps files read from disk in the page cache), and if there is enough memory to hold all static files, we always serve them from memory.
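A single-threaded event loop in the spirit of nginx’s model, using Python’s selectors module (epoll-backed on Linux); this illustrates the idea only and is not how nginx is actually implemented:

```python
import selectors
import socket

sel = selectors.DefaultSelector()           # uses epoll on Linux

listener = socket.socket()
listener.bind(("127.0.0.1", 8080))
listener.listen()
listener.setblocking(False)
sel.register(listener, selectors.EVENT_READ)

# One thread handles every connection: we sleep inside the poll syscall
# until some registered socket is ready, then process just those sockets.
while True:
    for key, _ in sel.select():             # blocks until sockets have data
        sock = key.fileobj
        if sock is listener:
            conn, _ = listener.accept()     # new client connection
            conn.setblocking(False)
            sel.register(conn, selectors.EVENT_READ)
        else:
            data = sock.recv(4096)
            if data:                        # minimal canned HTTP response
                sock.sendall(b"HTTP/1.1 200 OK\r\nContent-Length: 2\r\n\r\nok")
            else:                           # empty read: client closed
                sel.unregister(sock)
                sock.close()
```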