18px
Architecture·December 17, 2025·12 min read

Why YouTube Live Chat Scales Without WebSockets

How YouTube serves millions of concurrent viewers with HTTP-based polling instead of WebSockets, saving tens of millions in infrastructure costs while maintaining real-time performance.

Why YouTube Live Chat Scales Without WebSockets

Introduction

The Core Problem: Live Chat at Internet Scale

Common Assumption: WebSockets = Real-Time Chat

Why WebSockets Do Not Scale Well for YouTube

1. Persistent Connections Are Expensive

2. WebSockets Break CDN Economics

3. Load Balancing Becomes Hard

What YouTube Uses Instead

Why This Works

Cost Comparison (Back-of-the-Envelope)

WebSockets Scenario

HTTP Polling Scenario

Estimated Savings

Why Twitch Made a Different Choice

A Broader Industry Pattern

Lessons for System Designers

Conclusion

Filed under fieldnotesDecember 17, 2025