Sushil Kulkarni

Posted on May 2

Why ChatGPT Uses SSE Instead of WebSockets (And Why You Probably Should Too)

#webdev #javascript #backend #ai

If you've ever watched an AI chatbot “type” its answer word by word, you’ve already seen Server-Sent Events (SSE) in action.

That smooth streaming experience—where text appears token by token instead of waiting for the full response—is one of the reasons modern chat UIs feel fast and alive.

But here’s the interesting part:

Many developers assume this must be powered by WebSockets.

In reality, a huge number of AI chat apps use SSE instead.

And honestly? For most chat interfaces, SSE is often the better choice.

Quick Summary

SSE is great for chat UI because:

It streams AI responses in real time
It avoids constant polling
It uses standard HTTP (simpler infra)
It automatically reconnects if the connection drops
It’s lighter and easier to scale than WebSockets for one-way updates

If your app mostly needs server → client streaming, SSE is probably all you need.

What is SSE?

Server-Sent Events (SSE) is a lightweight web technology that allows the server to push real-time updates to the browser over a single long-lived HTTP connection.

Unlike traditional HTTP requests where the client asks and waits…

SSE keeps the connection open and continuously streams updates.

Think:

AI chatbots
Live dashboards
Stock tickers
Live sports scores
Notifications
Support systems

Basically: anything where the server talks more than the client

Watch this video for visual explanation

The Simple Analogy

SSE is like a long phone call where:

the client listens
the server keeps talking

The browser opens the line.

The server keeps sending updates.

The browser updates the UI instantly.

Simple.

How SSE Works (Step by Step)

1. The Handshake (Client Request)

The browser starts by sending a normal HTTP request.

But with one important header:

Accept: text/event-stream

This tells the server:

“Don’t send one response and close.
Keep the connection open and stream updates.”

2. The Open Pipe (Persistent Connection)

The server responds with:

HTTP/1.1 200 OK
Content-Type: text/event-stream

Now the pipe stays open.

No repeated polling.

No constant reconnecting.

Just one persistent stream.

3. The Data Stream (The Typing Effect)

As your AI model generates tokens…

the server sends small chunks like this:

data: {"text": "Hello"}

data: {"text": " there!"}

Each chunk arrives immediately.

This creates that familiar:

“AI is typing…”

experience.

4. The Client Listener (UI Update)

In JavaScript, the browser uses the native EventSource API:

const eventSource = new EventSource("/stream");

eventSource.onmessage = (event) => {
  const data = JSON.parse(event.data);
  console.log(data.text);
};

Every time new data arrives:

the UI updates instantly.

No refresh.

No polling.

No hacks.

5. Automatic Reconnection

This is underrated.

If the connection drops:

the browser automatically tries to reconnect.

No custom retry logic required.

This makes SSE surprisingly reliable for production systems.

6. Closing the Stream

Once the AI finishes generating:

the server can:

send an end event
or simply close the connection

Done.

Clean and efficient.

Why Not Just Use WebSockets?

Because sometimes WebSockets are overkill.

WebSockets are great when:

both client and server constantly talk
multiplayer apps
collaborative editors
trading systems
gaming

But for AI chat?

Usually the flow is:

User sends prompt once
Server streams response for 20 seconds

That’s mostly one-way communication

Which makes SSE a better fit.

SSE vs WebSockets

Feature	SSE	WebSockets
Direction	Server → Client	Two-way
Protocol	Standard HTTP	Custom WS Protocol
Infra Complexity	Simple	Higher
Auto Reconnect	Yes	Manual
Scaling	Easier	Harder
Best For	AI chat, live feeds	Real-time collaboration

This is why many production AI apps prefer SSE.

Not because it’s “newer.”

Because it’s simpler.

Real-World Use Cases

AI Chatbots

Like OpenAI ChatGPT-style interfaces where responses stream token by token.

Live Support Dashboards

Push:

“Agent is typing…”

“Ticket updated”

instantly.

Notifications

Real-time alerts without polling every 5 seconds.

Live Scoreboards

Sports, finance, operations dashboards.

Perfect fit.

Practical Takeaway

Before adding WebSockets to your architecture, ask:

Do I really need two-way real-time communication?

If the answer is:

“Mostly the server sends updates”

Use SSE first.

It’s simpler.

Cheaper.

Cleaner.

And often better.

Final Thought

Good engineering is not about choosing the most powerful tool.

It’s about choosing the simplest tool that solves the problem.

For modern chat UIs:

that tool is often SSE.

Not WebSockets.

Not polling.

Just elegant HTTP streaming.

And honestly…

that’s beautiful engineering.

What Do You Prefer?

Have you used SSE in production?

Do you still prefer WebSockets for chat systems?

Or do you think polling is still underrated?

I’d love to hear your take.

DEV Community