Kfir

Posted on Nov 5 • Originally published at levelup.gitconnected.com on Oct 20

How the Browser Works

#css #browsers #webdev #javascript

Photo by Jonny Caspari on Unsplash

Most of us use browsers every day, but few understand how they actually work under the hood.

Knowing this helps us write more efficient, predictable web applications — and, honestly, it’s just fascinating.

Let’s take a look at what happens when you open a webpage, step by step.

The Browser Is Almost an Operating System

A modern browser isn’t just a window to the web- it’s a small operating system.

It manages memory, processes, file storage, networking, rendering, JavaScript execution, and a user interface layer.

Browser Capabilities

Networking: Fetch resources over HTTP(S).
Data storage: Cookies, localStorage, IndexedDB.
Execution: Run JavaScript securely in a sandboxed environment.
Rendering: Parse and paint HTML and CSS.
UI: Manage tabs, buttons, and the visual interface.

Each of these parts works together seamlessly- and it all starts with fetching and parsing data.

Behind the Scenes: Browser Architecture

At a high level, a browser consists of:

UI Layer: What you interact with- address bar, tabs, etc.
Data Storage Layer: Where cookies, cache, and site data live.
Browser Engine: The core that coordinates everything. It’s divided into two main parts:
Rendering Engine: Parses HTML and CSS, creates the visual output.
JavaScript Engine: Parses, compiles, and executes JS code.

Examples:

Blink (used by Chrome, Edge, Opera)
WebKit (used by Safari)
Gecko (used by Firefox)

The HTML Journey

When you load a webpage, here’s what happens to the HTML file:

Get raw bytes from the network.
Convert bytes → characters using the correct encoding (e.g., UTF-8).
Tokenize the characters into meaningful pieces — e.g.,

<h1>, <p>, <div>.

Build objects for each element (with parent/child/sibling relationships).
Construct the DOM (Document Object Model) — a live tree structure representing the page.

The DOM isn’t just a static representation- it’s interactive.

JavaScript can modify it, and the browser will update the display accordingly.

Reference: HTML Living Standard

CSS and the Render Tree

CSS follows a similar process:

Parse raw bytes → characters → tokens → nodes.
Build the CSSOM (CSS Object Model)- representing all style rules.
Combine the DOM + CSSOM → Render Tree.
The render tree goes through:

Layout: Calculating sizes and positions.
Painting: Filling pixels on the screen.

The rendering engine handles this- running complex optimizations to repaint only what’s needed.

Reference: CSSOM Specification

JavaScript: The DOM Manipulator

When the browser encounters a <script> tag, it pauses HTML and CSS parsing to execute JavaScript.

That’s because JavaScript can modify the DOM or request additional resources, so the browser needs to know the final structure before continuing.

However, this can block rendering if the CSSOM isn’t ready yet- since the script might depend on styles.

To avoid blocking, we can use:

<script src="app.js" defer></script>

defer tells the browser to continue parsing HTML and execute JS after the DOM is ready.
async executes JS as soon as it’s downloaded, independently of DOM parsing.

Reference: HTML Spec- Script Element

Putting It All Together

Here’s a simplified flow:

HTML → DOM
CSS → CSSOM
DOM + CSSOM → Render Tree → Layout → Paint
JS → Can manipulate DOM/CSSOM → Re-render

Every click, scroll, or animation triggers a careful dance between these systems.

Why It Matters

Understanding how browsers work helps you:

Write non-blocking, performant code.
Optimize paint and layout cycles.
Debug complex rendering or loading issues.
Appreciate just how much happens before your app even loads.

Browsers are one of the most sophisticated pieces of software on your machine- and now you know why.

Further Reading:

MDN: How browsers work

DEV Community