woai3c

Posted on Jan 26

Make a tiny browser engine from scratch

#webdev #javascript #node #frontend

Introduction

Browser rendering principles are essential knowledge for frontend developers and are frequently discussed in interviews and frontend training courses. You can also find related descriptions in MDN documentation.

As a seasoned frontend developer, I understand browser rendering principles, but my knowledge has been limited to theoretical aspects. Therefore, I decided to build a tiny browser engine from scratch.

The rendering engine is a component of the browser that transforms source code (HTML, CSS, JavaScript) into a format that users can read, view, and hear. However, implementing a complete browser engine alone would be too challenging and time-consuming. So, I decided to take a step back and create a tiny browser engine instead. I happened to find an open-source toy rendering engine called Robinson written in Rust on Github, which inspired me to create my own version using JavaScript. I've also published it on Github as tiny-rendering-engine.

This tiny rendering engine consists of five phases:

Parse HTML and generate DOM tree
Parse CSS and generate CSS rule collection
Combine DOM tree and CSS rules to create Style tree
Generate Layout tree
Painting

I've created separate branches on Github for the code of each phase. Since understanding the entire rendering engine's code at once might be challenging, I advise starting with the first branch and progressing step by step, from easy to difficult, for better learning efficiency.

HTML parser - v1 branch
CSS parser - v2 branch
Style tree - v3 branch
Layout tree - v4 branch
Painting - v5 branch

Now, let's look at how to create an HTML parser.

HTML parser

The purpose of the HTML parser is to transform HTML code into a DOM tree. For example:

<div class="lightblue test" id=" div " data-index="1">test!</div>

The above of HTML code will be transforming as a DOM tree:

{
    "tagName": "div",
    "attributes": {
        "class": "lightblue test",
        "id": "div",
        "data-index": "1"
    },
    "children": [
        {
            "nodeValue": "test!",
            "nodeType": 3
        }
    ],
    "nodeType": 1
}

Writing a parser requires some knowledge of compilation principles such as lexical analysis and syntactic analysis. However, our tiny parser is very simple, so it's okay even if you don't understand these principles - you'll understand once you see the code.

Looking back at the HTML code above, the entire parsing process is shown in the following picture.

Each piece of HTML code has its corresponding parsing method.

To simplify the HTML parser, we need to add some restrictions:

HTML tag must be shown with a pair: <div>...</div>
HTML attribute value must be quoted: <div class="test">...</div>
Don't support comments
No need for most error handling
Only support two nodes: Element and Text

With these restrictions, the HTML parser will be simpler.

Node Types

First, we need to design data structure to support different node types:

export enum NodeType {
    Element = 1,
    Text = 3,
}

export interface Element {
    tagName: string
    attributes: Record<string, string>
    children: Node[]
    nodeType: NodeType.Element
}

interface Text {
    nodeValue: string
    nodeType: NodeType.Text
}

export type Node = Element | Text

And then we need two create function:

export function element(tagName: string) {
    return {
        tagName,
        attributes: {},
        children: [],
        nodeType: NodeType.Element,
    } as Element
}

export function text(data: string) {
    return {
        nodeValue: data,
        nodeType: NodeType.Text,
    } as Text
}

These two functions will return corresponding DOM nodes when they are called in parsing Element code or Text code.

HTML Parser Execution Process

The following diagram shows the execution process of the HTML parser:

The entry point of the HTML parser is the parse() method, which traverses and parses all HTML text until the end:

Check if the current character is <. If it is, parse it as an Element node by calling parseElement(); otherwise, call parseText().
parseText() is relatively simple - it traverses forward through the string until encountering the < character. All characters between the current position and the < character become the value of the Text node.
parseElement() is more complex. First, it calls parseTag() to parse and obtain the element's tag name.
Then it enters the parseAttrs() method to check for attribute nodes. If the node has a class or other HTML attributes, it calls parseAttr() to parse these attributes.
At this point, the first half of the element node has been parsed. Next, it needs to parse the element's child nodes. This creates a recursive process, returning to step 1.
After all child nodes are parsed, it calls parseTag() to verify that the ending tag name matches the starting tag name. If they match, parseElement() or parse() completes; otherwise, it throws an error.

Detailed Implementation of HTML Parser Methods

Entry point `parse()`

The entry point of the HTML parser is parse(rawText):

parse(rawText: string) {
    this.rawText = rawText.trim()
    this.len = this.rawText.length
    this.index = 0
    this.stack = []

    const root = element('root')
    while (this.index < this.len) {
        this.removeSpaces()
        if (this.rawText[this.index].startsWith('<')) {
            this.index++
            this.parseElement(root)
        } else {
            this.parseText(root)
        }
    }
}

The parse() method traverses through the entire HTML text. It first checks if the current character is <. If it is, the text is treated as an Element node and parseElement() is called; otherwise, it's treated as a Text node and parseText() is called.

Parse Element Node `parseElement()`

private parseElement(parent: Element) {
    // Parse tag
    const tag = this.parseTag()
    // Generate element node
    const ele = element(tag)

    this.stack.push(tag)

    parent.children.push(ele)
    // Parse attributes
    this.parseAttrs(ele)

    while (this.index < this.len) {
        this.removeSpaces()
        if (this.rawText[this.index].startsWith('<')) {
            this.index++
            this.removeSpaces()
            // Check if it's an end tag
            if (this.rawText[this.index].startsWith('/')) {
                this.index++
                const startTag = this.stack[this.stack.length - 1]
                // End tag
                const endTag = this.parseTag()
                if (startTag !== endTag) {
                    throw Error(`The end tagName ${endTag} does not match start tagName ${startTag}`)
                }

                this.stack.pop()
                while (this.index < this.len && this.rawText[this.index] !== '>') {
                    this.index++
                }

                break
            } else {
                this.parseElement(ele)
            }
        } else {
            this.parseText(ele)
        }
    }

    this.index++
}

parseElement() first calls parseTag() and parseAttrs() to parse the tag name and attributes, then recursively parses child nodes until all HTML text has been processed.

Parse Text Node `parseText()`

private parseText(parent: Element) {
    let str = ''
    while (
        this.index < this.len
        && !(this.rawText[this.index] === '<' && /\w|\//.test(this.rawText[this.index + 1]))
    ) {
        str += this.rawText[this.index]
        this.index++
    }

    this.sliceText()
    parent.children.push(text(removeExtraSpaces(str)))
}

Parsing text nodes is relatively simpler. The method continues to traverse forward until it encounters the < character. For example, when processing the HTML text <div>test!</div>, parseText() extracts the value test!.

Parse Tag `parseTag()`

After entering parseElement(), the first call is to parseTag(), which parses the tag name:

private parseTag() {
    let tag = ''

    this.removeSpaces()

    // get tag name
    while (this.index < this.len && this.rawText[this.index] !== ' ' && this.rawText[this.index] !== '>') {
        tag += this.rawText[this.index]
        this.index++
    }

    this.sliceText()
    return tag
}

For example, when processing the HTML text <div>test!</div>, parseTag() extracts the tag name div.

Parse Attribute Nodes `parseAttrs()`

After parsing the tag name, the next step is to parse attribute nodes:

private parseAttrs(ele: Element) {
    // Continue traversing until encountering '>', indicating the end of the <div ....> segment
    while (this.index < this.len && this.rawText[this.index] !== '>') {
        this.removeSpaces()
        this.parseAttr(ele)
        this.removeSpaces()
    }

    this.index++
}

// parse a single attribute, such as class="foo bar"
private parseAttr(ele: Element) {
    let attr = ''
    let value = ''
    while (this.index < this.len && this.rawText[this.index] !== '=' && this.rawText[this.index] !== '>') {
        attr += this.rawText[this.index++]
    }

    this.sliceText()
    attr = attr.trim()
    if (!attr.trim()) return

    this.index++
    let startSymbol = ''
    if (this.rawText[this.index] === "'" || this.rawText[this.index] === '"') {
        startSymbol = this.rawText[this.index++]
    }

    while (this.index < this.len && this.rawText[this.index] !== startSymbol) {
        value += this.rawText[this.index++]
    }

    this.index++
    ele.attributes[attr] = value.trim()
    this.sliceText()
}

parseAttr() can parse HTML text such as class="test" into an object { class: "test" }.

Helper Methods

Sometimes there are many unnecessary spaces between different nodes and attributes, so we need a method to remove them:

protected removeSpaces() {
    while (this.index < this.len && (this.rawText[this.index] === ' ' || this.rawText[this.index] === '\n')) {
        this.index++
    }

    this.sliceText()
}

For debugging purposes, developers need to check the current character being processed. If all previously processed characters remain in the text, debugging becomes more difficult as developers need to manually find the current character based on the index value. Therefore, we need to remove all processed characters to ensure only unprocessed text remains:

protected sliceText() {
    this.rawText = this.rawText.slice(this.index)
    this.len = this.rawText.length
    this.index = 0
}

The sliceText() method removes all processed characters. For example, when parsing the tag name div:

After parsing, we need to remove the processed text, as shown in the following diagram:

Brief summary

In conclusion, we have covered the complete logic of the HTML parser. The entire implementation consists of approximately 200 lines of code, or around 100 lines excluding TypeScript type declarations.

CSS Parser

A CSS stylesheet is a collection of CSS rules, and the purpose of CSS parser is to transform CSS text into a CSS rule collection.

div, p {
    font-size: 88px;
    color: #000;
}

For example, the CSS parser will transform the above CSS text into the following CSS rule collection:

[
    {
        "selectors": [
            {
                "id": "",
                "class": "",
                "tagName": "div"
            },
            {
                "id": "",
                "class": "",
                "tagName": "p"
            }
        ],
        "declarations": [
            {
                "name": "font-size",
                "value": "88px"
            },
            {
                "name": "color",
                "value": "#000"
            }
        ]
    }
]

Each rule has a selectors and declarations attribute, where selectors indicates CSS selectors, and declarations indicates a collection of CSS property declarations.

export interface Rule {
    selectors: Selector[]
    declarations: Declaration[]
}

export interface Selector {
    tagName: string
    id: string
    class: string
}

export interface Declaration {
    name: string
    value: string | number
}

Each CSS rule can contain multiple selectors and CSS properties.

Parse CSS Rule `parseRule()`

private parseRule() {
    const rule: Rule = {
        selectors: [],
        declarations: [],
    }

    rule.selectors = this.parseSelectors()
    rule.declarations = this.parseDeclarations()

    return rule
}

In parseRule(), it calls parseSelectors() to parse CSS selectors, and then calls parseDeclarations() to parse CSS properties from the remaining CSS text.

Parse Selector `parseSelector()`

private parseSelector() {
    const selector: Selector = {
        id: '',
        class: '',
        tagName: '',
    }

    switch (this.rawText[this.index]) {
        case '.':
            this.index++
            selector.class = this.parseIdentifier()
            break
        case '#':
            this.index++
            selector.id = this.parseIdentifier()
            break
        case '*':
            this.index++
            selector.tagName = '*'
            break
        default:
            selector.tagName = this.parseIdentifier()
    }

    return selector
}

private parseIdentifier() {
    let result = ''
    while (this.index < this.len && this.identifierRE.test(this.rawText[this.index])) {
        result += this.rawText[this.index++]
    }

    this.sliceText()
    return result
}

We only support tag names, ID selectors with the # prefix, class selectors with the . prefix, or combinations of these. If the tag name is *, it represents a universal selector that can match any tag.

The standard CSS parser will skip unrecognized parts and continue parsing the remaining CSS text. This behavior ensures compatibility with older browsers and prevents program interruption due to errors. Our CSS parser is simpler and doesn't include such error handling.

Parse CSS Properties `parseDeclaration()`

private parseDeclaration() {
    const declaration: Declaration = { name: '', value: '' }
    this.removeSpaces()
    declaration.name = this.parseIdentifier()
    this.removeSpaces()

    while (this.index < this.len && this.rawText[this.index] !== ':') {
        this.index++
    }

    this.index++ // clear :
    this.removeSpaces()
    declaration.value = this.parseValue()
    this.removeSpaces()

    return declaration
}

parseDeclaration() will parse CSS text such as color: red; into an object { name: "color", value: "red" }.

Brief Summary

The CSS parser is relatively simpler since most concepts have been covered in the HTML parser section. The entire CSS parser's code is approximately 100 lines, and if you have read the HTML parser's code, you should find the CSS parser's code easier to understand.

Build Style Tree

The purpose of this phase is to write a style tree builder that takes a DOM tree and a collection of CSS rules as input and generates a style tree.

Each node in the style tree contains CSS property values and a reference to its corresponding DOM node:

interface AnyObject {
    [key: string]: any
}

export interface StyleNode {
    node: Node // DOM node
    values: AnyObject // style property values
    children: StyleNode[] // style tree children
}

Let's look at a simple example:

<div>test</div>

div {
    font-size: 88px;
    color: #000;
}

The above HTML and CSS will be transformed by the style builder into a style tree:

{
    "node": { // DOM node
        "tagName": "div",
        "attributes": {},
        "children": [
            {
                "nodeValue": "test",
                "nodeType": 3
            }
        ],
        "nodeType": 1
    },
    "values": { // CSS property values
        "font-size": "88px",
        "color": "#000"
    },
    "children": [ // style tree children
        {
            "node": {
                "nodeValue": "test",
                "nodeType": 3
            },
            "values": { // text node inherits parent's styles
                "font-size": "88px",
                "color": "#000"
            },
            "children": []
        }
    ]
}

Traverse DOM Tree

Now we need to traverse the DOM tree and check if each node matches any CSS rules.

export function getStyleTree(eles: Node | Node[], cssRules: Rule[], parent?: StyleNode) {
    if (Array.isArray(eles)) {
        return eles.map((ele) => getStyleNode(ele, cssRules, parent))
    }

    return getStyleNode(eles, cssRules, parent)
}

Match Selector

The selector matching is easier to implement since our CSS parser only supports simple selectors. We just need to check if the element itself matches the selector.

/**
 * Check if CSS selector matches the element
 */
function isMatch(ele: Element, selectors: Selector[]) {
    return selectors.some((selector) => {
        // Universal selector
        if (selector.tagName === '*') return true
        if (selector.tagName === ele.tagName) return true
        if (ele.attributes.id === selector.id) return true

        if (ele.attributes.class) {
            const classes = ele.attributes.class.split(' ').filter(Boolean)
            const classes2 = selector.class.split(' ').filter(Boolean)
            for (const name of classes) {
                if (classes2.includes(name)) return true
            }
        }

        return false
    })
}

Once we find the matching DOM node, we need to combine the DOM node with its matching CSS properties to output a style tree node:

function getStyleNode(ele: Node, cssRules: Rule[], parent?: StyleNode) {
    const styleNode: StyleNode = {
        node: ele,
        values: getStyleValues(ele, cssRules, parent),
        children: [],
    }

    if (ele.nodeType === NodeType.Element) {
        // Merge inline styles
        if (ele.attributes.style) {
            styleNode.values = { ...styleNode.values, ...getInlineStyle(ele.attributes.style) }
        }

        styleNode.children = ele.children.map((e) => getStyleNode(e, cssRules, styleNode)) as unknown as StyleNode[]
    }

    return styleNode
}

function getStyleValues(ele: Node, cssRules: Rule[], parent?: StyleNode) {
    const inheritableAttrValue = getInheritableAttrValues(parent)

    // Text nodes inherit inheritable properties from parent
    if (ele.nodeType === NodeType.Text) return inheritableAttrValue

    return cssRules.reduce((result: AnyObject, rule) => {
        if (isMatch(ele as Element, rule.selectors)) {
            result = { ...result, ...cssValueArrToObject(rule.declarations) }
        }

        return result
    }, inheritableAttrValue)
}

In CSS selectors, different selectors have different priorities. For example, ID selector's priority is higher than class selectors. However, for simplicity, we haven't implemented selector priorities - all selectors have the same priority.

Inherit Property

Text nodes can't match any selector, so where do their styles come from? The answer is inheritance - text nodes inherit styles from their parent nodes.

There are many inheritable properties in CSS. Even when child nodes haven't declared certain properties, they can still inherit them from their parents. For example, font color, font family and so on are all inheritable. For simplicity, we only support inheriting the color and font-size properties from parent nodes.

// Inheritable properties for child elements, only two listed here but there are many more
const inheritableAttrs = ['color', 'font-size']

/**
 * Get inheritable property values from parent element
 */
function getInheritableAttrValues(parent?: StyleNode) {
    if (!parent) return {}
    const keys = Object.keys(parent.values)
    return keys.reduce((result: AnyObject, key) => {
        if (inheritableAttrs.includes(key)) {
            result[key] = parent.values[key]
        }

        return result
    }, {})
}

Inline Style

In CSS, inline styles have the highest priority except for !important.

<span style="color: red; background: yellow;">

We first call getStyleValues() to get the current DOM node's CSS property values, and then get the node's inline styles. The inline styles will override the current node's styles.

Layout Tree

The fourth phase, transforming a style tree into a layout tree, is one of the more complex parts of the entire rendering engine.

CSS Box Model

In CSS, every DOM node can be represented as a box. The box model consists of content, padding, border, margin, and information about the node's position on the page.

We can represent the box model using the following data structures:

export default class Dimensions {
    content: Rect
    padding: EdgeSizes
    border: EdgeSizes
    margin: EdgeSizes
}

export default class Rect {
    x: number
    y: number
    width: number
    height: number
}

export interface EdgeSizes {
    top: number
    right: number
    bottom: number
    left: number
}

Block Layout and Inline Layout

The CSS display property determines how a box model is laid out. While the display property can have many values such as block, inline, flex, and others, we will only support block and inline layouts in our implementation. By default, all box models have display: inline.

Let's look at the differences between these layouts using HTML code:

<container>
  <a></a>
  <b></b>
  <c></c>
  <d></d>
</container>

With block layout, elements are stacked vertically (top to bottom):

With inline layout, elements are arranged horizontally (left to right):

When a container has both block and inline elements, we wrap the inline elements in an anonymous block container:

This allows us to properly handle both inline and block elements within the same container.

Generally, page content grows vertically. When child nodes are added to a container, they increase the container's height rather than its width. In other words, child nodes typically expand to fill their container's width, while the container's height expands to accommodate its child nodes.

Layout Tree

The layout tree is a collection of box models.

export default class LayoutBox {
    dimensions: Dimensions
    boxType: BoxType
    children: LayoutBox[]
    styleNode: StyleNode
}

Each box model can be of type block, inline, or anonymous:

export enum BoxType {
    BlockNode = 'BlockNode',
    InlineNode = 'InlineNode',
    AnonymousBlock = 'AnonymousBlock',
}

We generate box models according to each DOM node's display property when building the style tree.

When a block node contains an inline child node, we need to create an anonymous node (which is actually a block node) to wrap the child node. If there are multiple inline child nodes in a row, they all need to be placed in the same anonymous node.

function buildLayoutTree(styleNode: StyleNode) {
    if (getDisplayValue(styleNode) === Display.None) {
        throw new Error('Root node has display: none.')
    }

    const layoutBox = new LayoutBox(styleNode)

    let anonymousBlock: LayoutBox | undefined
    for (const child of styleNode.children) {
        const childDisplay = getDisplayValue(child)
        // Skip if DOM node has display: none
        if (childDisplay === Display.None) continue

        if (childDisplay === Display.Block) {
            anonymousBlock = undefined
            layoutBox.children.push(buildLayoutTree(child))
        } else {
            // Create an anonymous container for inline nodes
            if (!anonymousBlock) {
                anonymousBlock = new LayoutBox()
                layoutBox.children.push(anonymousBlock)
            }

            anonymousBlock.children.push(buildLayoutTree(child))
        }
    }

    return layoutBox
}

Traverse Layout Tree

To start building the layout tree, we use the entry point function getLayoutTree():

export function getLayoutTree(styleNode: StyleNode, parentBlock: Dimensions) {
    parentBlock.content.height = 0
    const root = buildLayoutTree(styleNode)
    root.layout(parentBlock)
    return root
}

The entry point traverses the style tree, combines the relevant information from style tree nodes to generate a LayoutBox object, and then calls the layout() method. This method calculates the position and dimension information for each box model.

As mentioned at the beginning of the chapter, a box model's width depends on its parent, while its height depends on its child nodes. This means our code needs to traverse the tree top-down when calculating widths (so we can set child node widths after knowing their parent's width), and then bottom-up when calculating heights (so we can calculate parent heights after knowing their children's dimensions).

layout(parentBlock: Dimensions) {
    // Calculate current node's width before traversing children
    // since child width depends on parent width
    this.calculateBlockWidth(parentBlock)
    // Calculate box node position
    this.calculateBlockPosition(parentBlock)
    // Traverse children and calculate their positions and dimensions
    this.layoutBlockChildren()
    // Calculate current node's height after children
    // since parent height depends on children's height
    this.calculateBlockHeight()
}

This method performs one complete traversal of the layout tree - top-down for width calculations and bottom-up for height calculations. A production-grade layout engine might perform multiple tree traversals, alternating between top-down and bottom-up passes as needed.

Calculating Width

Now, let's first calculate the box model's width. This part is complex, so we need to explain it in detail.

First, we need to get the current node's width, padding, border, and margin information:

calculateBlockWidth(parentBlock: Dimensions) {
    // Initial values
    const styleValues = this.styleNode?.values || {}

    // Default value is auto
    let width = styleValues.width ?? 'auto'
    let marginLeft = styleValues['margin-left'] || styleValues.margin || 0
    let marginRight = styleValues['margin-right'] || styleValues.margin || 0

    let borderLeft = styleValues['border-left'] || styleValues.border || 0
    let borderRight = styleValues['border-right'] || styleValues.border || 0

    let paddingLeft = styleValues['padding-left'] || styleValues.padding || 0
    let paddingRight = styleValues['padding-right'] || styleValues.padding || 0

    // Get parent node's width, if any property is 'auto', set it to 0
    let totalWidth = sum(width, marginLeft, marginRight, borderLeft, borderRight, paddingLeft, paddingRight)
    // ...
}

If these CSS property values haven't been set, they will default to 0. We also need to compare if the current node's total width is equal to the parent node's width. If the width or margin property is set to auto, then we can adjust these properties to fit the available space. So now we need to check the current node's width.

const isWidthAuto = width === 'auto'
const isMarginLeftAuto = marginLeft === 'auto'
const isMarginRightAuto = marginRight === 'auto'

// If current block width exceeds parent width, set its expandable margins to 0
if (!isWidthAuto && totalWidth > parentWidth) {
    if (isMarginLeftAuto) {
        marginLeft = 0
    }

    if (isMarginRightAuto) {
        marginRight = 0
    }
}

// Adjust current element's width based on the difference between parent and child element widths
const underflow = parentWidth - totalWidth

// If all three values are set, fill the difference into marginRight
if (!isWidthAuto && !isMarginLeftAuto && !isMarginRightAuto) {
    marginRight += underflow
} else if (!isWidthAuto && !isMarginLeftAuto && isMarginRightAuto) {
    // If right margin is auto, set marginRight to the difference value
    marginRight = underflow
} else if (!isWidthAuto && isMarginLeftAuto && !isMarginRightAuto) {
    // If left margin is auto, set marginLeft to the difference value
    marginLeft = underflow
} else if (isWidthAuto) {
    // If only width is auto, set the other two values to 0
    if (isMarginLeftAuto) {
        marginLeft = 0
    }

    if (isMarginRightAuto) {
        marginRight = 0
    }

    if (underflow >= 0) {
        // Expand width to fill remaining space, original width was auto, calculated as 0
        width = underflow
    } else {
        // Width cannot be negative, so adjust marginRight instead
        width = 0
        marginRight += underflow
    }
} else if (!isWidthAuto && isMarginLeftAuto && isMarginRightAuto) {
    // If only marginLeft and marginRight are auto, set both to half of underflow
    marginLeft = underflow / 2
    marginRight = underflow / 2
}

The above code has shown the calculation details, and important parts have been commented.

By comparing the current node's width with the parent's width, we can get a difference value:

// Adjust current element's width based on the difference between parent and child element widths
const underflow = parentWidth - totalWidth

If the difference value is greater than 0, it indicates that the child nodes' width is less than the parent's width. If the difference value is less than 0, it indicates that the child nodes' width is greater than the parent's width. The above code logic uses these property values (underflow, width, padding, margin) to adjust child nodes' width and margin so they can fit within the parent's width.

Position

Calculating the current node's position information is relatively simpler. The method calculateBlockPosition() will locate the current node by calculating the margin, border, padding information of the current node along with the parent node's position information:

calculateBlockPosition(parentBlock: Dimensions) {
    const styleValues = this.styleNode?.values || {}
    const { x, y, height } = parentBlock.content
    const dimensions = this.dimensions

    dimensions.margin.top = transformValueSafe(styleValues['margin-top'] || styleValues.margin || 0)
    dimensions.margin.bottom = transformValueSafe(styleValues['margin-bottom'] || styleValues.margin || 0)

    dimensions.border.top = transformValueSafe(styleValues['border-top'] || styleValues.border || 0)
    dimensions.border.bottom = transformValueSafe(styleValues['border-bottom'] || styleValues.border || 0)

    dimensions.padding.top = transformValueSafe(styleValues['padding-top'] || styleValues.padding || 0)
    dimensions.padding.bottom = transformValueSafe(styleValues['padding-bottom'] || styleValues.padding || 0)

    dimensions.content.x = x + dimensions.margin.left + dimensions.border.left + dimensions.padding.left
    dimensions.content.y = y + height + dimensions.margin.top + dimensions.border.top + dimensions.padding.top
}

function transformValueSafe(val: number | string) {
    if (val === 'auto') return 0
    return parseInt(String(val))
}

For example, the following code shows how to calculate the x coordinate of the current node's content area:

dimensions.content.x = x + dimensions.margin.left + dimensions.border.left + dimensions.padding.left

Traverse Child Nodes

We must traverse child nodes before calculating the parent's height, because the parent's height depends on its child nodes' height.

layoutBlockChildren() {
    const { dimensions } = this
    for (const child of this.children) {
        child.layout(dimensions)
        // Calculate parent node's height after traversing child nodes
        dimensions.content.height += child.dimensions.marginBox().height
    }
}

Each node's height is the difference value of its top margin and bottom margin, so we can call marginBox() to get the height:

export default class Dimensions {
    content: Rect
    padding: EdgeSizes
    border: EdgeSizes
    margin: EdgeSizes

    constructor() {
        const initValue = {
            top: 0,
            right: 0,
            bottom: 0,
            left: 0,
        }

        this.content = new Rect()

        this.padding = { ...initValue }
        this.border = { ...initValue }
        this.margin = { ...initValue }
    }

    paddingBox() {
        return this.content.expandedBy(this.padding)
    }

    borderBox() {
        return this.paddingBox().expandedBy(this.border)
    }

    marginBox() {
        return this.borderBox().expandedBy(this.margin)
    }
}

export default class Rect {
    x: number
    y: number
    width: number
    height: number

    constructor() {
        this.x = 0
        this.y = 0
        this.width = 0
        this.height = 0
    }

    expandedBy(edge: EdgeSizes) {
        const rect = new Rect()
        rect.x = this.x - edge.left
        rect.y = this.y - edge.top
        rect.width = this.width + edge.left + edge.right
        rect.height = this.height + edge.top + edge.bottom

        return rect
    }
}

After traversing through child nodes and performing relative methods, we have got all child nodes' height, and then get the parent's height (the sum of child nodes' height).

Height Property

By default, a node's height is equal to its content's height. But if we have set the height property value manually, we need to set the node's height to the specified value:

calculateBlockHeight() {
    // If element has height property set, use that height; otherwise use height calculated by layoutBlockChildren()
    const height = this.styleNode?.values.height
    if (height) {
        this.dimensions.content.height = parseInt(height)
    }
}

For more simplicity, we don't need to implement collapsing margins.

Brief Summary

The layout tree is the most complex part of the rendering engine. After this phase, we understand each node's position and dimensional information in the layout tree. In the next phase, we need to study how to paint the layout tree onto the browser page.

Paint

The purpose of this phase is painting each node on the page according to the information of the layout tree. The most computers are using raster(bitmap) technology so far. The process of paint each node to page is called rasterization.

Browsers usually use various graphics APIs and libraries (such as Skia, Cairo, Direct2D and others) to implement rasterization. These apis provide a lot of features such as paint Polygons, lines, curves, gradients and text.

Actually, painting is the most complex part, but by using the canvas library, we don't need to implement rasterization ourselves, which simplifies the painting implementation. Before start to the paint phase, we first study some basic knowledge of how to paint image and text, it help us to understand what the implementation process of rasterization.

How Computer Paint Image And Text

Painting pixel is belong to the low operation of computer, it depends on the detail of screen and gpu api. For simplicity, we can use a section of memory to indicates the screen, the each bit of memory indicates a pixel of screen. For example, if painting a pixel on the (x,y) of the screen, we can use memory[x + y * rowSize] = 1 to indicate. Starting from top-left of the screen, the column calculation is left to right, the row calculation is top to bottom. So the coordinate of the top left corner is (0,0).

For simplicity, we use one bit to indicates a pixel of screen, 0 is represent white, 1 is represent black. The each row length of screen is represented with rowSize, the each column height of screen is represented with colSize.

Painting Line

If we need to paint a line, we just need to know the start point (x1,y1) and the end point (x2,y2).

And then set the area memory values from (x1,y1) to (x2,y2) as 1 according to memory[x + y * rowSize] = 1 formula, then we have drawed a line.

Painting Character

For paint text on the screen, we first divide the screen to several area according to the logic character, each area can output a single completely character. Assume that we have a screen with 256 rows and 512 column, if we assign each character with a 11*8 pixel's grid, then the screen can show 23 rows, each row contains 64 characters(it also have 3 pixels not to used).

According to the above precondition, now we plan to draw a A:

The picture of A is represented by a 11*8 pixels grid. For show it in the memory, we can use a two-dimensional array to indicate:

const charA = [
    [0, 0, 1, 1, 0, 0, 0, 0], // 按从左至右的顺序来读取 bit，转换成十进制数字就是 12
    [0, 1, 1, 1, 1, 0, 0, 0], // 30
    [1, 1, 0, 0, 1, 1, 0, 0], // 51
    [1, 1, 0, 0, 1, 1, 0, 0], // 51
    [1, 1, 1, 1, 1, 1, 0, 0], // 63
    [1, 1, 0, 0, 1, 1, 0, 0], // 51
    [1, 1, 0, 0, 1, 1, 0, 0], // 51
    [1, 1, 0, 0, 1, 1, 0, 0], // 51
    [1, 1, 0, 0, 1, 1, 0, 0], // 51
    [0, 0, 0, 0, 0, 0, 0, 0], // 0
    [0, 0, 0, 0, 0, 0, 0, 0], // 0
]

The first item of the two-dimensional array indicates each bit's value of the first row in the memory. There are a total of 11 rows to draw a A.

Painting Layout Tree

After the basic knowledge of paint screen of popular science, now we start to painting layout tree(for convenience, we using node-canvas library).

First, we need to traverse the entire layout tree, and then paint they one by one:

function renderLayoutBox(layoutBox: LayoutBox, ctx: CanvasRenderingContext2D, parent?: LayoutBox) {
    renderBackground(layoutBox, ctx)
    renderBorder(layoutBox, ctx)
    renderText(layoutBox, ctx, parent)
    for (const child of layoutBox.children) {
        renderLayoutBox(child, ctx, layoutBox)
    }
}

The function will traverse each node and then paint background, border, text in turn, and then paint all child nodes recursively.

By default, we will paint HTML elements according to their order in the layout tree. If two elements overlap, we need to paint the next element on the previous element. The sort way also represented in the layout tree, it will paint a element according to its order in the layout tree.

Painting Background Color

function renderBackground(layoutBox: LayoutBox, ctx: CanvasRenderingContext2D) {
    const { width, height, x, y } = layoutBox.dimensions.borderBox()
    ctx.fillStyle = getStyleValue(layoutBox, 'background')
    ctx.fillRect(x, y, width, height)
}

First, we need to get the position and dimension information of the layout node, starting from x,y, paint the rectangle area. And fill the rectangle according to the value of the CSS property background.

Painting Border

function renderBorder(layoutBox: LayoutBox, ctx: CanvasRenderingContext2D) {
    const { width, height, x, y } = layoutBox.dimensions.borderBox()
    const { left, top, right, bottom } = layoutBox.dimensions.border
    const borderColor = getStyleValue(layoutBox, 'border-color')
    if (!borderColor) return

    ctx.fillStyle = borderColor

    // left
    ctx.fillRect(x, y, left, height)
    // top
    ctx.fillRect(x, y, width, top)
    // right
    ctx.fillRect(x + width - right, y, right, height)
    // bottom
    ctx.fillRect(x, y + height - bottom, width, bottom)
}

Actually, Painting border is painting four rectangles, each rectangle is a border.

Painting Text

function renderText(layoutBox: LayoutBox, ctx: CanvasRenderingContext2D, parent?: LayoutBox) {
    if (layoutBox.styleNode?.node.nodeType === NodeType.Text) {
        // get AnonymousBlock x y
        const { x = 0, y = 0, width } = parent?.dimensions.content || {}
        const styles = layoutBox.styleNode?.values || {}
        const fontSize = styles['font-size'] || '14px'
        const fontFamily = styles['font-family'] || 'serif'
        const fontWeight = styles['font-weight'] || 'normal'
        const fontStyle = styles['font-style'] || 'normal'

        ctx.fillStyle = styles.color
        ctx.font = `${fontStyle} ${fontWeight} ${fontSize} ${fontFamily}`
        ctx.fillText(layoutBox.styleNode?.node.nodeValue, x, y + parseInt(fontSize), width)
    }
}

By the fillText() method of canvas, we can very convenient to paint text with different font style, size, color.

Outputting Image

After painting completely, we can output images with the help of the canvas apis. Let's show it with the following example:

<html>
    <body id=" body " data-index="1" style="color: red; background: yellow;">
        <div>
            <div class="lightblue test">test1!</div>
            <div class="lightblue test">
                <div class="foo">foo</div>
            </div>
        </div>
    </body>
</html>

* {
    display: block;
}

div {
    font-size: 14px;
    width: 400px;
    background: #fff;
    margin-bottom: 20px;
    display: block;
    background: lightblue;
}

.lightblue {
    font-size: 16px;
    display: block;
    width: 200px;
    height: 200px;
    background: blue;
    border-color: green;
    border: 10px;
}

.foo {
    width: 100px;
    height: 100px;
    background: red;
    color: yellow;
    margin-left: 50px;
}

body {
    display: block;
    font-size: 88px;
    color: #000;
}

The above HTML, CSS codes will output a image by parse with a rendering engine:

Summary

So far, we have completed our tiny rendering engine. While it may not be practical for real use, implementing it helps us understand how real rendering engines work, making it valuable from a learning perspective.

Introduction

HTML parser

Node Types

HTML Parser Execution Process

Detailed Implementation of HTML Parser Methods

Entry point parse()

Parse Element Node parseElement()

Parse Text Node parseText()

Parse Tag parseTag()

Parse Attribute Nodes parseAttrs()

Helper Methods

Brief summary

CSS Parser

Parse CSS Rule parseRule()

Parse Selector parseSelector()

Parse CSS Properties parseDeclaration()

Brief Summary

Build Style Tree

Traverse DOM Tree

Match Selector

Inherit Property

Inline Style

Layout Tree

CSS Box Model

Block Layout and Inline Layout

Layout Tree

Traverse Layout Tree

Calculating Width

Position

Traverse Child Nodes

Height Property

Brief Summary

Paint

How Computer Paint Image And Text

Painting Line

Painting Character

Painting Layout Tree

Painting Background Color

Painting Border

Painting Text

Outputting Image

Summary

Reference Material

Entry point `parse()`

Parse Element Node `parseElement()`

Parse Text Node `parseText()`

Parse Tag `parseTag()`

Parse Attribute Nodes `parseAttrs()`

Parse CSS Rule `parseRule()`

Parse Selector `parseSelector()`

Parse CSS Properties `parseDeclaration()`