DEV Community: Hani Amro

Generate PDF invoices from HTML in Python — without a headless browser

Hani Amro — Wed, 24 Jun 2026 09:30:19 +0000

Almost every app eventually has to turn HTML into a PDF: invoices, receipts, reports, tickets, certificates. The reflex is "spin up Puppeteer / headless Chrome." Then you inherit:

a ~300 MB Chromium in every container,
--no-sandbox and font headaches,
Chrome crashing under load and memory leaks,
slow cold starts that make serverless painful.

You wanted a PDF. You got a browser to babysit. For HTML + CSS — which is what invoices are — you don't need a browser. Here's the lighter way.

WeasyPrint: HTML/CSS → PDF, no browser
A real, styled invoice
The margins / sizing gotcha
Multi-page reports: page numbers + headers
When you DO still need a browser
Prefer not to host it yourself?
Wrap-up

WeasyPrint turns HTML and CSS into PDF without a browser

pip install weasyprint
# system libs (Debian/Ubuntu, WeasyPrint 53+): apt install libpango-1.0-0 libpangoft2-1.0-0

from weasyprint import HTML

HTML(string="<h1>Invoice #1042</h1><p>Total: $99</p>").write_pdf("invoice.pdf")

That's the whole dependency story. No Chromium, no sandbox flags.

A real, styled invoice

from weasyprint import HTML

INVOICE = """
<!doctype html><html><head><meta charset="utf-8"><style>
  body { font-family: Arial, sans-serif; color:#1a1a1a; }
  .head { display:flex; justify-content:space-between; }
  h1 { color:#2563eb; margin:0; }
  table { width:100%; border-collapse:collapse; margin-top:24px; }
  th,td { padding:10px 8px; border-bottom:1px solid #e5e7eb; text-align:left; }
  th { background:#f8fafc; }
  .right { text-align:right; }
  .total { font-size:1.2em; font-weight:700; }
</style></head><body>
  <div class="head">
    <div><h1>Invoice</h1><div>#1042 · 23 Jun 2026</div></div>
    <div class="right"><strong>Acme Corp</strong><br>billing@acme.com</div>
  </div>
  <table>
    <tr><th>Item</th><th class="right">Qty</th><th class="right">Price</th></tr>
    <tr><td>API plan — Pro</td><td class="right">1</td><td class="right">$12.00</td></tr>
    <tr><td>Overage (320 req)</td><td class="right">320</td><td class="right">$0.96</td></tr>
    <tr><td class="right total" colspan="2">Total</td><td class="right total">$12.96</td></tr>
  </table>
</body></html>
"""
HTML(string=INVOICE).write_pdf("invoice.pdf")

You control every pixel with normal CSS — no template-engine lock-in.
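
In a real app the line items come from your data rather than a hard-coded string. Here is a minimal sketch of that step, assuming hypothetical helpers named render_invoice_rows and render_invoice_pdf and a list of (description, quantity, line total) tuples; none of these names come from WeasyPrint. It escapes text before it reaches the markup and sums the total with Decimal so float rounding never shows up on the invoice:

```python
from decimal import Decimal
from html import escape

# Hypothetical line items: (description, quantity, line total in dollars).
ITEMS = [
    ("API plan (Pro)", 1, Decimal("12.00")),
    ("Overage (320 req)", 320, Decimal("0.96")),
]


def render_invoice_rows(items):
    """Return (table rows as HTML, grand total) for a list of line items."""
    rows = []
    total = Decimal("0")
    for description, qty, line_total in items:
        total += line_total
        rows.append(
            "<tr>"
            f"<td>{escape(description)}</td>"
            f'<td class="right">{qty}</td>'
            f'<td class="right">${line_total:.2f}</td>'
            "</tr>"
        )
    return "\n".join(rows), total


def render_invoice_pdf(items, path):
    """Write a bare-bones invoice PDF; reuse the <style> block above for the full look."""
    from weasyprint import HTML  # imported lazily so the helper above works without it

    rows_html, total = render_invoice_rows(items)
    document = (
        '<!doctype html><html><head><meta charset="utf-8"></head><body>'
        "<h1>Invoice</h1><table>"
        "<tr><th>Item</th><th>Qty</th><th>Price</th></tr>"
        f"{rows_html}"
        f'<tr><td colspan="2">Total</td><td>${total:.2f}</td></tr>'
        "</table></body></html>"
    )
    HTML(string=document).write_pdf(path)
```

Calling render_invoice_pdf(ITEMS, "invoice.pdf") should produce the same totals as the static example, just without the styling.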

The gotcha everyone hits with margins and sizing

Two complaints come up constantly with HTML→PDF:

Huge side margins → that's the page margin, not your HTML.
Content looks shrunk/tiny → the converter is doing "shrink to fit" because your layout is wider than the page.

Both are fixed with a CSS @page rule — the bit most online converters silently ignore:

@page {
  size: A4;          /* or Letter, A5, or explicit dimensions: 210mm 297mm (no quotes) */
  margin: 12mm;      /* exactly what you want; margin: 0 for edge-to-edge */
}

For the shrinking problem, make sure your content width fits the page (use mm/relative units, not a fixed 1200px container) — then nothing gets scaled down.
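
The arithmetic behind that advice is easy to check. A small sketch, assuming the CSS definition of 96px per inch and the A4 page with 12mm margins from the rule above; the helper names are made up for illustration:

```python
PX_PER_INCH = 96     # CSS defines 1in as exactly 96px
MM_PER_INCH = 25.4


def px_to_mm(px):
    """Convert a CSS pixel length to millimetres."""
    return px * MM_PER_INCH / PX_PER_INCH


def content_width_mm(page_width_mm, margin_mm):
    """Printable width once the left and right page margins are removed."""
    return page_width_mm - 2 * margin_mm


def fits_on_page(container_px, page_width_mm=210, margin_mm=12):
    """True if a fixed-width container fits inside the printable area."""
    return px_to_mm(container_px) <= content_width_mm(page_width_mm, margin_mm)


available = content_width_mm(210, 12)   # A4 portrait, 12mm margins
container = px_to_mm(1200)              # the fixed 1200px wrapper
print(f"available {available:.1f} mm, container {container:.1f} mm")
print("fits" if fits_on_page(1200) else "too wide for the page")
```

A 1200px container works out to roughly 317.5mm against 186mm of printable width, which is why the layout either overflows or gets scaled down.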

Multi-page reports with page numbers and repeating headers

For anything longer than a page, @page gives you running page numbers in pure CSS — no JS, no manual pagination:

@page {
  size: A4;
  margin: 20mm 15mm;
  @bottom-center { content: "Page " counter(page) " of " counter(pages); }
}
thead { display: table-header-group; }  /* repeats the header row on every page, provided that row sits inside a <thead> */
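
To see both rules working together you need a table long enough to break across pages. A minimal sketch, assuming a hypothetical build_report_html helper and made-up usage rows; the header row is wrapped in <thead> so it can repeat:

```python
from html import escape

REPORT_CSS = """
@page {
  size: A4;
  margin: 20mm 15mm;
  @bottom-center { content: "Page " counter(page) " of " counter(pages); }
}
table { width: 100%; border-collapse: collapse; }
th, td { padding: 6px 8px; border-bottom: 1px solid #e5e7eb; text-align: left; }
"""


def build_report_html(headers, rows):
    """Build a one-table report whose header row lives inside <thead>."""
    head = "".join(f"<th>{escape(str(h))}</th>" for h in headers)
    body = "\n".join(
        "<tr>" + "".join(f"<td>{escape(str(cell))}</td>" for cell in row) + "</tr>"
        for row in rows
    )
    return (
        '<!doctype html><html><head><meta charset="utf-8">'
        f"<style>{REPORT_CSS}</style></head><body><table>"
        f"<thead><tr>{head}</tr></thead>"
        f"<tbody>\n{body}\n</tbody></table></body></html>"
    )


# 200 made-up rows, enough to spill over several A4 pages.
report = build_report_html(
    ["Date", "Request ID", "Cost"],
    [(f"2026-06-{1 + i % 30:02d}", f"req-{i:04d}", "$0.003") for i in range(200)],
)
# With WeasyPrint installed: HTML(string=report).write_pdf("report.pdf")
```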

Already have a Word or Excel template?

Don't rebuild it in HTML — convert it with headless LibreOffice:

soffice --headless --convert-to pdf --outdir out/ invoice.docx

Call it via subprocess; give each call its own -env:UserInstallation=file:///tmp/lo_xyz dir so concurrent runs don't collide.
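
A sketch of that subprocess call, assuming soffice is on PATH; the function names and the 120-second timeout are illustrative choices, not anything LibreOffice prescribes. Each conversion gets a throwaway profile directory that is deleted afterwards:

```python
import subprocess
import tempfile
from pathlib import Path


def build_soffice_command(source, outdir, profile_dir):
    """Build the soffice argv with a private user profile for this run."""
    return [
        "soffice",
        f"-env:UserInstallation={Path(profile_dir).resolve().as_uri()}",
        "--headless",
        "--convert-to", "pdf",
        "--outdir", str(outdir),
        str(source),
    ]


def convert_to_pdf(source, outdir, timeout=120):
    """Convert one Office file to PDF and return the expected output path."""
    with tempfile.TemporaryDirectory(prefix="lo_") as profile_dir:
        subprocess.run(
            build_soffice_command(source, outdir, profile_dir),
            check=True,       # raise if LibreOffice exits with an error
            timeout=timeout,  # avoid hanging forever on a bad document
        )
    # LibreOffice names the output after the input file's stem.
    return Path(outdir) / (Path(source).stem + ".pdf")
```

Usage would look like convert_to_pdf("invoice.docx", "out/"). Because the profile directory is unique per call, several conversions can run in parallel without sharing a profile.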

When you DO still need a real browser

Honest caveat: WeasyPrint doesn't run JavaScript. If your document only exists after client-side JS draws it (a heavy charting lib, a SPA view), you still need headless Chrome for that one case. For static HTML + CSS, which covers the large majority of invoices, reports, and receipts, WeasyPrint is lighter to ship, quicker to start, and exposes less attack surface than a full browser.
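
If you accept HTML from several sources, a cheap pre-check can route documents to the right renderer. This is a rough heuristic sketch, not a guarantee: it only looks for <script> tags, and a page can contain a script it does not need for layout. The names ScriptDetector and needs_browser are hypothetical:

```python
from html.parser import HTMLParser


class ScriptDetector(HTMLParser):
    """Records whether the document contains any <script> tag."""

    def __init__(self):
        super().__init__()
        self.uses_script = False

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this hook.
        if tag == "script":
            self.uses_script = True


def needs_browser(html):
    """Heuristic: treat any document with a <script> tag as JS-rendered."""
    detector = ScriptDetector()
    detector.feed(html)
    detector.close()
    return detector.uses_script
```

Documents that come back False can go straight to WeasyPrint; the rest are candidates for the headless-browser path.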

Prefer not to host it at all?

Running WeasyPrint means installing Pango and its native dependencies and keeping them patched. If it's one feature among many, offload it to an HTTP call. I maintain a flat-priced PDF API where html-to-pdf is one endpoint (SSRF-hardened, meaning internal URLs are blocked; the same key also does merge, OCR, Office→PDF, etc.). Free tier is 1,000 requests/month, no card. Disclosure: I built it.

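import requests

resp = requests.post(
    "https://pdf-tools-api2.p.rapidapi.com/html-to-pdf",
    headers={"X-RapidAPI-Key": "YOUR_KEY",
             "X-RapidAPI-Host": "pdf-tools-api2.p.rapidapi.com"},
    data={"html": INVOICE, "page_size": "A4"},  # INVOICE is the HTML string from earlier
    timeout=60,
)
resp.raise_for_status()  # fail on 4xx/5xx instead of saving an error body

# Assumption: the response body is the PDF itself
with open("invoice.pdf", "wb") as out:
    out.write(resp.content)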

Wrap-up

You don't need to ship a browser to make a PDF. For HTML + CSS, WeasyPrint renders invoices and reports in one call, @page gives you exact margins and page numbers, and LibreOffice covers Word/Excel templates. Save headless Chrome for the rare JS-rendered case.

Your turn: what are you generating — invoices, reports, labels? And what bit you: margins, fonts, or page breaks? 👇

Try the PDF API free — 1,000 requests/month, no card

Built and maintained by a solo developer (based in Syria) who actually answers. Questions welcome in the comments!

Arabic OCR with an API: Make Scanned Arabic PDFs Searchable (Python)

Hani Amro — Tue, 23 Jun 2026 11:25:30 +0000

If you've ever tried to extract text from a scanned Arabic document, you already know the pain. Most OCR tooling is built English-first. Arabic adds three problems on top:

Right-to-left (RTL) text that breaks naive layout assumptions.
Connected letters (ligatures) — the same letter changes shape depending on its position in the word.
Diacritics and a different numeral set that generic models drop or mangle.

The result: you run a scanned Arabic contract, invoice, or government form through a typical "PDF to text" tool and get back garbage — reversed words, missing letters, or nothing at all.

This post shows a practical way to turn a scanned Arabic PDF into a searchable PDF (a real, selectable text layer underneath the original page image) with a single API call — no ML pipeline to build, no GPU, no model weights to host. Code is in Python, cURL, and JavaScript.

What "searchable PDF" actually means
The approach
Tips for better Arabic OCR results
Honest limitations
Why an API instead of self-hosting Tesseract
Pricing
Wrap-up

What "searchable PDF" actually means

There are two different things people call "OCR":

Text extraction — you get back a string of the recognized text.
Searchable PDF — you get back a PDF that looks identical to the scan, but now has an invisible text layer, so Ctrl+F, copy-paste, and indexing all work.

The second is what most real workflows need: you keep the original document exactly as scanned (important for legal/official docs), but it becomes searchable and accessible. That's what we'll produce here.

The approach

We'll use the PDF Tools API /ocr endpoint. Under the hood it runs Tesseract with the Arabic (ara) and English (eng) language models and rebuilds the PDF with an invisible OCR text layer. The relevant detail for us: you can pass lang=eng+ara to recognize mixed Arabic/English documents in one pass, which is what most real MENA paperwork actually is (Arabic body text, English brand names, Western digits).

You'll need a free API key from the listing (the free tier is 1,000 requests/month, no card). Then:

Python

import requests

API_KEY = "YOUR_RAPIDAPI_KEY"
HOST = "pdf-tools-api2.p.rapidapi.com"

with open("arabic_scan.pdf", "rb") as f:
    resp = requests.post(
        f"https://{HOST}/ocr",
        headers={"X-RapidAPI-Key": API_KEY, "X-RapidAPI-Host": HOST},
        files={"file": ("arabic_scan.pdf", f, "application/pdf")},
        data={"lang": "eng+ara"},   # mixed Arabic + English
    )
resp.raise_for_status()

with open("searchable.pdf", "wb") as out:
    out.write(resp.content)

print("Done — searchable.pdf now has a real text layer.")

Open searchable.pdf and try selecting the Arabic text or searching it. It's there now.

cURL

curl -X POST "https://pdf-tools-api2.p.rapidapi.com/ocr" \
  -H "X-RapidAPI-Key: YOUR_RAPIDAPI_KEY" \
  -H "X-RapidAPI-Host: pdf-tools-api2.p.rapidapi.com" \
  -F "file=@arabic_scan.pdf" \
  -F "lang=eng+ara" \
  --output searchable.pdf

JavaScript (Node / browser)

// fileInput is an <input type="file"> element; in Node, append a Blob built from the file's bytes instead
const form = new FormData();
form.append("file", fileInput.files[0]);
form.append("lang", "eng+ara");

const res = await fetch("https://pdf-tools-api2.p.rapidapi.com/ocr", {
  method: "POST",
  headers: {
    "X-RapidAPI-Key": "YOUR_RAPIDAPI_KEY",
    "X-RapidAPI-Host": "pdf-tools-api2.p.rapidapi.com",
  },
  body: form,
});
if (!res.ok) throw new Error(`OCR request failed: ${res.status}`);
const blob = await res.blob(); // application/pdf, now searchable

// Browser: download the searchable PDF
const url = URL.createObjectURL(blob);
const a = Object.assign(document.createElement("a"), { href: url, download: "searchable.pdf" });
a.click();
URL.revokeObjectURL(url);

Just need the raw text instead of a searchable PDF?

If you only want the extracted string (for a database, a search index, an LLM pipeline), run the searchable PDF through /extract-text:

# Reuses API_KEY and HOST from the Python example above
with open("searchable.pdf", "rb") as f:  # close the file handle when done
    resp = requests.post(
        f"https://{HOST}/extract-text",
        headers={"X-RapidAPI-Key": API_KEY, "X-RapidAPI-Host": HOST},
        files={"file": ("searchable.pdf", f, "application/pdf")},
    )
resp.raise_for_status()
print(resp.json()["text"])
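
If the text is headed for a search index, it usually helps to index a normalized copy alongside the original. A minimal sketch, assuming you want diacritics (tashkeel) and tatweel removed and Arabic-Indic digits mapped to ASCII; normalize_for_search is a hypothetical helper, and whether this normalization suits your data is a judgment call:

```python
# Arabic-Indic (U+0660..U+0669) and Extended Arabic-Indic (U+06F0..U+06F9) digits -> ASCII
DIGIT_MAP = {0x0660 + i: str(i) for i in range(10)}
DIGIT_MAP.update({0x06F0 + i: str(i) for i in range(10)})

# Tashkeel marks (U+064B..U+0652) and tatweel (U+0640) are deleted outright
STRIP_MAP = {codepoint: None for codepoint in range(0x064B, 0x0653)}
STRIP_MAP[0x0640] = None

NORMALIZE_TABLE = {**DIGIT_MAP, **STRIP_MAP}


def normalize_for_search(text):
    """Return a copy of text with diacritics removed and digits in ASCII."""
    return text.translate(NORMALIZE_TABLE)
```

Keep the original string for display and run normalize_for_search on both the indexed text and the user's query, so a search typed without diacritics still matches.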

Tips for better Arabic OCR results

OCR quality depends mostly on the input scan, not the engine. To get clean output:

Scan at 300 DPI or higher. Below ~200 DPI, connected Arabic letters blur together.
Deskew crooked scans before sending. Even 2–3° of rotation hurts RTL recognition.
Use eng+ara, not ara alone, for any document that mixes Latin characters (almost all real-world ones do).
Keep it under 15 pages per request (split larger docs first — there's a /split endpoint).
Black-on-white beats colored backgrounds; if your scan is noisy, cleaning it up to plain black text on a white background is the biggest quality lever.
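
For the page limit, the ranges to split on are simple to compute up front. A sketch assuming the limit is 15 pages inclusive; page_batches is a hypothetical helper, and the actual /split request is left out because its parameters aren't covered here:

```python
def page_batches(total_pages, max_pages=15):
    """Yield 1-based inclusive (first, last) page ranges of at most max_pages."""
    if total_pages < 0 or max_pages < 1:
        raise ValueError("total_pages must be >= 0 and max_pages >= 1")
    for first in range(1, total_pages + 1, max_pages):
        yield first, min(first + max_pages - 1, total_pages)


# A 40-page scan becomes three OCR requests.
for first, last in page_batches(40):
    print(f"pages {first}-{last}")
```

Each range maps to one split output and one /ocr request.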

Honest limitations

This is Tesseract-based OCR, not a frontier vision model. It's excellent for printed Arabic (forms, contracts, books, invoices). It is not built for handwritten Arabic, heavily stylized calligraphy, or low-resolution phone photos — accuracy drops sharply there, same as every OCR engine. For clean printed scans it's genuinely good and, importantly, it's available — which is more than most PDF APIs can say for Arabic at all.

Why an API instead of self-hosting Tesseract

You can apt install tesseract-ocr-ara and wire up the PDF rebuild yourself. People do. But you then own:

installing and updating Tesseract + the Arabic language data,
the rasterize → OCR → re-embed-text-layer pipeline (the fiddly part),
font/encoding edge cases for the invisible RTL text layer,
scaling it without melting your server on a 15-page scan.
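
For a sense of scale, here is a rough sketch of just the first two stages, assuming pdftoppm (from poppler-utils) and tesseract are installed; the function names are illustrative. It produces one searchable PDF per page and stops there: merging the pages back together and handling the encoding edge cases are not shown.

```python
import subprocess
from pathlib import Path


def rasterize_command(pdf_path, image_prefix, dpi=300):
    """pdftoppm writes one PNG per page, named <image_prefix>-<page>.png."""
    return ["pdftoppm", "-r", str(dpi), "-png", str(pdf_path), str(image_prefix)]


def ocr_command(image_path, output_base, lang="eng+ara"):
    """Tesseract's pdf output writes <output_base>.pdf with an invisible text layer."""
    return ["tesseract", str(image_path), str(output_base), "-l", lang, "pdf"]


def ocr_pages(pdf_path, workdir):
    """Rasterize a scanned PDF and OCR each page; returns the per-page PDF paths."""
    workdir = Path(workdir)
    workdir.mkdir(parents=True, exist_ok=True)
    subprocess.run(rasterize_command(pdf_path, workdir / "page"), check=True)
    outputs = []
    for image in sorted(workdir.glob("page-*.png")):
        base = image.with_suffix("")  # tesseract appends .pdf itself
        subprocess.run(ocr_command(image, base), check=True)
        outputs.append(base.with_suffix(".pdf"))
    return outputs
```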

If Arabic OCR is core to your product, self-hosting is fine. If it's one feature among many, one HTTP call you can put in a spreadsheet beats a maintenance project.

Pricing, briefly

The API is flat per-request — one OCR call is one request, whether it's a 1-page or 15-page scan. No credit tables, no per-page billing (iLovePDF, for comparison, charges OCR per page in credits). Free tier is 1,000 requests/month, permanently, no card. The same key also does merge, split, compress, encrypt, HTML→PDF, Office→PDF, redaction, and table extraction — 26 endpoints total.

Wrap-up

Arabic OCR has a reputation for being painful, and self-hosting it is. But for printed documents, turning a scanned Arabic PDF into a searchable one is now a single API call with lang=eng+ara. If you're digitizing Arabic archives, building a MENA document-management product, or just need Ctrl+F to work on a scanned contract, this gets you there in five minutes.

Your turn: what trips you up most with Arabic OCR — RTL layout, connected-letter ligatures, or diacritics getting dropped? And what are you digitizing: contracts, old books, or handwritten notes? Tell me in the comments. 👇

Try the Arabic OCR API free — 1,000 requests/month, no card

Built and maintained by a solo developer (based in Syria) who actually answers — questions welcome in the comments.
