DEV Community

Kreuzberg

The fastest document intelligence engine for RAG developers.

Organization Settings Admin

Kreuzberg is an open-source polyglot document intelligence framework with a fast Rust core. We build tools that help developers extract, process, and understand documents at scale, in 97+ formats.

Why AI Agents Need Structured Code Intelligence (And How to Stop Managing Parsers)

Why AI Agents Need Structured Code Intelligence (And How to Stop Managing Parsers)

Comments
5 min read
Beyond the Model: Why Document Intelligence Is the Next AI Infrastructure Layer

Beyond the Model: Why Document Intelligence Is the Next AI Infrastructure Layer

Comments
4 min read
The Haystack converter that handles 91+ file formats without a Cloud API

The Haystack converter that handles 91+ file formats without a Cloud API

Comments
7 min read
Document Structure Extraction with Kreuzberg

Document Structure Extraction with Kreuzberg

Comments
7 min read
BM25 + Vector Search in One Query: kreuzberg-surrealdb + SurrealDB v3

BM25 + Vector Search in One Query: kreuzberg-surrealdb + SurrealDB v3

4
Comments
8 min read
How to Extract Text from PDF in Python (2026)

How to Extract Text from PDF in Python (2026)

Comments
5 min read
Kreuzberg vs. Unstructured.io: Benchmarks and Architecture Comparison (March 2026)

Kreuzberg vs. Unstructured.io: Benchmarks and Architecture Comparison (March 2026)

Comments
6 min read
Building a RAG pipeline with Kreuzberg and LangChain

Building a RAG pipeline with Kreuzberg and LangChain

1
Comments
6 min read
loading...