š Hey Everyone!
If you've ever built a document processing pipeline in n8n, you probably know the exact headache we've been dealing with for the last year:
You use a standard OCR node, it dumps a massive wall of raw text, and you spend hours writing custom Code nodes filled with fragile Regex just to find the "Total Amount" on an invoice. Or, you try to use heavy cloud tools like AWS Textract, and you get bogged down in complex setups and unpredictable token pricing.
We got tired of our workflows breaking every time a vendor changed their invoice layout, so we built a dedicated data extraction platform specifically for n8n workflows and just released our official verified community node.
š ļø What it does
It essentially acts as a pre-processor for your document pipelines. You feed it a binary file (PDFs, invoices, receipts, etc.), and it returns reliable, structured JSON output matching the exact schema you define. No AI hallucinations breaking your downstream nodes, and zero post-processing required.
⨠Key Features for Builders
Simple Setup: If you're on n8n Cloud, just search for "easybits Extractor" in the node panel ā it's available out of the box, no installation needed. Self-hosted users: go to Settings ā Community Nodes ā Install and enter
@easybits/n8n-nodes-extractorBinary In, Clean JSON Out: Replaces massive, fragile Code nodes. The node reads binary file attachments directly from any upstream node ā a Telegram Trigger, Gmail Trigger, n8n Form with file upload, or an HTTP Request that fetches a document. You define your schema once on our platform, and we return the exact structured data you need under json.data.
Per-Field Prompting: Instead of one generic prompt for the whole document, you write a specific description for each data point you want to extract. This gives the model precise instructions per field ā like telling it exactly what "billing period" means on your invoices versus guessing from context. The more specific your field descriptions, the more accurate your results.
Transparent Pricing: 1 request = 1 request. No complex token conversion math, and no guessing what your pipeline will cost. Our free plan includes 50 API requests per month ā enough to build and test your workflows before committing.
Zero Maintenance: We handle all the AI model updates in the background. If a vendor completely changes their invoice layout, your n8n workflow keeps running without you needing to fix anything.
š How to get started
- n8n Cloud: Search for "easybits Extractor" in the node panel ā it's ready to use immediately.
-
Self-hosted: Go to Settings ā Community Nodes ā Install ā enter
@easybits/n8n-nodes-extractor. - Set up your extraction schema for free at easybits.tech.
- Drop the node into your canvas and connect it to any node that outputs a file!
For more details on configuration options and input types, check out our n8n integration guide.
š Put your current setup to the test
We've been running this in production for a while now, and we're confident it holds up ā but we'd love for you to challenge it. If you have a gnarly, multi-page document that keeps breaking your current extraction setup, run it through easybits and see how it compares.
Let us know what you think, what features you'd like to see next, or if you run into any edge cases.
Happy building!
Felix

Top comments (0)