DEV Community

# ocr

Posts

What is OCR Pre-fill?

Comments
2 min read
Building an AI-Powered Passport Scanner with MRZ, OCR, and Face Detection in JavaScript

Comments
9 min read
How can I determine the position of a text string on the screen?

Comments
2 min read
Leveraging Dynamic Web TWAIN's New OCR API for Modern Document Management

Comments
5 min read
What is due diligence for IDP and why is it important?

Comments
5 min read
Building an Event-Driven OCR Service: Challenges and Solutions

Comments
5 min read
Stop Typing That Image Text: PaddleOCR Makes AI-Powered Text Extraction Effortless

Comments
3 min read
What is Intelligent Document Processing?

Comments
3 min read
Optical Clear Adhesive (OCA): Why It Matters in Modern Display Assembly

Comments
4 min read
2025 Complete Guide: How to Build End-to-End OCR with HunyuanOCR

1
Comments
6 min read
Paddle OCR-VL & DeepSeek-OCR

Comments
2 min read
DeepSeek OCR in Automation Pipelines: Practical Engineering Insights and Integration Patterns

33
Comments 8
4 min read
DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens

7
Comments
9 min read
I am building a document api suite that gives you coordinates for every answer

Comments
1 min read
Complete Guide 2025: How DeepSeek OCR Reduces AI Costs by 20x Through "Visual Compression"

1
Comments
10 min read
PaddleOCR: Revolutionizing OCR with AI-Powered Document Understanding

1
Comments
3 min read
Kreuzberg: Revolutionizing Document Intelligence in Python

1
Comments
3 min read
2025 Complete Guide: PaddleOCR-VL-0.9B — Baidu's Ultra-Lightweight Document Parsing Powerhouse

6
Comments 1
9 min read
Farsi Image generator

Comments
2 min read
Building Purchase Tracker: The MVP That Eats Your Receipts (So You Don’t Have To)

Comments
2 min read
Step-by-Step Guide to Translating Documents Online Without Breaking Formatting

Comments
3 min read
The OCR Model That Outranks GPT-4o

5
Comments 1
16 min read
Generating Synthetic RTL OCR Data for Donut with SynthDoG-RTL

1
Comments
2 min read
Major Challenges in Document Processing & How AI Solves Them | 2025 Guide

Comments
4 min read
NuMarkdown-8B-Thinking: The Open-Source Reasoning OCR that Converts PDFs to Auditable Markdown for Enterprise RAG Pipelines

Comments
10 min read