Optical Character Recognition (OCR)

#python

What is OCR?

Optical Character Recognition is a technology that allows text to be scanned from things like a document or image. An example of this is when you take a picture on your smartphone and you can pull the text straight out. OCR in Python is a helpful tool is pulling text out of an upload document such as a pdf. A commonly used library in Python is Pytesseract.

What is Pytesseract?

Pytesseract is an open-source library used for OCR. It is a wrapper for the Tesseract-OCR engine. Lets look at an example of applying Pytesseract.

First, we need to import the necessary libraries

from PIL import Image
import pyTesseract
import numpy as np

This will be the example image I will be using:

Next, we will pull the text out of the image:

file = './images-1.png'

img = Image.open(file)
txt = pytesseract.image_to_string(img)

print(txt)

The results is as follows:

bash-3.2$ python3 example.py 

Arial

While this was just a simple example, pytesseract and other libraries used for OCR can make life so much simpler for programmers needing to pull text out of images.

DEV Community

Optical Character Recognition (OCR)

What is OCR?

What is Pytesseract?

Top comments (0)

Read next

¡Hola Wagtail!

How to Create Your Own RAG with Free LLM Models and a Knowledge Base

Python crawler practice: using 98ip proxy IP to obtain cross-border e-commerce data

How to Add Quotes and Commas to Each Line in a Text File Using Python