IronSoftware

Originally published at ironsoftware.com

Gujarati OCR in C# and .NET

IronOCR is a C# software component that lets .NET developers read text from images and PDF documents in 126 languages, including Gujarati.

It is an advanced fork of Tesseract, built exclusively for .NET developers, and it regularly outperforms other Tesseract engines in both speed and accuracy.

Contents of IronOcr.Languages.Gujarati

This package contains 6 OCR language variants for .NET:

  • Gujarati
  • GujaratiBest
  • GujaratiFast
  • GujaratiAlphabet
  • GujaratiAlphabetBest
  • GujaratiAlphabetFast

Download

Gujarati Language Pack [ગુજરાતી]

Installation

The first step is to install the Gujarati OCR package into your .NET project.

PM> Install-Package IronOcr.Languages.Gujarati

Code Example

This C# code example reads Gujarati text from an image or PDF document.

C#:

// PM> Install-Package IronOcr.Languages.Gujarati
using IronOcr;

var Ocr = new IronTesseract();
// Select the Gujarati language pack for recognition
Ocr.Language = OcrLanguage.Gujarati;
using (var Input = new OcrInput(@"images\Gujarati.png"))
{
    // Run OCR and collect the recognized text
    var Result = Ocr.Read(Input);
    var AllText = Result.Text;
}

VB:

'PM> Install-Package IronOcr.Languages.Gujarati
Imports IronOcr

Dim Ocr = New IronTesseract()
' Select the Gujarati language pack for recognition
Ocr.Language = OcrLanguage.Gujarati
Using Input = New OcrInput("images\Gujarati.png")
    ' Run OCR and collect the recognized text
    Dim Result = Ocr.Read(Input)
    Dim AllText As String = Result.Text
End Using
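
Choosing a different variant

The package contents list several variants besides the default Gujarati model. As a minimal sketch, assuming each listed name maps directly onto a member of the OcrLanguage enum (for example OcrLanguage.GujaratiBest), switching variants should only require changing the Language assignment. The "Best" and "Fast" suffixes suggest an accuracy-versus-speed trade-off, but that reading is an assumption and is not stated in this article.

```csharp
// PM> Install-Package IronOcr.Languages.Gujarati
using System;
using IronOcr;

var Ocr = new IronTesseract();
// Assumption: GujaratiBest is exposed on OcrLanguage the same way Gujarati is.
Ocr.Language = OcrLanguage.GujaratiBest;
using (var Input = new OcrInput(@"images\Gujarati.png"))
{
    // Same read call as in the example above; only the language variant differs
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
```

If the variant is not available in your installed version, the default OcrLanguage.Gujarati shown earlier remains the safe choice.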