IronSoftware

Originally published at ironsoftware.com

Multiple Languages for One Document

IronOCR supports 125 international languages.

You can use more than one language at a time to read documents that contain text in several languages.

You may also use downloaded or proprietary language and font packs, provided they follow the Tesseract .traineddata file format.
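For the custom language case, a minimal sketch might look like the following. It assumes the `UseCustomTesseractLanguageFile` method available in recent IronOCR releases, and both file paths are hypothetical placeholders; check the method name against the version you have installed.

```csharp
using System;
using IronOcr;

var Ocr = new IronTesseract();

// Assumption: UseCustomTesseractLanguageFile loads a Tesseract .traineddata file.
// The path below is a hypothetical placeholder, not a file shipped with IronOCR.
Ocr.UseCustomTesseractLanguageFile(@"custom_tesseract_files\custom.traineddata");

// Hypothetical input image, read the same way as with a built-in language.
using (var Input = new OcrInput(@"images\image.png"))
{
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
```

The custom file takes the place of a packaged language, so no `IronOcr.Languages.*` package should be needed for it.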

C#:

// PM> Install-Package IronOcr.Languages.Arabic

using System;
using IronOcr;

var Ocr = new IronTesseract();
// Primary language used for recognition
Ocr.Language = OcrLanguage.English;
// Secondary language for mixed-language documents; add as many as you like
Ocr.AddSecondaryLanguage(OcrLanguage.Arabic);

using (var Input = new OcrInput(@"images\image.png"))
{
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}

VB:

'PM> Install-Package IronOcr.Languages.Arabic

Imports System
Imports IronOcr

Module Program
    Sub Main()
        Dim Ocr = New IronTesseract()
        Ocr.Language = OcrLanguage.English
        Ocr.AddSecondaryLanguage(OcrLanguage.Arabic)
        ' Add as many as you like

        Using Input = New OcrInput("images\image.png")
            Dim Result = Ocr.Read(Input)
            Console.WriteLine(Result.Text)
        End Using
    End Sub
End Module