<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Tanisha Koladiya</title>
    <description>The latest articles on DEV Community by Tanisha Koladiya (@tanisha_koladiya_a4b9cae0).</description>
    <link>https://dev.to/tanisha_koladiya_a4b9cae0</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3279303%2F302a3114-d194-4218-8106-dc35fdb35f5f.png</url>
      <title>DEV Community: Tanisha Koladiya</title>
      <link>https://dev.to/tanisha_koladiya_a4b9cae0</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/tanisha_koladiya_a4b9cae0"/>
    <language>en</language>
    <item>
      <title>Tesseract OCR not reading blurry or broken text from image — need accurate image-to-text method</title>
      <dc:creator>Tanisha Koladiya</dc:creator>
      <pubDate>Fri, 20 Jun 2025 09:37:03 +0000</pubDate>
      <link>https://dev.to/tanisha_koladiya_a4b9cae0/tesseract-ocr-not-reading-blurry-or-broken-text-from-image-need-accurate-image-to-text-method-22cj</link>
      <guid>https://dev.to/tanisha_koladiya_a4b9cae0/tesseract-ocr-not-reading-blurry-or-broken-text-from-image-need-accurate-image-to-text-method-22cj</guid>
      <description>&lt;p&gt;I am currently using the Tesseract-OCR engine in my application to extract text from images. While it works well in many cases, I’m facing issues where it fails to read blurry or partially broken text, especially when the image contains:&lt;/p&gt;

&lt;p&gt;Small or anti-aliased fonts&lt;/p&gt;

&lt;p&gt;Blurry characters due to low resolution&lt;/p&gt;

&lt;p&gt;Digits or symbols like /, %, . that appear broken or unclear&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;I’ve already tried:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Preprocessing the image using OpenCV (Emgu CV in C#): resizing, thresholding, Gaussian blur, morphology&lt;/p&gt;

&lt;p&gt;Using OEM 1 (LSTM-only) and PSM 6 or 7&lt;/p&gt;

&lt;p&gt;Character whitelisting (e.g., "0123456789./%")&lt;/p&gt;

&lt;p&gt;Still, in some images (attached below), the OCR result is inaccurate — for example, it fails to read values like 96 / 120, 6.67%, etc.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What I need:&lt;/strong&gt;&lt;br&gt;
A reliable image-to-text conversion method that can:&lt;/p&gt;

&lt;p&gt;Handle blurry/low-resolution text&lt;/p&gt;

&lt;p&gt;Read small numeric data and symbols from digital display screenshots&lt;/p&gt;

&lt;p&gt;Be integrated into a C# application&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
