<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Ajinkya Bobade</title>
    <description>The latest articles on DEV Community by Ajinkya Bobade (@ajinkya_bobade_f1cf60e720).</description>
    <link>https://dev.to/ajinkya_bobade_f1cf60e720</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F2689467%2Ff1ea179d-9f5a-4493-a804-e4b3145b388d.jpg</url>
      <title>DEV Community: Ajinkya Bobade</title>
      <link>https://dev.to/ajinkya_bobade_f1cf60e720</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ajinkya_bobade_f1cf60e720"/>
    <language>en</language>
    <item>
      <title>📝✨ClearText</title>
      <dc:creator>Ajinkya Bobade</dc:creator>
      <pubDate>Wed, 15 Jan 2025 18:21:59 +0000</pubDate>
      <link>https://dev.to/ajinkya_bobade_f1cf60e720/cleartext-o73</link>
      <guid>https://dev.to/ajinkya_bobade_f1cf60e720/cleartext-o73</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/github"&gt;GitHub Copilot Challenge&lt;/a&gt;: Transitions and Transformations&lt;/em&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Built
&lt;/h2&gt;

&lt;p&gt;I built ClearText, an AI-powered text detection and enhancement tool that cleans up text in images.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3psbh1m7wwbz1yjwv1tc.jpeg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3psbh1m7wwbz1yjwv1tc.jpeg" alt="Title bar" width="800" height="77"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  It's Perfect For 🎯
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;📄 Document Digitization&lt;/li&gt;
&lt;li&gt;📚 Book Scanning&lt;/li&gt;
&lt;li&gt;📱 Mobile Photos of Text&lt;/li&gt;
&lt;li&gt;🖨️ Improving Scanned Documents&lt;/li&gt;
&lt;li&gt;📑 Text Enhancement in Images&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://www.youtube.com/watch?v=hS363wB1pmU" rel="noopener noreferrer"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frb46gqvucal1sw7smstq.jpg" alt="ClearText Demo" width="480" height="360"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Repo
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://github.com/ajinkya933/ClearText" rel="noopener noreferrer"&gt;Github Repository - ClearText&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Here's an example of what ClearText can do:&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3aeoyzeb8snws9dbojdh.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3aeoyzeb8snws9dbojdh.png" alt="ClearText Demo" width="800" height="549"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpl4czuon5ys779lhbw9t.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fpl4czuon5ys779lhbw9t.png" alt="Image description" width="800" height="294"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;ClearText takes an input image (left), removes the noise, and outputs only the text (right).&lt;/p&gt;

&lt;p&gt;ClearText could be applied in the following fields:&lt;/p&gt;

&lt;h3&gt;
  
  
  Document Processing 📄
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Banking &amp;amp; Finance&lt;/strong&gt;

&lt;ul&gt;
&lt;li&gt;🏦 Check processing&lt;/li&gt;
&lt;li&gt;📊 Financial statement digitization&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  Healthcare 🏥
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Medical Records&lt;/strong&gt;

&lt;ul&gt;
&lt;li&gt;📋 Patient records digitization&lt;/li&gt;
&lt;li&gt;🔬 Lab report enhancement&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  Legal Industry ⚖️
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Document Management&lt;/strong&gt;

&lt;ul&gt;
&lt;li&gt;📜 Contract digitization&lt;/li&gt;
&lt;li&gt;🗄️ Case file processing&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  Academic Use Cases 📚
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;📖 Textbook scanning&lt;/li&gt;
&lt;li&gt;📑 Research paper digitization&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Copilot Experience 🤖
&lt;/h2&gt;

&lt;p&gt;I used GitHub Copilot extensively to complete this project. Here is how Copilot helped:&lt;/p&gt;

&lt;h3&gt;
  
  
  Code Completion 📝
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Auto-completed common OpenCV operations&lt;/li&gt;
&lt;li&gt;Suggested image processing parameters&lt;/li&gt;
&lt;li&gt;Completed function signatures for Streamlit components&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Chat Assistance 💬
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Debugged ONNX model loading issues&lt;/li&gt;
&lt;li&gt;Explained image processing pipeline&lt;/li&gt;
&lt;li&gt;Suggested optimizations for image transformations&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Inline Suggestions ⚡
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Recommended error handling patterns&lt;/li&gt;
&lt;li&gt;Suggested variable names and types&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Model Switching 🔄
&lt;/h3&gt;

&lt;p&gt;I used different models for specific tasks:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Code Completion: GitHub Copilot&lt;/li&gt;
&lt;li&gt;Documentation: Claude&lt;/li&gt;
&lt;li&gt;Debugging: GPT-4&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Common Prompts Used 🎯
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;# Function implementation
/explain image processing pipeline
/suggest error handling
/optimize performance
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Code Edits ✏️
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Refactored image processing functions&lt;/li&gt;
&lt;li&gt;Added blur/no-blur options&lt;/li&gt;
&lt;li&gt;Improved error messages&lt;/li&gt;
&lt;li&gt;Enhanced documentation&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Project Evolution &amp;amp; Contributions
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Building on Open Source
&lt;/h3&gt;

&lt;p&gt;This project builds upon the excellent &lt;a href="https://github.com/clovaai/CRAFT-pytorch" rel="noopener noreferrer"&gt;CRAFT text detection model&lt;/a&gt; by CLOVA AI Research, while making significant architectural and functional improvements:&lt;/p&gt;

&lt;h4&gt;
  
  
  1. Production-Ready Architecture 🏗️
&lt;/h4&gt;

&lt;ul&gt;
&lt;li&gt;Converted the research-focused PyTorch model to the ONNX format for production use&lt;/li&gt;
&lt;li&gt;Leveraged &lt;a href="https://onnxruntime.ai/" rel="noopener noreferrer"&gt;ONNX Runtime&lt;/a&gt; for optimized inference across different hardware&lt;/li&gt;
&lt;li&gt;Added complete Docker containerization for reliable deployment&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;
  
  
  2. Enhanced Text Processing Pipeline 🔄
&lt;/h4&gt;

&lt;p&gt;The original CRAFT model provides basic text detection. ClearText significantly expands on this by:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Adding custom image preprocessing for better text clarity&lt;/li&gt;
&lt;li&gt;Implementing new post-processing transforms for enhanced output quality&lt;/li&gt;
&lt;li&gt;Creating an entirely new text enhancement pipeline&lt;/li&gt;
&lt;li&gt;Developing a user-friendly web interface for easy interaction&lt;/li&gt;
&lt;/ul&gt;

&lt;h4&gt;
  
  
  3. Major Output Improvements 📈
&lt;/h4&gt;

&lt;p&gt;ClearText transforms the basic text detection output into a comprehensive text enhancement solution:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Original CRAFT: Basic text region detection&lt;/li&gt;
&lt;li&gt;ClearText Additions:

&lt;ul&gt;
&lt;li&gt;Text clarity enhancement&lt;/li&gt;
&lt;li&gt;Document digitization capabilities&lt;/li&gt;
&lt;li&gt;Support for various document types (books, mobile photos, scanned documents)&lt;/li&gt;
&lt;li&gt;Complete image processing pipeline&lt;/li&gt;
&lt;/ul&gt;


&lt;/li&gt;

&lt;/ul&gt;

&lt;h3&gt;
  
  
  Transparency Statement
&lt;/h3&gt;

&lt;p&gt;While this project builds upon CRAFT's foundational text detection capabilities, ClearText represents a significant evolution with new functionality, architecture, and use cases. All original CRAFT code is credited and licensed under the MIT License.&lt;/p&gt;

&lt;h2&gt;
  
  
  Conclusion
&lt;/h2&gt;

&lt;p&gt;Developing ClearText during the GitHub Copilot 1-Day Build Challenge has been an amazing journey. Without Copilot, transforming a complex text detection model into an accessible, user-friendly web application would have been far more difficult. The project showcases how AI can bridge the gap between computer vision and practical, everyday use cases.&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>githubchallenge</category>
      <category>webdev</category>
      <category>ai</category>
    </item>
  </channel>
</rss>
