<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Rafael Pereira </title>
    <description>The latest articles on DEV Community by Rafael Pereira (@shakarpg).</description>
    <link>https://dev.to/shakarpg</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3500501%2Faacc7cb7-bae9-4d68-8a0f-3feab7226d1c.jpeg</url>
      <title>DEV Community: Rafael Pereira </title>
      <link>https://dev.to/shakarpg</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/shakarpg"/>
    <language>en</language>
    <item>
      <title>Vision Stock-Financial Applet</title>
      <dc:creator>Rafael Pereira </dc:creator>
      <pubDate>Sun, 14 Sep 2025 00:40:59 +0000</pubDate>
      <link>https://dev.to/shakarpg/vision-stock-financial-applet-2hc6</link>
      <guid>https://dev.to/shakarpg/vision-stock-financial-applet-2hc6</guid>
      <description>&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/google-ai-studio-2025-09-03"&gt;Google AI Studio Multimodal Challenge&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I built &lt;strong&gt;Vision Stock-Financial&lt;/strong&gt;, an applet designed to simplify how small business owners track their inventory and finances.&lt;/p&gt;

&lt;p&gt;Manual inventory tracking and financial logging are tedious and time-consuming. With our applet, a user can simply &lt;strong&gt;take a picture of a shelf&lt;/strong&gt; to update their inventory or &lt;strong&gt;snap a photo of a receipt&lt;/strong&gt; to log an expense or revenue. This makes the process fast, intuitive, and less error-prone, so business owners can focus on what matters most: growing their business.&lt;/p&gt;

&lt;h2&gt;Demo&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Applet Link:&lt;/strong&gt; &lt;a href="https://github.com/shakarpg/Vision_Estoque_Financeiro_Applet.git" rel="noopener noreferrer"&gt;https://github.com/shakarpg/Vision_Estoque_Financeiro_Applet.git&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Below are a few screenshots of our applet in action:&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Caption: The main interface of the applet.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwg4l20j8jt8nt5uh0ei7.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwg4l20j8jt8nt5uh0ei7.png" alt="page" width="800" height="371"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fadp7alps6djbbdmoiz82.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fadp7alps6djbbdmoiz82.png" alt="slogan" width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;How I Used Google AI Studio&lt;/h2&gt;

&lt;p&gt;We used &lt;strong&gt;Google AI Studio&lt;/strong&gt; as our primary tool to develop and prototype the core intelligence of our applet. Specifically, we chose the &lt;strong&gt;Gemini 2.5 Flash&lt;/strong&gt; model for its multimodal processing (image and text) and its speed.&lt;/p&gt;

&lt;p&gt;Within AI Studio, we crafted and refined the prompts that instruct the AI to:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;&lt;strong&gt;Analyze an image&lt;/strong&gt; of a store shelf, visually identify the products, and count the units for each item.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Extract crucial information&lt;/strong&gt; from a receipt image, such as the total amount, date, vendor name, and line items, and return it as structured data.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The AI Studio interface allowed us to rapidly test different prompting strategies and fine-tune the instructions for maximum accuracy, which significantly accelerated our development cycle.&lt;/p&gt;
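
&lt;p&gt;To illustrate the receipt step described above, here is a minimal Python sketch of how a structured reply could be handled once the model returns it. Everything in it is an assumption made for illustration: the prompt text, the JSON key names, and the helpers &lt;code&gt;parse_receipt&lt;/code&gt; and &lt;code&gt;items_match_total&lt;/code&gt; are hypothetical and are not taken from the applet's source. The call to Gemini itself is left out.&lt;/p&gt;

```python
import json
from dataclasses import dataclass, field

# Hypothetical prompt text; the applet's real prompts are not shown in this post.
RECEIPT_PROMPT = (
    "Extract the vendor name, date (YYYY-MM-DD), total amount, and line items "
    "from this receipt image. Reply with JSON only, using the keys: "
    "vendor, date, total, items (each item has description and amount)."
)


@dataclass
class LineItem:
    description: str
    amount: float


@dataclass
class Receipt:
    vendor: str
    date: str
    total: float
    items: list = field(default_factory=list)


def parse_receipt(model_reply: str) -> Receipt:
    """Turn the model's JSON reply into a Receipt, rejecting incomplete data."""
    data = json.loads(model_reply)
    missing = [key for key in ("vendor", "date", "total") if key not in data]
    if missing:
        raise ValueError(f"reply is missing fields: {missing}")
    items = [
        LineItem(str(item["description"]), float(item["amount"]))
        for item in data.get("items", [])
    ]
    return Receipt(str(data["vendor"]), str(data["date"]), float(data["total"]), items)


def items_match_total(receipt: Receipt, tolerance: float = 0.01) -> bool:
    """Sanity check: do the line items add up to the stated total?"""
    if not receipt.items:
        return True  # nothing to compare against
    difference = abs(sum(item.amount for item in receipt.items) - receipt.total)
    return tolerance >= difference
```

&lt;p&gt;Validating the reply before logging it is one way to catch misreads: a receipt whose line items do not add up to the total can be flagged for the user to review instead of being saved silently.&lt;/p&gt;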

&lt;h2&gt;Multimodal Features&lt;/h2&gt;

&lt;p&gt;Our project's core multimodal feature is &lt;strong&gt;inventory and financial management through image analysis&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;This improves the user experience by &lt;strong&gt;replacing most manual data entry&lt;/strong&gt;. Instead of opening a spreadsheet to type "15 soda cans" or "$10.00 - purchase of cleaning supplies," the user simply points their camera and takes a picture.&lt;/p&gt;

&lt;p&gt;This multimodal approach makes management:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Faster:&lt;/strong&gt; A photo takes seconds, while manual entry can take several minutes.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;More Accurate:&lt;/strong&gt; It significantly reduces human errors from typos or miscounts.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;More Accessible:&lt;/strong&gt; It offers a far more intuitive and natural way to interact with a management system, especially for users who aren't comfortable with complex software.&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>devchallenge</category>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>gemini</category>
    </item>
  </channel>
</rss>
