<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Craig ML Dsouza</title>
    <description>The latest articles on DEV Community by Craig ML Dsouza (@craigmldsouza).</description>
    <link>https://dev.to/craigmldsouza</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3859064%2Fc8668948-8931-4456-81c9-ce48d0dcd4de.png</url>
      <title>DEV Community: Craig ML Dsouza</title>
      <link>https://dev.to/craigmldsouza</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/craigmldsouza"/>
    <language>en</language>
    <item>
      <title>Building an AI Research Agent That Uses Real Data (Wiki + Finance)</title>
      <dc:creator>Craig ML Dsouza</dc:creator>
      <pubDate>Fri, 03 Apr 2026 09:04:59 +0000</pubDate>
      <link>https://dev.to/craigmldsouza/building-an-ai-research-agent-that-uses-real-data-wiki-finance-ke5</link>
      <guid>https://dev.to/craigmldsouza/building-an-ai-research-agent-that-uses-real-data-wiki-finance-ke5</guid>
      <description>&lt;h1&gt;
  
  
  I Built an AI Agent That Uses Real Data Instead of Just Guessing
&lt;/h1&gt;

&lt;p&gt;Most AI tools today just generate text.&lt;/p&gt;

&lt;p&gt;They rely on model memory, often guess information, and produce inconsistent outputs.&lt;/p&gt;

&lt;p&gt;I wanted to explore a different approach — what if an AI system could &lt;strong&gt;fetch real data, process it step-by-step, and return structured insights&lt;/strong&gt; instead of raw text?&lt;/p&gt;

&lt;p&gt;So I built &lt;strong&gt;OpenAgent&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  What is OpenAgent?
&lt;/h2&gt;

&lt;p&gt;OpenAgent is a multi-step AI research agent designed to move beyond basic text generation.&lt;/p&gt;

&lt;p&gt;Instead of relying only on the model, it:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;pulls data from &lt;strong&gt;Wikipedia&lt;/strong&gt; for context&lt;/li&gt;
&lt;li&gt;fetches market data from &lt;strong&gt;Yahoo Finance&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;processes everything through a structured pipeline&lt;/li&gt;
&lt;li&gt;outputs clean, structured insights&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Why this matters
&lt;/h2&gt;

&lt;p&gt;Most AI systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;generate unstructured text&lt;/li&gt;
&lt;li&gt;mix signal with noise&lt;/li&gt;
&lt;li&gt;are difficult to integrate into real applications&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;OpenAgent focuses on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;real data instead of guesses&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;structured outputs instead of paragraphs&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;step-by-step processing instead of single-pass generation&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  How it works
&lt;/h2&gt;

&lt;p&gt;Each query goes through a multi-phase pipeline:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Planning → Execution → Signal Extraction → Synthesis
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Planning&lt;/strong&gt;&lt;br&gt;
Determines which tools to use (Wikipedia, Finance)&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Execution&lt;/strong&gt;&lt;br&gt;
Fetches real data from external sources&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Signal Extraction&lt;/strong&gt;&lt;br&gt;
Filters high-value information from raw data&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Synthesis&lt;/strong&gt;&lt;br&gt;
Generates structured output with key insights&lt;/p&gt;


&lt;h2&gt;
  
  
  Example
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Prompt:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Analyze Microsoft (MSFT)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Output:&lt;/strong&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight json"&gt;&lt;code&gt;&lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"summary"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"..."&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"keyInsights"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"..."&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"risks"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"..."&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"opportunities"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="s2"&gt;"..."&lt;/span&gt;&lt;span class="p"&gt;],&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"sentiment"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="s2"&gt;"POSITIVE"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;&lt;span class="w"&gt;
  &lt;/span&gt;&lt;span class="nl"&gt;"confidenceScore"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;&lt;span class="w"&gt; &lt;/span&gt;&lt;span class="mi"&gt;82&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="w"&gt;
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Instead of a long paragraph, you get &lt;strong&gt;usable, structured data&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  Demo
&lt;/h2&gt;

&lt;p&gt;The system fetches real data, processes it, and generates structured results in real time.&lt;/p&gt;

&lt;h2&gt;
  
  
    &lt;iframe src="https://www.youtube.com/embed/24_Jr5Cz0Bo"&gt;
  &lt;/iframe&gt;

&lt;/h2&gt;

&lt;h2&gt;
  
  
  What you can do with it
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Analyze stocks and companies&lt;/li&gt;
&lt;li&gt;Perform quick research with real data&lt;/li&gt;
&lt;li&gt;Build structured AI workflows&lt;/li&gt;
&lt;li&gt;Use outputs directly in applications&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Key takeaway
&lt;/h2&gt;

&lt;p&gt;LLMs are powerful, but raw text output is often not enough.&lt;/p&gt;

&lt;p&gt;By combining:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;external data sources&lt;/li&gt;
&lt;li&gt;structured pipelines&lt;/li&gt;
&lt;li&gt;controlled outputs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;you can build systems that are &lt;strong&gt;more reliable and usable in practice&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  Try it out
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;GitHub:&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://github.com/CraigMLdsouza/OpenAgent" rel="noopener noreferrer"&gt;https://github.com/CraigMLdsouza/OpenAgent&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Full version:&lt;/strong&gt;&lt;br&gt;
&lt;a href="https://craigstorm.gumroad.com/l/openagent-research" rel="noopener noreferrer"&gt;https://craigstorm.gumroad.com/l/openagent-research&lt;/a&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Final note
&lt;/h2&gt;

&lt;p&gt;This is a developer-focused project aimed at exploring more reliable AI systems.&lt;/p&gt;

&lt;p&gt;If you're building with AI, moving beyond text generation into &lt;strong&gt;data-driven agents&lt;/strong&gt; is a direction worth exploring.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>webdev</category>
      <category>programming</category>
      <category>agents</category>
    </item>
  </channel>
</rss>
