<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Silvestre</title>
    <description>The latest articles on DEV Community by Silvestre (@silvestre-po).</description>
    <link>https://dev.to/silvestre-po</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3648562%2F6dab7d5b-8051-4f33-9b6a-72b7a498d939.jpg</url>
      <title>DEV Community: Silvestre</title>
      <link>https://dev.to/silvestre-po</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/silvestre-po"/>
    <language>en</language>
    <item>
      <title>From "Why?" to Wow: Building a Multi-Agent Storyteller After 5-Day AI Agents Intensive Course with Google</title>
      <dc:creator>Silvestre</dc:creator>
      <pubDate>Wed, 10 Dec 2025 05:58:06 +0000</pubDate>
      <link>https://dev.to/silvestre-po/from-why-to-wow-building-a-multi-agent-storyteller-after-5-day-ai-agents-intensive-course-with-1lm7</link>
      <guid>https://dev.to/silvestre-po/from-why-to-wow-building-a-multi-agent-storyteller-after-5-day-ai-agents-intensive-course-with-1lm7</guid>
      <description>&lt;h2&gt;My "Aha!" Moment: AI Agents Are More Than Just Chatbots&lt;/h2&gt;

&lt;p&gt;Before the 5-Day AI Agents Intensive, I thought of AI agents mostly as conversational interfaces: smart chatbots that could answer questions. The course shattered that perception. My key takeaway, and the concept that resonated most, was the idea of an agent as an &lt;strong&gt;orchestrator of specialized tools&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;It's not about one giant model doing everything. It's about a reasoning engine that knows how to solve a complex problem by breaking it down and delegating tasks to the best "specialist" for the job. This shift from a monolithic to a modular, tool-centric mindset was my biggest "aha!" moment.&lt;/p&gt;

&lt;h2&gt;How My Understanding Evolved: The Power of the "Worker Agents"&lt;/h2&gt;

&lt;p&gt;The course's deep dive into &lt;strong&gt;Multi-Agent Systems&lt;/strong&gt; (Day 1) and &lt;strong&gt;Tools/MCP&lt;/strong&gt; (Day 2) was a game-changer. I stopped thinking about building a single, all-powerful agent and started thinking about creating a team of "worker agents" managed by a "coordinator".&lt;/p&gt;

&lt;p&gt;This led to a fundamental change in my approach:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Before: "How can I prompt a model to generate a story, an image, and audio?"&lt;/li&gt;
&lt;li&gt;After: "How can a Coordinator Agent manage three Specialized Agents—a Writer (Gemini), an Illustrator (Flux.1), and a Narrator (OpenAI TTS)—to work in parallel and deliver a result faster and more efficiently?"&lt;/li&gt;
&lt;/ul&gt;
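
&lt;p&gt;The "After" question can be sketched with Python's &lt;code&gt;asyncio&lt;/code&gt;. This is a minimal illustration under stated assumptions, not the project's actual code: the three specialist functions are hypothetical stubs that sleep instead of calling Gemini, Flux.1, or OpenAI TTS, and each one receives only the question so that all three can start at once. How the real project passes inputs between agents is not shown here.&lt;/p&gt;

```python
import asyncio


# Hypothetical stand-ins for the three specialists. Real versions would call
# Gemini, Flux.1, and OpenAI TTS; the sleeps only simulate network latency.
async def write_story(question: str) -> str:
    await asyncio.sleep(0.05)
    return f"Once upon a time, a child asked: {question}"


async def draw_illustration(question: str) -> str:
    await asyncio.sleep(0.05)
    return f"illustration for '{question}'"


async def narrate(question: str) -> str:
    await asyncio.sleep(0.05)
    return f"audio for '{question}'"


async def coordinate(question: str) -> dict:
    # Start all three specialists concurrently. gather() returns the results
    # in the same order as the awaitables passed in.
    story, image, audio = await asyncio.gather(
        write_story(question),
        draw_illustration(question),
        narrate(question),
    )
    return {"story": story, "image": image, "audio": audio}


if __name__ == "__main__":
    page = asyncio.run(coordinate("Why is the sky blue?"))
    print(page)
```

&lt;p&gt;Because the stubs wait concurrently, the total wait is roughly that of the slowest specialist rather than the sum of all three, which is the user-experience benefit the coordinator pattern is after.&lt;/p&gt;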

&lt;p&gt;This evolution in understanding was the direct inspiration for my capstone project.&lt;/p&gt;

&lt;h2&gt;My Capstone Project: 🦁 Curiosity Storybook&lt;/h2&gt;

&lt;p&gt;For the capstone, I built &lt;strong&gt;Curiosity Storybook&lt;/strong&gt;, an AI agent for the "Agents for Good" track that transforms a child's "Why?" into a magical, multi-sensory learning experience.&lt;/p&gt;

&lt;p&gt;Instead of a dry answer, it generates a complete, personalized storybook page with a story, an illustration, and an audio narration.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1udh34e9aesf6dpmurtx.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F1udh34e9aesf6dpmurtx.jpg" alt="Curiosity Storybook" width="800" height="450"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://github.com/Silvestre-PO/Curiosity-Storybook" rel="noopener noreferrer"&gt;GitHub Repository&lt;/a&gt;&lt;br&gt;
&lt;a href="https://www.youtube.com/watch?v=22J-rUAjF9Y" rel="noopener noreferrer"&gt;Youtube video&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;This project demonstrates how the course's core concepts (multi-agent orchestration, tools exposed over MCP, context engineering, and observability) can combine into a seamless and magical experience.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;General Architecture&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;1. Frontend (UI/UX)&lt;/strong&gt;: A kid-friendly interface built with Gradio, hosted on Hugging Face Spaces.&lt;br&gt;
&lt;strong&gt;2. Agent Orchestrator&lt;/strong&gt;: A main agent managed with Blaxel that uses Gemini 2.5 Pro for reasoning and content generation.&lt;br&gt;
&lt;strong&gt;3. Tools&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A custom MCP (Model Context Protocol) server that exposes tools for specific tasks like narration.&lt;/li&gt;
&lt;li&gt;Direct calls to heavy-compute services for long-running tasks like image generation.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;4. AI Models&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Google Gemini 2.5 Pro: For generating the main story and the illustration prompt.&lt;/li&gt;
&lt;li&gt;Flux.1-schnell: For high-quality image generation.&lt;/li&gt;
&lt;li&gt;OpenAI TTS: For audio narration.&lt;/li&gt;
&lt;li&gt;Hyperbolic (Llama 3.3): For ultra-fast generation of related questions.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2nbuzqhmehy434wwz4so.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2nbuzqhmehy434wwz4so.png" alt="General Architecture" width="800" height="568"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;What I Learned by Building It&lt;/h2&gt;

&lt;p&gt;Building this project was where the concepts from the course truly clicked.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Multi-Agent Systems are Practical, Not Just Theoretical&lt;/strong&gt;: My project implements a &lt;strong&gt;Coordinator/Specialist&lt;/strong&gt; pattern. A main agent in Blaxel orchestrates three parallel tasks, each handled by a specialized model. Watching the story, image, and audio generate concurrently was proof of how powerful this architecture is for user experience.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Context Engineering is the Secret Sauce&lt;/strong&gt;: Day 3's lesson on Context Engineering was crucial. I implemented a ConversationContext class that uses &lt;strong&gt;compaction&lt;/strong&gt; (summarizing history) to feed a "Question Suggester" agent (Hyperbolic). This allows the agent to suggest relevant follow-up questions without needing the entire conversation transcript, making it fast and efficient. It's the feature that makes the experience feel like a continuous journey of discovery.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Observability Isn't an Afterthought&lt;/strong&gt;: The "Agent Quality" lesson (Day 4) pushed me to integrate basic observability from the start. I implemented &lt;strong&gt;logging&lt;/strong&gt; for all tool calls and &lt;strong&gt;tracing&lt;/strong&gt; (by passing a session_id) to follow a request from start to finish. When the image generation failed once, I could pinpoint the exact step, proving the value of this pillar immediately.&lt;/li&gt;
&lt;/ol&gt;
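
&lt;p&gt;Points 2 and 3 can be sketched together. This is a rough illustration, not the project's code: only the names &lt;code&gt;ConversationContext&lt;/code&gt; and &lt;code&gt;session_id&lt;/code&gt; come from the description above. The method names, the &lt;code&gt;call_tool&lt;/code&gt; helper, and the truncation-based compaction are assumptions made for the example; a real implementation would more likely summarize old turns with a model.&lt;/p&gt;

```python
import logging

logger = logging.getLogger("storybook")


class ConversationContext:
    """Keeps the most recent turns verbatim and compacts older ones."""

    def __init__(self, session_id: str, max_recent: int = 2):
        self.session_id = session_id
        self.max_recent = max_recent
        self.older_topics = []  # compacted history
        self.recent = []        # latest turns, kept verbatim

    def add_turn(self, question: str) -> None:
        self.recent.append(question)
        while len(self.recent) > self.max_recent:
            oldest = self.recent.pop(0)
            # Placeholder compaction (assumption): keep the first 40 characters.
            self.older_topics.append(oldest[:40])
            logger.info("session=%s compacted one turn", self.session_id)

    def summary(self) -> str:
        return "; ".join(self.older_topics) or "none"

    def prompt_context(self) -> str:
        # Short context for a question suggester, instead of the full transcript.
        return f"Earlier topics: {self.summary()}. Recent: {' | '.join(self.recent)}"


def call_tool(session_id: str, name: str, tool, *args):
    """Run a tool and log start, success, or failure tagged with the session."""
    logger.info("session=%s tool=%s start", session_id, name)
    try:
        result = tool(*args)
    except Exception:
        # logger.exception records the traceback, so the failing step is visible.
        logger.exception("session=%s tool=%s failed", session_id, name)
        raise
    logger.info("session=%s tool=%s ok", session_id, name)
    return result
```

&lt;p&gt;Filtering the logs by a single &lt;code&gt;session=...&lt;/code&gt; value then shows every tool call for one request in order, which is how a failed image generation step can be pinpointed.&lt;/p&gt;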

&lt;h2&gt;Final Thoughts&lt;/h2&gt;

&lt;p&gt;The AI Agents Intensive course was more than a series of lectures; it was a fundamental shift in my mental model of what AI can do. It moved me from thinking about "prompts" to thinking about "systems". My understanding has evolved from seeing agents as simple interfaces to seeing them as complex, problem-solving engines. And "Curiosity Storybook" is the tangible result of that journey.&lt;/p&gt;

</description>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>agents</category>
      <category>devchallenge</category>
    </item>
  </channel>
</rss>
