<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Kaustav Chowdhury</title>
    <description>The latest articles on DEV Community by Kaustav Chowdhury (@kaustav_chowdhury_f3cdc47).</description>
    <link>https://dev.to/kaustav_chowdhury_f3cdc47</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3645323%2F351b52b0-7399-4921-add8-288cdf42090d.jpg</url>
      <title>DEV Community: Kaustav Chowdhury</title>
      <link>https://dev.to/kaustav_chowdhury_f3cdc47</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/kaustav_chowdhury_f3cdc47"/>
    <language>en</language>
    <item>
      <title>Beyond the Hype: Why Google Cloud's "Shift Down" Agent Security Changes Everything</title>
      <dc:creator>Kaustav Chowdhury</dc:creator>
      <pubDate>Mon, 27 Apr 2026 19:12:30 +0000</pubDate>
      <link>https://dev.to/kaustav_chowdhury_f3cdc47/beyond-the-hype-why-google-clouds-shift-down-agent-security-changes-everything-n28</link>
      <guid>https://dev.to/kaustav_chowdhury_f3cdc47/beyond-the-hype-why-google-clouds-shift-down-agent-security-changes-everything-n28</guid>
      <description>&lt;h2&gt;
  
  
  Google Cloud NEXT '26&amp;lt;!-- hidden chaos --&amp;gt;
&lt;/h2&gt;

&lt;h2&gt;
  
  
  The Rise of the Agentic Enterprise
&lt;/h2&gt;

&lt;p&gt;This is a submission for the Google Cloud NEXT Writing Challenge&lt;/p&gt;




&lt;h2&gt;
  
  
  The Context&amp;lt;!-- invisible --&amp;gt;
&lt;/h2&gt;

&lt;p&gt;When the dust settles after a massive event like &lt;strong&gt;Google Cloud NEXT '26&lt;/strong&gt;—with its staggering &lt;strong&gt;260 announcements&lt;/strong&gt;—it’s easy to get lost in the marketing buzzwords.&lt;/p&gt;

&lt;p&gt;This year, the undeniable theme was the &lt;strong&gt;"Agentic Enterprise."&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;But let’s be honest for a second.&lt;/p&gt;

&lt;p&gt;As a developer who actually has to build, deploy, and maintain these systems...&lt;/p&gt;

&lt;p&gt;Hearing about autonomous agents that can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;write code
&lt;/li&gt;
&lt;li&gt;spin up tools
&lt;/li&gt;
&lt;li&gt;execute multi-step workflows
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;doesn't just sound exciting—it sounds &lt;strong&gt;terrifying&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Problem with "Shift Left"
&lt;/h2&gt;

&lt;p&gt;For the past few years, the industry mantra has been:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"Shift Left"&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Pushing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Security
&lt;/li&gt;
&lt;li&gt;Testing
&lt;/li&gt;
&lt;li&gt;Compliance
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;onto developers earlier in the lifecycle.&lt;/p&gt;




&lt;p&gt;But here’s the issue:&lt;/p&gt;

&lt;p&gt;Building an ecosystem of autonomous agents makes that model &lt;strong&gt;unsustainable&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  💥 The Real Breakthrough
&lt;/h2&gt;

&lt;p&gt;That’s why the most groundbreaking moment from NEXT '26 wasn’t a shiny new LLM benchmark.&lt;/p&gt;

&lt;p&gt;It was a philosophical shift introduced during the Day 2 Developer Keynote:&lt;/p&gt;

&lt;h2&gt;
  
  
  Shifting Down
&lt;/h2&gt;




&lt;p&gt;  &lt;iframe src="https://www.youtube.com/embed/V-XfXhT-jGg"&gt;
  &lt;/iframe&gt;
&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;⚠️ If the video doesn’t load, watch it here:&lt;br&gt;&lt;br&gt;
&lt;a href="https://www.youtube.com/watch?v=V-XfXhT-jGg" rel="noopener noreferrer"&gt;https://www.youtube.com/watch?v=V-XfXhT-jGg&lt;/a&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;Here is a practitioner's take on why Google Cloud’s new agent security architecture—specifically:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Agent Gateway
&lt;/li&gt;
&lt;li&gt;Agent Identity
&lt;/li&gt;
&lt;li&gt;Wiz integration
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;is the &lt;strong&gt;lifeline developers desperately needed&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  🛑 The Problem: The Wild West of Autonomous Agents
&lt;/h2&gt;

&lt;p&gt;Until now, if you wanted to build an agent that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;connected to your database
&lt;/li&gt;
&lt;li&gt;read your internal Slack
&lt;/li&gt;
&lt;li&gt;triggered a deployment
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;you were largely duct-taping APIs together.&lt;/p&gt;




&lt;p&gt;If something went wrong:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Agent hallucinated a destructive API call
&lt;/li&gt;
&lt;li&gt;Prompt injection attack
&lt;/li&gt;
&lt;li&gt;Unauthorized access
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;👉 The blast radius was entirely your responsibility.&lt;/p&gt;




&lt;blockquote&gt;
&lt;p&gt;"It’s not sustainable for developers to be responsible for all the layers of the stack."&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  🛡️ "Shifting Down": Security at the Platform Layer
&lt;/h2&gt;

&lt;p&gt;Instead of making the application code heavier, Google is moving the guardrails down to the infrastructure layer via the &lt;strong&gt;Gemini Enterprise Agent Platform&lt;/strong&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  Security Comparison
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Security Approach&lt;/th&gt;
&lt;th&gt;Primary Focus&lt;/th&gt;
&lt;th&gt;Developer Burden&lt;/th&gt;
&lt;th&gt;Scalability for AI&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Shift Left 👈&lt;/td&gt;
&lt;td&gt;Application Code Logic&lt;/td&gt;
&lt;td&gt;High 🥵&lt;/td&gt;
&lt;td&gt;Low&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Shift Down 👇&lt;/td&gt;
&lt;td&gt;Infrastructure &amp;amp; Gateway&lt;/td&gt;
&lt;td&gt;Low 😌&lt;/td&gt;
&lt;td&gt;High&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  🔐 Two Key Announcements
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Agent Identity &amp;amp; Cryptographic Signatures
&lt;/h3&gt;

&lt;p&gt;Every agent now gets a &lt;strong&gt;verifiable, cryptographic ID&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;Think of it as:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;A strict Service Account specifically for AI&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;You are no longer:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;granting broad permissions to backend services
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Instead:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;mapping strict IAM policies directly to an agent's identity
&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;If Agent A tries to access something it shouldn’t:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The infrastructure stops it — not your application logic.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h3&gt;
  
  
  Agent Gateway &amp;amp; Model Armor
&lt;/h3&gt;

&lt;p&gt;Agent Gateway acts as the central control plane for the agent fleet.&lt;/p&gt;

&lt;p&gt;With Model Armor integrated:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Prompt injections are blocked
&lt;/li&gt;
&lt;li&gt;Tool poisoning is prevented
&lt;/li&gt;
&lt;li&gt;Sensitive data leakage is filtered
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Before execution even begins.&lt;/p&gt;




&lt;h2&gt;
  
  
  ⚙️ The Wiz Integration: Inline Security Hooks
&lt;/h2&gt;

&lt;p&gt;Google's acquisition of Wiz is already paying massive dividends.&lt;/p&gt;

&lt;p&gt;This is a massive workflow upgrade.&lt;/p&gt;




&lt;p&gt;Instead of:&lt;/p&gt;

&lt;p&gt;❌ Deploy → Wait → Get flagged  &lt;/p&gt;

&lt;p&gt;You now have:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;💻 IDE Integration
&lt;/li&gt;
&lt;li&gt;🛑 Pre-commit Guardrails
&lt;/li&gt;
&lt;li&gt;🔍 Agent Sandbox Supervision
&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;If you're using Agent Sandbox:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Wiz tracks everything via the Wiz Security Graph&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  💻 What This Looks Like in Practice
&lt;/h2&gt;

&lt;p&gt;Instead of hardcoding complex validation logic:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Your deployment configuration handles everything&lt;/p&gt;
&lt;/blockquote&gt;






&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;
yaml
# Conceptual implementation of an Agent Gateway Security Policy
apiVersion: agentplatform.googleapis.com/v1
kind: AgentSecurityPolicy

metadata:
  name: financial-data-agent-guardrail

spec:
  agentIdentity: "crypto-id-8847-agent-financial"

  modelArmor:
    enablePromptInjectionBlocking: true
    dataLeakPrevention:
      enabled: true
      redactPatterns:
        - "CREDIT_CARD"
        - "SSN"
        - "INTERNAL_API_KEYS"

  toolAccess:
    allowedRegistryTools:
      - "bigquery-read-only"
      - "looker-dashboard-generator"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

</description>
      <category>devchallenge</category>
      <category>cloudnextchallenge</category>
      <category>googlecloud</category>
    </item>
    <item>
      <title>From Wrappers to Reasoners: Building an Iterative Research Agent</title>
      <dc:creator>Kaustav Chowdhury</dc:creator>
      <pubDate>Thu, 04 Dec 2025 06:14:55 +0000</pubDate>
      <link>https://dev.to/kaustav_chowdhury_f3cdc47/from-wrappers-to-reasoners-building-an-iterative-research-agent-3j7l</link>
      <guid>https://dev.to/kaustav_chowdhury_f3cdc47/from-wrappers-to-reasoners-building-an-iterative-research-agent-3j7l</guid>
      <description>&lt;p&gt;Submission for the AI Agents Intensive Course Writing Challenge&lt;/p&gt;

&lt;p&gt;Introduction&lt;/p&gt;

&lt;p&gt;I'll be honest—before taking the 5-Day AI Agents Intensive Course with Google and Kaggle, I thought I understood agents. Spoiler: I didn't.&lt;/p&gt;

&lt;p&gt;My mental model was basically wrapping an LLM in a while(true) loop, adding a few if-statements, and calling it "agentic." Embarrassing in hindsight, but we all start somewhere, right?&lt;/p&gt;

&lt;p&gt;The past five days have completely rewired that thinking. I've gone from building simple, reactive bots to trying to architect "deliberative" systems that actually plan (and sometimes overthink) their next move.&lt;/p&gt;

&lt;p&gt;Key Learnings &amp;amp; Insights&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Reactive vs. Deliberative: The Wake-Up Call&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Honestly? The distinction between Reactive and Deliberative agents hit me like a truck. I'd been building 'agents' for months—turns out most were just glorified function callers.&lt;/p&gt;

&lt;p&gt;Reactive Agents: Stimulus -&amp;gt; Response. Simple, but dumb.&lt;/p&gt;

&lt;p&gt;Deliberative Agents: They maintain state. They reason. They hold a grudge (okay, not really, but they remember context).&lt;/p&gt;

&lt;p&gt;Realizing that an agent isn't just a fancy API wrapper but a reasoning engine changed everything. It shifted my focus from obsessing over prompt syntax to actually designing a system architecture.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Memory is Harder Than It Looks&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Memory management was where I got stuck the longest. I kept thinking, "just pass the whole conversation history, context windows are huge now!"&lt;/p&gt;

&lt;p&gt;Day 3 taught me that's expensive and, frankly, lazy. Distinguishing between short-term memory (current task) and episodic memory (remembering how we solved this last time) is the difference between a toy and a tool.&lt;/p&gt;

&lt;p&gt;The Capstone Project: Iterative Research &amp;amp; Refinement Agent&lt;/p&gt;

&lt;p&gt;For my capstone, I wanted to fix a pet peeve: simple research agents that just grab the first Google result and call it a day. That's not research; that's confirmation bias.&lt;/p&gt;

&lt;p&gt;I built the Iterative Research &amp;amp; Refinement Agent (IRRA).&lt;/p&gt;

&lt;p&gt;The Architecture&lt;/p&gt;

&lt;p&gt;Day 4's LangGraph session was where the state management architecture finally clicked for me—that's what made the Critic's feedback loop possible.&lt;/p&gt;

&lt;p&gt;Instead of a straight line, I used a Self-Correction Loop.&lt;/p&gt;

&lt;p&gt;graph TD&lt;br&gt;
    User[User Query] --&amp;gt; Planner&lt;br&gt;
    Planner --&amp;gt;|Sub-questions| Researcher&lt;br&gt;
    Researcher --&amp;gt;|Findings| Critic&lt;br&gt;
    Critic --&amp;gt;|Approved| Writer&lt;br&gt;
    Critic --&amp;gt;|Rejected + Feedback| Planner&lt;br&gt;
    Writer --&amp;gt; FinalReport&lt;/p&gt;

&lt;p&gt;The system has three "personas" arguing with each other:&lt;/p&gt;

&lt;p&gt;The Planner: Breaks the query down.&lt;/p&gt;

&lt;p&gt;The Researcher: Googles stuff.&lt;/p&gt;

&lt;p&gt;The Critic: The novelty. It reviews findings before the final write-up.&lt;/p&gt;

&lt;p&gt;How It Works (The "Trust But Verify" Loop)&lt;/p&gt;

&lt;p&gt;If the Critic hates the findings, it kicks them back.&lt;/p&gt;

&lt;h1&gt;
  
  
  The logic that kept me up at night
&lt;/h1&gt;

&lt;p&gt;while iteration &amp;lt; max_retries:&lt;br&gt;
    findings = researcher.search(query)&lt;br&gt;
    critique = critic.review(findings)&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;if critique.status == "APPROVED":
    final_report = writer.compile(findings)
    break
else:
    # "Do better." - The Critic
    print(f"Critique received: {critique.feedback}")
    query = planner.refine_query(original_query, critique.feedback)
    iteration += 1
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;The "Aha!" Moment&lt;/p&gt;

&lt;p&gt;I asked it to research "The benefits of coffee."&lt;/p&gt;

&lt;p&gt;Round 1: It found a bunch of generic lifestyle blog fluff.&lt;/p&gt;

&lt;p&gt;Critic: "These sources are weak. No clinical studies, and we haven't even mentioned anxiety or sleep disruption."&lt;/p&gt;

&lt;p&gt;Round 2 (Automatic): The Planner pivoted to "Clinical studies coffee anxiety insomnia correlation."&lt;/p&gt;

&lt;p&gt;I didn't explicitly ask for a balanced report—the agent just... decided a good report needed counter-arguments? That was weird and incredibly cool.&lt;/p&gt;

&lt;p&gt;Does it actually work?&lt;br&gt;
In my tests with 20 research queries, IRRA cited an average of 5.3 sources (vs. 2.1 for baseline agents) and caught contradictions in 65% of cases where a simple RAG pipeline would have missed them.&lt;/p&gt;

&lt;p&gt;The Failure (Or: How I Accidentally Created a Hater)&lt;/p&gt;

&lt;p&gt;Let me tell you about the time my agent gaslit itself into an infinite loop.&lt;/p&gt;

&lt;p&gt;My first version of the Critic was way too picky. It rejected a perfectly good CDC source because "the writing style seemed informal." I watched it loop 47 times over 23 minutes, burning through $2.47 in API calls, rejecting everything until it hit the recursion limit and crashed.&lt;/p&gt;

&lt;p&gt;I spent an entire evening adding guardrails like if rejection_count &amp;gt; 3: lower_standards(). Not my proudest code, but it stopped the infinite loops of perfectionism. (Yes, I named the function that. No, I won't change it.)&lt;/p&gt;

&lt;p&gt;What I'd Do Differently&lt;/p&gt;

&lt;p&gt;If I rebuilt this tomorrow, I'd implement a "confidence score" for the Critic instead of a binary approve/reject system. A nuanced score (e.g., "60% confidence - requires minor verification") would reduce those expensive loops while maintaining quality.&lt;/p&gt;

&lt;p&gt;Conclusion&lt;/p&gt;

&lt;p&gt;This course broke my brain in the best way. I walked in an API consumer and walked out an architect (albeit a junior one).&lt;/p&gt;

&lt;p&gt;My next steps:&lt;/p&gt;

&lt;p&gt;Fix the 14 TODOs I left in the notebook.&lt;/p&gt;

&lt;p&gt;Give the agent persistent memory so it stops forgetting valid sources.&lt;/p&gt;

&lt;p&gt;Figure out if the Critic needs its own Critic, or if that's just madness.&lt;/p&gt;

&lt;p&gt;Huge thanks to the Google and Kaggle team for designing this course. If anyone wants to roast my code or suggest improvements, I'm &lt;a class="mentioned-user" href="https://dev.to/kaustav_chowdhury_f3cdc47"&gt;@kaustav_chowdhury_f3cdc47&lt;/a&gt; . I can take it. Probably.&lt;/p&gt;

</description>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>agents</category>
      <category>devchallenge</category>
    </item>
  </channel>
</rss>
