<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Siddharth Pandey</title>
    <description>The latest articles on DEV Community by Siddharth Pandey (@siddharth_pandey_27).</description>
    <link>https://dev.to/siddharth_pandey_27</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1878784%2Fc5935d13-014b-42fe-a1ab-a9374ff32cbb.jpg</url>
      <title>DEV Community: Siddharth Pandey</title>
      <link>https://dev.to/siddharth_pandey_27</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/siddharth_pandey_27"/>
    <language>en</language>
    <item>
      <title>Why “More Context” Still Doesn’t Fix Infrastructure</title>
      <dc:creator>Siddharth Pandey</dc:creator>
      <pubDate>Fri, 15 May 2026 17:11:03 +0000</pubDate>
      <link>https://dev.to/siddharth_pandey_27/why-more-context-still-doesnt-fix-infrastructure-58nh</link>
      <guid>https://dev.to/siddharth_pandey_27/why-more-context-still-doesnt-fix-infrastructure-58nh</guid>
      <description>&lt;p&gt;A common reaction to AI hallucinating infrastructure is:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Just give it more context.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;More code.&lt;br&gt;
More docs.&lt;br&gt;
More Terraform.&lt;br&gt;
More logs.&lt;br&gt;
More everything.&lt;/p&gt;

&lt;p&gt;It sounds reasonable.&lt;/p&gt;

&lt;p&gt;But it quietly assumes something important:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;infrastructure understanding is a “volume problem”.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It is not.&lt;/p&gt;




&lt;h2&gt;
  
  
  More Context Doesn’t Fix Wrong Structure
&lt;/h2&gt;

&lt;p&gt;Let’s say you give an AI agent access to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Terraform&lt;/li&gt;
&lt;li&gt;CloudFormation&lt;/li&gt;
&lt;li&gt;CDK&lt;/li&gt;
&lt;li&gt;schema definitions&lt;/li&gt;
&lt;li&gt;deployment configs&lt;/li&gt;
&lt;li&gt;API code&lt;/li&gt;
&lt;li&gt;docs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It can now “see more”.&lt;/p&gt;

&lt;p&gt;But infrastructure problems are rarely about missing visibility.&lt;/p&gt;

&lt;p&gt;They are about missing relationships.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which service consumes this queue?&lt;/li&gt;
&lt;li&gt;Which Lambda depends on this table?&lt;/li&gt;
&lt;li&gt;Which schema version is actually deployed?&lt;/li&gt;
&lt;li&gt;Which environment is stale?&lt;/li&gt;
&lt;li&gt;Which index is required by this access pattern?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These are not text retrieval problems.&lt;/p&gt;

&lt;p&gt;These are graph problems.&lt;/p&gt;




&lt;h2&gt;
  
  
  RAG Works on Similarity, Not Truth
&lt;/h2&gt;

&lt;p&gt;RAG retrieves things that are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;semantically similar&lt;/li&gt;
&lt;li&gt;textually relevant&lt;/li&gt;
&lt;li&gt;contextually close&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But infrastructure requires:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;exact state&lt;/li&gt;
&lt;li&gt;exact relationships&lt;/li&gt;
&lt;li&gt;exact deployment reality&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Similarity is not enough when:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;a resource exists in dev but not prod&lt;/li&gt;
&lt;li&gt;a schema changed but code didn’t&lt;/li&gt;
&lt;li&gt;an index was removed but code still assumes it exists&lt;/li&gt;
&lt;li&gt;two services are loosely coupled but heavily dependent&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The system becomes inconsistent very quickly.&lt;/p&gt;

&lt;p&gt;And AI happily reasons over that inconsistency as if it is unified truth.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Real Issue: No Deterministic Infrastructure Graph
&lt;/h2&gt;

&lt;p&gt;What AI actually needs is not just more text.&lt;/p&gt;

&lt;p&gt;It needs a deterministic representation of the system itself:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;services&lt;/li&gt;
&lt;li&gt;dependencies&lt;/li&gt;
&lt;li&gt;data flows&lt;/li&gt;
&lt;li&gt;infrastructure topology&lt;/li&gt;
&lt;li&gt;runtime relationships&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In simple terms:&lt;br&gt;
a graph where nodes are real infrastructure resources and edges are verified relationships between them — not assumptions inferred from code similarity.&lt;/p&gt;

&lt;p&gt;Without that, the model is still guessing.&lt;/p&gt;

&lt;p&gt;Even if it has access to your entire repository.&lt;/p&gt;

&lt;p&gt;Because repos do not fully represent production systems.&lt;/p&gt;

&lt;p&gt;Production systems are behavior.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Is Where Things Start Breaking Quietly
&lt;/h2&gt;

&lt;p&gt;Most infra failures don’t come from obvious mistakes.&lt;/p&gt;

&lt;p&gt;They come from:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;outdated assumptions&lt;/li&gt;
&lt;li&gt;invisible dependencies&lt;/li&gt;
&lt;li&gt;forgotten services&lt;/li&gt;
&lt;li&gt;implicit coupling&lt;/li&gt;
&lt;li&gt;undocumented flows&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI systems are especially vulnerable here because they treat code as the system.&lt;/p&gt;

&lt;p&gt;But production systems are not code.&lt;/p&gt;

&lt;p&gt;They are behavior.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Is the Gap I Keep Thinking About
&lt;/h2&gt;

&lt;p&gt;This is the gap that pushed me toward building &lt;a href="https://www.npmjs.com/package/infrawise" rel="noopener noreferrer"&gt;Infrawise&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The goal is not just:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“give AI more context.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It is to make infrastructure relationships structurally understandable for AI systems.&lt;/p&gt;

&lt;p&gt;Things like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;real schema relationships&lt;/li&gt;
&lt;li&gt;actual index existence&lt;/li&gt;
&lt;li&gt;service dependencies&lt;/li&gt;
&lt;li&gt;infrastructure topology&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Derived from infrastructure state and system analysis — not guessed purely from source code patterns.&lt;/p&gt;




&lt;h2&gt;
  
  
  Closing
&lt;/h2&gt;

&lt;p&gt;A lot of current AI tooling assumes that if the model sees enough files, eventually it will understand the system.&lt;/p&gt;

&lt;p&gt;I don’t think that is true.&lt;/p&gt;

&lt;p&gt;Because infrastructure problems are fundamentally relationship problems.&lt;/p&gt;

&lt;p&gt;Not autocomplete problems.&lt;/p&gt;

&lt;p&gt;Not retrieval problems.&lt;/p&gt;

&lt;p&gt;And definitely not “just increase the context window” problems.&lt;/p&gt;

&lt;p&gt;Current AI coding assistants are already very good at understanding source code.&lt;/p&gt;

&lt;p&gt;The next generation will need to understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;infrastructure topology&lt;/li&gt;
&lt;li&gt;operational relationships&lt;/li&gt;
&lt;li&gt;deployment reality&lt;/li&gt;
&lt;li&gt;system behavior&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Because repos do not fully represent production systems.&lt;/p&gt;

&lt;p&gt;Production systems are behavior.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>infrastructure</category>
      <category>mcp</category>
    </item>
    <item>
      <title>AI Generated a DynamoDB Query That Could Never Work</title>
      <dc:creator>Siddharth Pandey</dc:creator>
      <pubDate>Wed, 13 May 2026 06:34:02 +0000</pubDate>
      <link>https://dev.to/siddharth_pandey_27/ai-generated-a-dynamodb-query-that-could-never-work-53h9</link>
      <guid>https://dev.to/siddharth_pandey_27/ai-generated-a-dynamodb-query-that-could-never-work-53h9</guid>
      <description>&lt;p&gt;I asked an AI coding assistant to generate a DynamoDB query.&lt;/p&gt;

&lt;p&gt;A few seconds later it confidently gave me this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight typescript"&gt;&lt;code&gt;&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;result&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;dynamo&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;query&lt;/span&gt;&lt;span class="p"&gt;({&lt;/span&gt;
  &lt;span class="na"&gt;TableName&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;Orders&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="na"&gt;IndexName&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;customerId-createdAt-index&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="na"&gt;KeyConditionExpression&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;customerId = :customerId&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
&lt;span class="p"&gt;});&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Looks clean.&lt;/p&gt;

&lt;p&gt;Looks professional.&lt;/p&gt;

&lt;p&gt;Looks like something a senior backend engineer would casually approve during code review while fighting for survival in their 14th Slack thread of the day.&lt;/p&gt;

&lt;p&gt;Tiny issue though.&lt;/p&gt;

&lt;p&gt;The index did not exist.&lt;/p&gt;

&lt;p&gt;And honestly, this is becoming one of the biggest problems with AI coding tools:&lt;br&gt;
they are extremely good at generating things that &lt;em&gt;look correct&lt;/em&gt;.&lt;/p&gt;




&lt;h2&gt;
  
  
  The AI Wasn't Being Stupid
&lt;/h2&gt;

&lt;p&gt;That’s the interesting part.&lt;/p&gt;

&lt;p&gt;The assistant actually made a pretty reasonable guess.&lt;/p&gt;

&lt;p&gt;It saw:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DynamoDB usage&lt;/li&gt;
&lt;li&gt;query patterns&lt;/li&gt;
&lt;li&gt;naming conventions&lt;/li&gt;
&lt;li&gt;nearby code&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…and predicted what &lt;em&gt;probably&lt;/em&gt; existed.&lt;/p&gt;

&lt;p&gt;Which is exactly what LLMs are designed to do.&lt;/p&gt;

&lt;p&gt;The problem is:&lt;br&gt;
production infrastructure is not based on probability.&lt;/p&gt;

&lt;p&gt;Either the GSI exists or it does not.&lt;/p&gt;

&lt;p&gt;AWS is unfortunately not very emotionally supportive about this distinction.&lt;/p&gt;




&lt;h2&gt;
  
  
  Infrastructure Is Not Just "More Context"
&lt;/h2&gt;

&lt;p&gt;A lot of AI tooling conversations eventually turn into:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“We just need better RAG.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Or:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“We just need larger context windows.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That helps.&lt;/p&gt;

&lt;p&gt;But I think there’s a deeper issue here.&lt;/p&gt;

&lt;p&gt;Because infrastructure is not just information.&lt;/p&gt;

&lt;p&gt;It’s relationships.&lt;/p&gt;

&lt;p&gt;The assistant might read:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Terraform&lt;/li&gt;
&lt;li&gt;schema files&lt;/li&gt;
&lt;li&gt;docs&lt;/li&gt;
&lt;li&gt;migrations&lt;/li&gt;
&lt;li&gt;configs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;…and still not reliably understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;which indexes are actually deployed&lt;/li&gt;
&lt;li&gt;which environments differ&lt;/li&gt;
&lt;li&gt;which access patterns are safe&lt;/li&gt;
&lt;li&gt;which infrastructure assumptions are outdated&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That’s not just a retrieval problem.&lt;/p&gt;

&lt;p&gt;That’s a system understanding problem.&lt;/p&gt;




&lt;h2&gt;
  
  
  DynamoDB Makes This Very Obvious
&lt;/h2&gt;

&lt;p&gt;DynamoDB is actually a great example here because it is brutally honest about access patterns.&lt;/p&gt;

&lt;p&gt;With SQL databases, developers sometimes get away with questionable decisions for a while.&lt;/p&gt;

&lt;p&gt;DynamoDB basically says:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“No index? That sounds like a you problem.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;And honestly, fair enough.&lt;/p&gt;

&lt;p&gt;The entire database is designed around:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;partition strategy&lt;/li&gt;
&lt;li&gt;access patterns&lt;/li&gt;
&lt;li&gt;deliberate schema design&lt;/li&gt;
&lt;li&gt;predictable query paths&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Which means an AI assistant cannot just generate random “probably correct” queries and hope reality cooperates.&lt;/p&gt;

&lt;p&gt;The query either aligns with infrastructure design or it does not.&lt;/p&gt;

&lt;p&gt;There’s not much middle ground.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Dangerous Part Is Confidence
&lt;/h2&gt;

&lt;p&gt;What makes these failures tricky is that the generated code often looks completely believable.&lt;/p&gt;

&lt;p&gt;No syntax errors.&lt;br&gt;
No obvious red flags.&lt;br&gt;
No broken TypeScript.&lt;/p&gt;

&lt;p&gt;Just confident infrastructure hallucinations.&lt;/p&gt;

&lt;p&gt;And that’s much harder for engineers to detect quickly.&lt;/p&gt;

&lt;p&gt;Especially in large systems where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;nobody remembers every index&lt;/li&gt;
&lt;li&gt;environments drift over time&lt;/li&gt;
&lt;li&gt;schemas evolve constantly&lt;/li&gt;
&lt;li&gt;infrastructure knowledge is fragmented across teams&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Every company has at least one cloud resource held together entirely by:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;historical accidents&lt;/li&gt;
&lt;li&gt;undocumented assumptions&lt;/li&gt;
&lt;li&gt;and collective organizational fear&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;AI assistants walk directly into these systems with the confidence of somebody who read half the README and decided they understand the architecture.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Is The Gap I Keep Thinking About
&lt;/h2&gt;

&lt;p&gt;Most AI coding assistants today are optimized for:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;syntax&lt;/li&gt;
&lt;li&gt;implementation patterns&lt;/li&gt;
&lt;li&gt;framework familiarity&lt;/li&gt;
&lt;li&gt;autocomplete quality&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But production systems need something else:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;operational awareness&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The assistant needs deterministic understanding of:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;infrastructure topology&lt;/li&gt;
&lt;li&gt;indexes&lt;/li&gt;
&lt;li&gt;schema relationships&lt;/li&gt;
&lt;li&gt;runtime dependencies&lt;/li&gt;
&lt;li&gt;deployment reality&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Otherwise it is just making educated guesses about systems that may cost thousands of dollars per mistake.&lt;/p&gt;

&lt;p&gt;That feels slightly important.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Is One of the Reasons I Started Building Infrawise
&lt;/h2&gt;

&lt;p&gt;This problem is one of the reasons I started working on opensource project &lt;a href="https://github.com/Sidd27/infrawise" rel="noopener noreferrer"&gt;Infrawise&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The idea is simple:&lt;/p&gt;

&lt;p&gt;Instead of forcing AI assistants to infer infrastructure from scattered code and configs, provide deterministic infrastructure context directly.&lt;/p&gt;

&lt;p&gt;Things like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DynamoDB index awareness&lt;/li&gt;
&lt;li&gt;schema relationships&lt;/li&gt;
&lt;li&gt;infrastructure mapping&lt;/li&gt;
&lt;li&gt;static analysis&lt;/li&gt;
&lt;li&gt;AI-consumable infrastructure context&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not “AI magic.”&lt;/p&gt;

&lt;p&gt;Just explicit system understanding.&lt;/p&gt;

&lt;p&gt;Because hallucinated code is annoying.&lt;/p&gt;

&lt;p&gt;Hallucinated infrastructure is how people accidentally discover entirely new AWS billing experiences.&lt;/p&gt;




&lt;h2&gt;
  
  
  AI Needs To Understand Systems, Not Just Code
&lt;/h2&gt;

&lt;p&gt;Current AI coding assistants are already very good at understanding source code.&lt;/p&gt;

&lt;p&gt;The next generation will need to understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architecture&lt;/li&gt;
&lt;li&gt;topology&lt;/li&gt;
&lt;li&gt;infrastructure relationships&lt;/li&gt;
&lt;li&gt;operational constraints&lt;/li&gt;
&lt;li&gt;deployed reality&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Because software does not actually run inside VS Code.&lt;/p&gt;

&lt;p&gt;It runs inside infrastructure.&lt;/p&gt;

&lt;p&gt;And infrastructure is where “probably correct” stops being good enough.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>devops</category>
      <category>architecture</category>
    </item>
    <item>
      <title>Why RAG Alone Cannot Understand Infrastructure</title>
      <dc:creator>Siddharth Pandey</dc:creator>
      <pubDate>Mon, 11 May 2026 12:04:13 +0000</pubDate>
      <link>https://dev.to/siddharth_pandey_27/why-rag-alone-cannot-understand-infrastructure-jfp</link>
      <guid>https://dev.to/siddharth_pandey_27/why-rag-alone-cannot-understand-infrastructure-jfp</guid>
      <description>&lt;p&gt;Every AI tooling discussion eventually reaches the same sentence:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Just use RAG.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;At this point, RAG is basically the duct tape of AI architecture.&lt;/p&gt;

&lt;p&gt;Missing context?&lt;br&gt;
RAG.&lt;/p&gt;

&lt;p&gt;Bad answers?&lt;br&gt;
RAG.&lt;/p&gt;

&lt;p&gt;AI hallucinating production infrastructure?&lt;br&gt;
Apparently also RAG.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;To be clear&lt;/strong&gt;: Retrieval-Augmented Generation is genuinely useful.&lt;/p&gt;

&lt;p&gt;It improves grounding.&lt;br&gt;
It reduces hallucinations.&lt;br&gt;
It helps AI systems access external knowledge.&lt;/p&gt;

&lt;p&gt;But there’s a growing misconception in AI engineering right now:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;More retrieval does not automatically create system understanding.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;And infrastructure problems are usually &lt;em&gt;system understanding&lt;/em&gt; problems.&lt;/p&gt;


&lt;h2&gt;
  
  
  The Problem Is Not Missing Text
&lt;/h2&gt;

&lt;p&gt;Most infrastructure failures in AI coding assistants are not happening because documentation is missing.&lt;/p&gt;

&lt;p&gt;The problem is that infrastructure knowledge is relational, fragmented, and operational.&lt;/p&gt;

&lt;p&gt;For example, imagine asking an AI assistant:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Can this DynamoDB access pattern use an existing index?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That answer depends on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;actual GSIs&lt;/li&gt;
&lt;li&gt;partition keys&lt;/li&gt;
&lt;li&gt;sort keys&lt;/li&gt;
&lt;li&gt;deployed schema state&lt;/li&gt;
&lt;li&gt;access patterns&lt;/li&gt;
&lt;li&gt;environment-specific infrastructure&lt;/li&gt;
&lt;li&gt;workload constraints&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is not just a “retrieve the right paragraph” problem.&lt;/p&gt;

&lt;p&gt;A vector database cannot magically infer operational relationships from semantically similar chunks of text.&lt;/p&gt;

&lt;p&gt;Infrastructure is not a Wikipedia article.&lt;/p&gt;

&lt;p&gt;It is a topology.&lt;/p&gt;


&lt;h2&gt;
  
  
  The “Looks Correct” Problem
&lt;/h2&gt;

&lt;p&gt;This is where AI gets dangerous.&lt;/p&gt;

&lt;p&gt;The generated output often looks completely reasonable.&lt;/p&gt;

&lt;p&gt;Example:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight typescript"&gt;&lt;code&gt;&lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;ordersTable&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;query&lt;/span&gt;&lt;span class="p"&gt;({&lt;/span&gt;
  &lt;span class="na"&gt;IndexName&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;customerId-createdAt-index&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
&lt;span class="p"&gt;});&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Looks valid.&lt;/p&gt;

&lt;p&gt;Compiles.&lt;/p&gt;

&lt;p&gt;Very professional-looking.&lt;/p&gt;

&lt;p&gt;Tiny problem: &lt;strong&gt;the index does not exist&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;The AI retrieved enough surrounding context to generate something statistically plausible.&lt;/p&gt;

&lt;p&gt;But plausible infrastructure is not the same thing as real infrastructure.&lt;/p&gt;

&lt;p&gt;This is the core limitation of relying entirely on semantic retrieval.&lt;/p&gt;

&lt;p&gt;RAG helps models retrieve information.&lt;/p&gt;

&lt;p&gt;Infrastructure awareness requires understanding relationships.&lt;/p&gt;

&lt;p&gt;Those are different problems.&lt;/p&gt;




&lt;h2&gt;
  
  
  Infrastructure Is a Graph Problem
&lt;/h2&gt;

&lt;p&gt;Modern systems are deeply interconnected.&lt;/p&gt;

&lt;p&gt;A single API endpoint might depend on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DynamoDB tables&lt;/li&gt;
&lt;li&gt;GSIs&lt;/li&gt;
&lt;li&gt;SQS queues&lt;/li&gt;
&lt;li&gt;Lambda functions&lt;/li&gt;
&lt;li&gt;deployment environments&lt;/li&gt;
&lt;li&gt;feature flags&lt;/li&gt;
&lt;li&gt;event consumers&lt;/li&gt;
&lt;li&gt;schema versions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Now imagine asking:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“What breaks if I change this schema?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That is not a chunk retrieval problem.&lt;/p&gt;

&lt;p&gt;That is dependency analysis.&lt;/p&gt;

&lt;p&gt;The AI needs to understand:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;who consumes the schema&lt;/li&gt;
&lt;li&gt;where it is used&lt;/li&gt;
&lt;li&gt;which systems depend on it&lt;/li&gt;
&lt;li&gt;whether environments differ&lt;/li&gt;
&lt;li&gt;whether migrations already exist&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Most RAG systems fundamentally operate on semantic similarity.&lt;/p&gt;

&lt;p&gt;Infrastructure problems often require deterministic relationship mapping.&lt;/p&gt;

&lt;p&gt;Very different category of problem.&lt;/p&gt;




&lt;h2&gt;
  
  
  Bigger Context Windows Do Not Solve This Either
&lt;/h2&gt;

&lt;p&gt;A lot of people assume larger context windows will eventually solve infrastructure awareness.&lt;/p&gt;

&lt;p&gt;I do not think that is true.&lt;/p&gt;

&lt;p&gt;A 2 million token context window filled with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Terraform&lt;/li&gt;
&lt;li&gt;CloudFormation&lt;/li&gt;
&lt;li&gt;CDK&lt;/li&gt;
&lt;li&gt;schemas&lt;/li&gt;
&lt;li&gt;docs&lt;/li&gt;
&lt;li&gt;configs&lt;/li&gt;
&lt;li&gt;deployment files&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;still does not automatically produce reliable topology understanding.&lt;/p&gt;

&lt;p&gt;Because the issue is not just visibility.&lt;/p&gt;

&lt;p&gt;It is interpretation.&lt;/p&gt;

&lt;p&gt;Even humans struggle with this.&lt;/p&gt;

&lt;p&gt;Every engineering organization has:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;infrastructure nobody wants to touch&lt;/li&gt;
&lt;li&gt;queues nobody fully understands&lt;/li&gt;
&lt;li&gt;“temporary” resources that became permanent&lt;/li&gt;
&lt;li&gt;databases held together by historical accidents and organizational fear&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Giving AI more tokens to read does not automatically create operational intelligence.&lt;/p&gt;

&lt;p&gt;Otherwise every engineer who read all the docs would already fully understand production.&lt;/p&gt;

&lt;p&gt;Which is obviously not how reality works.&lt;/p&gt;




&lt;h2&gt;
  
  
  Semantic Similarity vs Deterministic Context
&lt;/h2&gt;

&lt;p&gt;This distinction matters a lot.&lt;/p&gt;

&lt;p&gt;RAG systems answer questions like:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“What information is semantically related to this query?”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Infrastructure-aware systems need to answer questions like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which indexes actually exist?&lt;/li&gt;
&lt;li&gt;Which services consume this queue?&lt;/li&gt;
&lt;li&gt;Which schema version is deployed?&lt;/li&gt;
&lt;li&gt;Which environment contains this resource?&lt;/li&gt;
&lt;li&gt;Which APIs depend on this table?&lt;/li&gt;
&lt;li&gt;Which resources are connected?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Those answers should not come from probabilistic guessing.&lt;/p&gt;

&lt;p&gt;They should come from deterministic infrastructure context.&lt;/p&gt;

&lt;p&gt;That is the missing layer most AI coding systems still lack.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Is One of the Reasons I Started Building Infrawise
&lt;/h2&gt;

&lt;p&gt;One of the reasons I started working on &lt;a href="https://github.com/Sidd27/infrawise" rel="noopener noreferrer"&gt;Infrawise&lt;/a&gt; was this exact gap.&lt;/p&gt;

&lt;p&gt;The goal is not:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“put infrastructure docs into embeddings.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The goal is to make infrastructure relationships explicit enough that AI systems stop guessing.&lt;/p&gt;

&lt;p&gt;Things like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DynamoDB index awareness&lt;/li&gt;
&lt;li&gt;schema relationships&lt;/li&gt;
&lt;li&gt;infrastructure mapping&lt;/li&gt;
&lt;li&gt;static analysis&lt;/li&gt;
&lt;li&gt;AI-consumable infrastructure context&lt;/li&gt;
&lt;li&gt;deterministic extraction pipelines&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not just “more retrieval.”&lt;/p&gt;

&lt;p&gt;Because honestly, some infrastructure mistakes are too expensive to leave to semantic similarity.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Future Is Not Just Better Retrieval
&lt;/h2&gt;

&lt;p&gt;RAG is useful.&lt;/p&gt;

&lt;p&gt;It will absolutely remain part of AI systems.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;But infrastructure-aware AI requires more than retrieval.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;It requires:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;topology understanding&lt;/li&gt;
&lt;li&gt;dependency mapping&lt;/li&gt;
&lt;li&gt;deterministic infrastructure state&lt;/li&gt;
&lt;li&gt;operational context&lt;/li&gt;
&lt;li&gt;relationship awareness&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The next generation of AI coding assistants will not just retrieve infrastructure knowledge.&lt;/p&gt;

&lt;p&gt;They will need to understand infrastructure reality.&lt;/p&gt;

&lt;p&gt;And that is a much harder problem than embedding documents into a vector database.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>architecture</category>
      <category>infrastructure</category>
      <category>rag</category>
    </item>
    <item>
      <title>Your AI Assistant Knows Your Code But Not Your Architecture</title>
      <dc:creator>Siddharth Pandey</dc:creator>
      <pubDate>Sun, 10 May 2026 11:03:22 +0000</pubDate>
      <link>https://dev.to/siddharth_pandey_27/your-ai-assistant-knows-your-code-but-not-your-architecture-2037</link>
      <guid>https://dev.to/siddharth_pandey_27/your-ai-assistant-knows-your-code-but-not-your-architecture-2037</guid>
      <description>&lt;p&gt;AI coding assistants are getting dangerously good.&lt;/p&gt;

&lt;p&gt;They can refactor code, generate APIs, write SQL queries, explain regex you wrote at 2 AM during a production incident, and confidently suggest changes that &lt;em&gt;look&lt;/em&gt; correct.&lt;/p&gt;

&lt;p&gt;And that’s exactly the problem.&lt;/p&gt;

&lt;p&gt;Because your AI assistant knows your code.&lt;/p&gt;

&lt;p&gt;It does &lt;strong&gt;not&lt;/strong&gt; know your architecture.&lt;/p&gt;

&lt;p&gt;There’s a huge difference between:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“I read your files”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;and&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“I understand your system.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Right now, most AI tools are basically that one engineer who joined last week, skimmed the repo for 20 minutes, and immediately started suggesting database changes in production.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Confidence Problem
&lt;/h2&gt;

&lt;p&gt;Let’s say you ask your AI assistant:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“Generate a DynamoDB query for customer orders.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;It happily gives you this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight typescript"&gt;&lt;code&gt;&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;result&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;dynamo&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;query&lt;/span&gt;&lt;span class="p"&gt;({&lt;/span&gt;
  &lt;span class="na"&gt;TableName&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;Orders&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="na"&gt;IndexName&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;customerId-createdAt-index&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
  &lt;span class="na"&gt;KeyConditionExpression&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="s2"&gt;customerId = :customerId&lt;/span&gt;&lt;span class="dl"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
&lt;span class="p"&gt;});&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Looks clean.&lt;/p&gt;

&lt;p&gt;Very professional.&lt;/p&gt;

&lt;p&gt;Tiny issue though.&lt;/p&gt;

&lt;p&gt;That GSI does not exist.&lt;/p&gt;

&lt;p&gt;The assistant saw patterns in code and statistically guessed what &lt;em&gt;probably&lt;/em&gt; exists.&lt;/p&gt;

&lt;p&gt;Which is honestly terrifying when you think about it.&lt;/p&gt;

&lt;p&gt;Because infrastructure is not autocomplete-friendly.&lt;/p&gt;




&lt;h2&gt;
  
  
  Your Architecture Lives Outside the Code
&lt;/h2&gt;

&lt;p&gt;Modern systems are scattered across:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Terraform&lt;/li&gt;
&lt;li&gt;CloudFormation&lt;/li&gt;
&lt;li&gt;CDK&lt;/li&gt;
&lt;li&gt;databases&lt;/li&gt;
&lt;li&gt;queues&lt;/li&gt;
&lt;li&gt;topics&lt;/li&gt;
&lt;li&gt;IAM policies&lt;/li&gt;
&lt;li&gt;deployment pipelines&lt;/li&gt;
&lt;li&gt;cloud consoles&lt;/li&gt;
&lt;li&gt;“temporary” scripts from 2022 that somehow became production-critical&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Meanwhile the AI assistant is sitting there reading:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight typescript"&gt;&lt;code&gt;&lt;span class="k"&gt;await&lt;/span&gt; &lt;span class="nx"&gt;ordersTable&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;query&lt;/span&gt;&lt;span class="p"&gt;(...)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;and internally going:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;“I have absolutely no idea what this system actually looks like, but statistically speaking… vibes.”&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That’s how you end up with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;PostgreSQL joins on non-indexed columns&lt;/li&gt;
&lt;li&gt;Event flow assumptions based entirely on naming conventions&lt;/li&gt;
&lt;li&gt;DynamoDB scans quietly disguised as “optimizations”&lt;/li&gt;
&lt;li&gt;APIs generated against schemas that were deleted three migrations ago&lt;/li&gt;
&lt;li&gt;Infrastructure suggestions that look plausible but do not match deployed reality&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The code compiles.&lt;/p&gt;

&lt;p&gt;The architecture cries silently in the background.&lt;/p&gt;




&lt;h2&gt;
  
  
  Inference Is Not Infrastructure Awareness
&lt;/h2&gt;

&lt;p&gt;This is the core issue.&lt;/p&gt;

&lt;p&gt;Today’s AI coding assistants are really good at inference.&lt;/p&gt;

&lt;p&gt;But infrastructure requires deterministic understanding.&lt;/p&gt;

&lt;p&gt;There’s a massive difference between:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;“This probably exists”
and&lt;/li&gt;
&lt;li&gt;“This definitely exists in production”&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Humans already struggle with this.&lt;/p&gt;

&lt;p&gt;Every company has at least one production resource that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;nobody fully understands&lt;/li&gt;
&lt;li&gt;nobody wants to touch&lt;/li&gt;
&lt;li&gt;somehow costs thousands per month&lt;/li&gt;
&lt;li&gt;and is protected by pure organizational fear&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Now imagine an AI confidently modifying systems like that.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why Bigger Context Windows Won’t Solve This
&lt;/h2&gt;

&lt;p&gt;A lot of tooling right now is focused on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;larger context windows&lt;/li&gt;
&lt;li&gt;better embeddings&lt;/li&gt;
&lt;li&gt;repository RAG&lt;/li&gt;
&lt;li&gt;more docs ingestion&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Helpful? Yes.&lt;/p&gt;

&lt;p&gt;Sufficient? No.&lt;/p&gt;

&lt;p&gt;Because infrastructure problems are relationship problems.&lt;/p&gt;

&lt;p&gt;Questions like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which services consume this queue?&lt;/li&gt;
&lt;li&gt;Which indexes support this access pattern?&lt;/li&gt;
&lt;li&gt;Which schemas are actually deployed?&lt;/li&gt;
&lt;li&gt;Which Lambda depends on this topic?&lt;/li&gt;
&lt;li&gt;Which environment even has this resource?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These are topology questions.&lt;/p&gt;

&lt;p&gt;Not autocomplete questions.&lt;/p&gt;

&lt;p&gt;A 2 million token context window still cannot magically understand infrastructure relationships if the information itself is fragmented or ambiguous.&lt;/p&gt;

&lt;p&gt;More tokens do not automatically create system awareness.&lt;/p&gt;




&lt;h2&gt;
  
  
  This Is Why I Started Building Infrawise
&lt;/h2&gt;

&lt;p&gt;This problem is one of the reasons I started working on &lt;a href="https://github.com/Sidd27/infrawise" rel="noopener noreferrer"&gt;Infrawise&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;The idea is simple:&lt;/p&gt;

&lt;p&gt;Instead of forcing AI assistants to &lt;em&gt;guess&lt;/em&gt; infrastructure,&lt;br&gt;
give them deterministic infrastructure context.&lt;/p&gt;

&lt;p&gt;Things like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;DynamoDB index awareness&lt;/li&gt;
&lt;li&gt;schema relationships&lt;/li&gt;
&lt;li&gt;infrastructure mapping&lt;/li&gt;
&lt;li&gt;static analysis&lt;/li&gt;
&lt;li&gt;infrastructure-aware context for AI systems&lt;/li&gt;
&lt;li&gt;MCP integration paths&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not “AI magic.”&lt;/p&gt;

&lt;p&gt;Just explicit system understanding.&lt;/p&gt;

&lt;p&gt;Because honestly, hallucinated code is annoying.&lt;/p&gt;

&lt;p&gt;Hallucinated infrastructure is how people accidentally discover new billing tiers on AWS.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Next Generation of AI Coding Tools
&lt;/h2&gt;

&lt;p&gt;The current generation of AI assistants understands syntax.&lt;/p&gt;

&lt;p&gt;The next generation will need to understand systems.&lt;/p&gt;

&lt;p&gt;Not just:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;functions&lt;/li&gt;
&lt;li&gt;files&lt;/li&gt;
&lt;li&gt;classes&lt;/li&gt;
&lt;li&gt;frameworks&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architecture&lt;/li&gt;
&lt;li&gt;runtime dependencies&lt;/li&gt;
&lt;li&gt;infrastructure topology&lt;/li&gt;
&lt;li&gt;deployment reality&lt;/li&gt;
&lt;li&gt;operational constraints&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Because software does not run inside VS Code.&lt;/p&gt;

&lt;p&gt;It runs inside infrastructure.&lt;/p&gt;

&lt;p&gt;And infrastructure is where bad assumptions become expensive.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>architecture</category>
      <category>typescript</category>
    </item>
  </channel>
</rss>
