<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: AYUSH SINGH</title>
    <description>The latest articles on DEV Community by AYUSH SINGH (@ayush_singh_cc755d41995c8).</description>
    <link>https://dev.to/ayush_singh_cc755d41995c8</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3998105%2F86454144-8ba4-472c-83f0-f4e3a5a0c62b.jpeg</url>
      <title>DEV Community: AYUSH SINGH</title>
      <link>https://dev.to/ayush_singh_cc755d41995c8</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/ayush_singh_cc755d41995c8"/>
    <language>en</language>
    <item>
      <title>I built a Threat Intelligence RAG System from scratch — here's what actually broke</title>
      <dc:creator>AYUSH SINGH</dc:creator>
      <pubDate>Tue, 23 Jun 2026 06:35:11 +0000</pubDate>
      <link>https://dev.to/ayush_singh_cc755d41995c8/i-built-a-threat-intelligence-rag-system-from-scratch-heres-what-actually-broke-p81</link>
      <guid>https://dev.to/ayush_singh_cc755d41995c8/i-built-a-threat-intelligence-rag-system-from-scratch-heres-what-actually-broke-p81</guid>
      <description>&lt;p&gt;CVE databases are massive. Searching them manually is painful. I wanted to ask plain English questions like "show me all critical RCE vulnerabilities from 2024" and get real answers — so I built a RAG system to do exactly that.&lt;/p&gt;

&lt;p&gt;The stack&lt;/p&gt;

&lt;p&gt;🔹 HuggingFace — embeddings&lt;br&gt;
🔹 FAISS — vector store&lt;br&gt;
🔹 Fully local LLM — no OpenAI costs&lt;br&gt;
🔹 AWS — deployment&lt;br&gt;
What actually broke (and how I fixed it)&lt;/p&gt;

&lt;p&gt;The local LLM hallucinated CVE numbers confidently. FAISS retrieval returned irrelevant chunks when queries were too short. Chunking strategy mattered way more than I expected. I'll walk through each failure and the fix.&lt;/p&gt;

&lt;p&gt;What you'll learn&lt;/p&gt;

&lt;p&gt;How to build a RAG pipeline without relying on OpenAI, why chunking strategy is underrated, common failure modes in local LLMs, and how to deploy the whole thing to AWS.&lt;br&gt;
Full article on Medium 👉 &lt;a href="https://medium.com/p/e9efd48d1799/edit" rel="noopener noreferrer"&gt;https://medium.com/p/e9efd48d1799/edit&lt;/a&gt;&lt;br&gt;
github: &lt;a href="https://github.com/letshck/threat-intelligence-RAG" rel="noopener noreferrer"&gt;https://github.com/letshck/threat-intelligence-RAG&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Building in AI/security? I'd love to connect.&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>rag</category>
      <category>llm</category>
      <category>cybersecurity</category>
    </item>
  </channel>
</rss>
