<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Diggi</title>
    <description>The latest articles on DEV Community by Diggi (@digdep).</description>
    <link>https://dev.to/digdep</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3039704%2F09893434-347f-4841-8c77-7fd5bd6983aa.png</url>
      <title>DEV Community: Diggi</title>
      <link>https://dev.to/digdep</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/digdep"/>
    <language>en</language>
    <item>
      <title>How We Rebuilt 500+ Product Pages Using AI, Pipelines, and a Modular Content Backend</title>
      <dc:creator>Diggi</dc:creator>
      <pubDate>Sun, 20 Apr 2025 20:25:52 +0000</pubDate>
      <link>https://dev.to/digdep/how-we-rebuilt-500-product-pages-using-ai-pipelines-and-a-modular-content-backend-5am2</link>
      <guid>https://dev.to/digdep/how-we-rebuilt-500-product-pages-using-ai-pipelines-and-a-modular-content-backend-5am2</guid>
      <description>&lt;p&gt;At &lt;a href="https://digdep.com" rel="noopener noreferrer"&gt;Digdep&lt;/a&gt;, our goal is to help people find supplements that actually work — not just by claims, but by scientific research and user-reported outcomes.&lt;/p&gt;

&lt;p&gt;The catch? We had over 30000+ product-condition combinations (e.g. Vitamin A for acne, Omega 3 for ADHD) and needed to generate trustworthy, dynamic, evolving pages — without hiring a hundred content writers.&lt;/p&gt;

&lt;p&gt;So we did what any backend-leaning team would do:&lt;/p&gt;

&lt;p&gt;We built a pipeline-first, AI-assisted content system, structured around research data, user reviews, and intent-based modules.&lt;/p&gt;

&lt;p&gt;🧱 Architecture Overview&lt;br&gt;
We split the problem into three systems:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Content Orchestration Layer
A scheduled ETL engine (Airflow + custom workers) that:&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Fetches new research data from PubMed, clinical trial APIs, and internal annotations&lt;/p&gt;

&lt;p&gt;Pulls structured review data from reputable sellers.&lt;/p&gt;

&lt;p&gt;Normalizes supplement metadata (dosage, source, purity, etc.)&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;ML/NLP Layer
This is where the raw data gets meaning:&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Clinical research is chunked, embedded (SBERT), and summarized using a hybrid of GPT-4 + in-house fine-tuned classifiers&lt;/p&gt;

&lt;p&gt;Reviews are clustered by condition + sentiment, scored, and tagged (e.g. “2-week results”, “used with zinc”)&lt;/p&gt;

&lt;p&gt;FAQ candidates are extracted from natural language queries, Reddit, Quora, and Digdep’s internal search logs&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Headless CMS + API Delivery
The processed content lives in a GraphQL-accessible store (we use Strapi but heavily extended)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Each page is assembled dynamically on the frontend via metadata-driven composition: which sections to show, what order, how they’re prioritized&lt;/p&gt;

&lt;p&gt;Content updates are non-destructive and versioned — users get fresh insights without pages losing their SEO/indexing&lt;/p&gt;

&lt;p&gt;🧠 AI Where It Makes Sense&lt;br&gt;
We were careful not to overfit with LLMs. Here’s how we actually use them:&lt;/p&gt;

&lt;p&gt;Summarization: Input = abstract + result + cohort size; Output = 2-sentence result with risk qualifiers&lt;/p&gt;

&lt;p&gt;Semantic clustering: We embed every user review and map it into symptom categories and conditions (some users don’t say “acne” — they say “skin bumps”)&lt;/p&gt;

&lt;p&gt;Question synthesis: LLMs turn query logs into human-readable FAQs, then we pass them through filters for duplication, bias, and hallucination&lt;/p&gt;

&lt;p&gt;We built a confidence scoring layer to decide when to show or suppress LLM output. If the model’s not sure, it defers to rules or hides the result.&lt;/p&gt;

&lt;p&gt;📦 How Pages Are Built&lt;br&gt;
Each product page is made of composable modules, injected via API:&lt;/p&gt;

&lt;p&gt; from the ML pipeline&lt;/p&gt;

&lt;p&gt; from review tagging&lt;/p&gt;

&lt;p&gt; from research weighting&lt;/p&gt;

&lt;p&gt; generated dynamically&lt;/p&gt;

&lt;p&gt; based on co-purchase graph&lt;/p&gt;

&lt;p&gt;The backend controls what renders, and the frontend just assembles.&lt;/p&gt;

&lt;p&gt;We also exposed a JSON manifest for each page so QA/devs can debug pipeline decisions without inspecting raw DB rows.&lt;/p&gt;

&lt;p&gt;📊 Feedback Loops&lt;br&gt;
This system let us do things we couldn’t before:&lt;/p&gt;

&lt;p&gt;Trigger model re-training when new research changes a supplement’s score&lt;/p&gt;

&lt;p&gt;Use search and review logs to automatically discover emerging use-cases (e.g. berberine + PCOS suddenly rising)&lt;/p&gt;

&lt;p&gt;Log anonymized click paths to see which modules drive trust, then tune the page structure accordingly&lt;/p&gt;

&lt;p&gt;🚀 Results &amp;amp; Takeaways&lt;br&gt;
We scaled to thousands of pages within 2 weeks without bottlenecks&lt;/p&gt;

&lt;p&gt;Pages adapt over time as new data/reviews/research arrives&lt;/p&gt;

&lt;p&gt;Everything is traceable, explainable, and testable — no “black box content”&lt;/p&gt;

&lt;p&gt;If you’re building content at scale in a high-trust domain (health, legal, finance), structured pipelines + LLM-assisted augmentation is a sweet spot. It’s not sexy, but it’s robust.&lt;/p&gt;

&lt;p&gt;💬 Curious how we handle edge cases (e.g. conflicting research, multi-supplement effects), cold-start products, or data validation? Drop a question below — always happy to nerd out.&lt;/p&gt;

</description>
    </item>
    <item>
      <title>Building DigDep.com: A Dev’s Quest to Open Source Supplement Science</title>
      <dc:creator>Diggi</dc:creator>
      <pubDate>Fri, 11 Apr 2025 00:47:26 +0000</pubDate>
      <link>https://dev.to/digdep/building-digdepcom-a-devs-quest-to-open-source-supplement-science-4eh1</link>
      <guid>https://dev.to/digdep/building-digdepcom-a-devs-quest-to-open-source-supplement-science-4eh1</guid>
      <description>&lt;p&gt;If you’ve ever searched for “best supplements for arthritis” or tried to decode ingredient lists on health blogs, you’ve probably landed on Examine.com or Healthline-style articles. They’re useful—but often limited by slow updates, paywalls, or one-size-fits-all summaries.&lt;/p&gt;

&lt;p&gt;That’s exactly the problem I’m trying to solve with DigDep.com — a developer-led project to map supplement products directly to clinical research, using AI pipelines and transparent data logic.&lt;/p&gt;

&lt;p&gt;🧪 From Ingredients to Research in One Click&lt;br&gt;
Take this for example:&lt;br&gt;
&lt;a href="https://digdep.com/listing/now-glucosamine-chondroitin-300mg-capsules/arthritis/" rel="noopener noreferrer"&gt;NOW Supplements, Glucosamine &amp;amp; Chondroitin with MSM – Joint Health &amp;amp; Comfort&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;On that page, you’ll find:&lt;/p&gt;

&lt;p&gt;A list of relevant clinical trials on glucosamine, chondroitin, and MSM&lt;/p&gt;

&lt;p&gt;Direct citations to PubMed and other research databases&lt;/p&gt;

&lt;p&gt;A breakdown of which studies link the supplement to outcomes like reduced joint pain or improved mobility&lt;/p&gt;

&lt;p&gt;User reviews, so you can contrast anecdotal experiences with peer-reviewed findings&lt;/p&gt;

&lt;p&gt;It’s not just a product page — it’s a research navigator with structured science behind it.&lt;/p&gt;

&lt;p&gt;🤖 The AI Behind It&lt;br&gt;
I use a multi-model LLM pipeline to parse research papers, identify connections between ingredients and outcomes (like “arthritis relief”), and then validate those connections with human-like accuracy.&lt;/p&gt;

&lt;p&gt;The Stack (Simplified):&lt;br&gt;
Discovery: Lightweight open models scan abstracts for substance–outcome–dosage signals&lt;/p&gt;

&lt;p&gt;Validation: GPT-4 or Claude reviews excerpts to eliminate false positives&lt;/p&gt;

&lt;p&gt;Summary Matching: A final model cross-references the claim against the research excerpt&lt;/p&gt;

&lt;p&gt;All this data is normalized across thousands of entries, so users can go from health goal → compound → product, or the other way around.&lt;/p&gt;

&lt;p&gt;🧠 Why Not Just Use Examine?&lt;br&gt;
Because Examine doesn’t link to actual products, and doesn’t let you filter for clinical evidence per product.&lt;br&gt;
DigDep does.&lt;/p&gt;

&lt;p&gt;Also:&lt;/p&gt;

&lt;p&gt;Examine is paywalled; DigDep is free&lt;/p&gt;

&lt;p&gt;Examine is slow to update; DigDep refreshes regularly via automation&lt;/p&gt;

&lt;p&gt;Examine doesn’t map individual supplements to reviews and research; DigDep is built for it&lt;/p&gt;

&lt;p&gt;And as developers, we can appreciate when a system is built modularly, using pipelines that evolve as the models get smarter.&lt;/p&gt;

&lt;p&gt;🧱 It’s a Work in Progress, but Already Useful&lt;br&gt;
So far, I’ve indexed:&lt;/p&gt;

&lt;p&gt;20,000+ research papers&lt;/p&gt;

&lt;p&gt;Hundreds of common health outcomes (e.g. arthritis, anxiety, weight loss, ADHD)&lt;/p&gt;

&lt;p&gt;5,000+ supplements, matched by ingredients and dose&lt;/p&gt;

&lt;p&gt;Each listing gets smarter as new research is added. The ultimate goal?&lt;br&gt;
To make DigDep the most trusted and usable research-backed supplement directory out there.&lt;/p&gt;

&lt;p&gt;💬 Try It and Tell Me What’s Missing&lt;br&gt;
Here’s that example again:&lt;br&gt;
&lt;a href="https://digdep.com/listing/now-glucosamine-chondroitin-300mg-capsules/arthritis/" rel="noopener noreferrer"&gt;NOW Glucosamine &amp;amp; Chondroitin – Arthritis Research &amp;amp; Reviews&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;If you're into LLM applications, health tech, or just curious about turning messy biomedical data into structured, navigable knowledge — I’d love feedback or ideas.&lt;/p&gt;

&lt;p&gt;This is open-source in spirit (and maybe soon in code too). If you'd like to collaborate, critique, or just discuss model design — hit me up.&lt;/p&gt;

&lt;p&gt;Thanks for reading 🙌&lt;/p&gt;

</description>
      <category>startup</category>
      <category>ai</category>
      <category>llm</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
