<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: BEDANTA CHATTERJEE</title>
    <description>The latest articles on DEV Community by BEDANTA CHATTERJEE (@bedanta_chatterjee_234b49).</description>
    <link>https://dev.to/bedanta_chatterjee_234b49</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3799584%2Fc4651029-9d8b-42d3-8b02-b3a5b3f85095.jpg</url>
      <title>DEV Community: BEDANTA CHATTERJEE</title>
      <link>https://dev.to/bedanta_chatterjee_234b49</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/bedanta_chatterjee_234b49"/>
    <language>en</language>
    <item>
      <title>Showcase: SuperBuilder — an open-source AI platform that scaffolds, builds, and deploys apps</title>
      <dc:creator>BEDANTA CHATTERJEE</dc:creator>
      <pubDate>Fri, 06 Mar 2026 18:17:53 +0000</pubDate>
      <link>https://dev.to/bedanta_chatterjee_234b49/showcase-superbuilder-an-open-source-ai-platform-that-scaffolds-builds-and-deploys-apps-1pbc</link>
      <guid>https://dev.to/bedanta_chatterjee_234b49/showcase-superbuilder-an-open-source-ai-platform-that-scaffolds-builds-and-deploys-apps-1pbc</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3f2u9wg1gufl2yci9xn8.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F3f2u9wg1gufl2yci9xn8.png" alt=" " width="800" height="387"&gt;&lt;/a&gt;# Showcase: SuperBuilder — an open-source AI platform that scaffolds, builds, and deploys apps&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;SuperBuilder&lt;/strong&gt; is an open-source platform that combines autonomous AI agents, a plugin SDK, and ready-to-use starter templates so you can generate, build, test, and deploy apps from prompts.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Demo / Repo: &lt;a href="https://github.com/rupac4530-creator/super-builder-platform" rel="noopener noreferrer"&gt;https://github.com/rupac4530-creator/super-builder-platform&lt;/a&gt;&lt;br&gt;&lt;br&gt;
Live demo: &lt;code&gt;YOUR_DEMO_URL&lt;/code&gt; (if available)&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Why this project?
&lt;/h2&gt;

&lt;p&gt;Building a production app still takes a lot of wiring — scaffold, CI, tests, deploy, and iterate. SuperBuilder tries to automate those repetitive steps with:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Planner, Coder, Tester, and Deploy agents that coordinate work
&lt;/li&gt;
&lt;li&gt;Plugin SDK so the community can add model providers, tools, and deploy targets
&lt;/li&gt;
&lt;li&gt;Starter templates: SaaS, AI chatbot, portfolio, etc.
&lt;/li&gt;
&lt;li&gt;Docker one-command setup to try locally in ~60 seconds
&lt;/li&gt;
&lt;li&gt;A set of curated “good first issues” for newcomers&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It’s released under &lt;strong&gt;AGPL-3.0&lt;/strong&gt; to keep the platform community-first.&lt;/p&gt;




&lt;h2&gt;
  
  
  What you can do right now
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Try locally (60s Docker quickstart)
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;
bash
# clone
git clone https://github.com/rupac4530-creator/super-builder-platform
cd super-builder-platform

# run with docker-compose
docker compose up --build
# open http://localhost:3000
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

</description>
      <category>opensource</category>
      <category>ai</category>
      <category>webdev</category>
      <category>devops</category>
    </item>
    <item>
      <title>I Built a Real-Time Vision AI Agent</title>
      <dc:creator>BEDANTA CHATTERJEE</dc:creator>
      <pubDate>Sun, 01 Mar 2026 09:18:18 +0000</pubDate>
      <link>https://dev.to/bedanta_chatterjee_234b49/i-built-a-real-time-vision-ai-agent-4bod</link>
      <guid>https://dev.to/bedanta_chatterjee_234b49/i-built-a-real-time-vision-ai-agent-4bod</guid>
      <description>&lt;p&gt;Built Vision Agent — realtime multimodal AI that sees, listens &amp;amp; coaches.&lt;br&gt;
Demo ▶️ &lt;a href="https://vision-agent-m23w.onrender.com" rel="noopener noreferrer"&gt;https://vision-agent-m23w.onrender.com&lt;/a&gt;&lt;br&gt;
 · Code ▶️ &lt;a href="https://github.com/rupac4530-creator/vision-agent" rel="noopener noreferrer"&gt;https://github.com/rupac4530-creator/vision-agent&lt;/a&gt;&lt;br&gt;
 · 30s demo ▶️ &lt;a href="https://youtube.com/shorts/2F8jyKPJwTs" rel="noopener noreferrer"&gt;https://youtube.com/shorts/2F8jyKPJwTs&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  VisionPossible #AI #ComputerVision 🙌 Thanks to WeMakeDevs and the Vision-Agents team!
&lt;/h1&gt;

</description>
      <category>agents</category>
      <category>ai</category>
      <category>opensource</category>
      <category>showdev</category>
    </item>
    <item>
      <title>Vision Agent — Real-Time Multimodal AI with YOLO, STT &amp; LLM Cascade</title>
      <dc:creator>BEDANTA CHATTERJEE</dc:creator>
      <pubDate>Sun, 01 Mar 2026 08:38:12 +0000</pubDate>
      <link>https://dev.to/bedanta_chatterjee_234b49/vision-agent-real-time-multimodal-ai-with-yolo-stt-llm-cascade-37d3</link>
      <guid>https://dev.to/bedanta_chatterjee_234b49/vision-agent-real-time-multimodal-ai-with-yolo-stt-llm-cascade-37d3</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzfmwjaj5vwshmz7n8vj0.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fzfmwjaj5vwshmz7n8vj0.jpg" alt=" " width="800" height="386"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fof5uuoa9gg0qzuzp33kf.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fof5uuoa9gg0qzuzp33kf.jpg" alt=" " width="800" height="387"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5ha2u8hgpni5ifrpr8ok.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F5ha2u8hgpni5ifrpr8ok.jpg" alt=" " width="800" height="382"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6puh6u6um665v8p65zjz.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F6puh6u6um665v8p65zjz.jpg" alt=" " width="800" height="379"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frx9tcp0cipke2e9dx8xt.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frx9tcp0cipke2e9dx8xt.jpg" alt=" " width="800" height="391"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe2dq90p7c76592x8xq2s.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fe2dq90p7c76592x8xq2s.jpg" alt=" " width="800" height="395"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frguursrd5i0w9hy3f8ks.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Frguursrd5i0w9hy3f8ks.jpg" alt=" " width="800" height="391"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu5nrowq8zerpifq45qs2.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fu5nrowq8zerpifq45qs2.jpg" alt=" " width="800" height="378"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fp4hrigyivchx17kjh4ch.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fp4hrigyivchx17kjh4ch.jpg" alt=" " width="800" height="373"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7hqqciqpvuljvto4w7zy.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F7hqqciqpvuljvto4w7zy.jpg" alt=" " width="800" height="388"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqzbchiuppc7x3y07c23g.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fqzbchiuppc7x3y07c23g.jpg" alt=" " width="800" height="387"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy336tjdpvnlkw5j8bcby.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fy336tjdpvnlkw5j8bcby.jpg" alt=" " width="800" height="381"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiuolrf3da2r23xxuvlon.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fiuolrf3da2r23xxuvlon.jpg" alt=" " width="800" height="377"&gt;&lt;/a&gt;&lt;br&gt;
&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8ovnen5wk7j85lwiksuk.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F8ovnen5wk7j85lwiksuk.jpg" alt=" " width="800" height="386"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;I built Vision Agent for the Vision Possible hackathon — a real-time multimodal AI platform that watches live video, transcribes audio, detects objects &amp;amp; human poses, and responds using a multi-tier LLM cascade.&lt;/p&gt;

&lt;p&gt;This isn’t a static demo.&lt;br&gt;
It’s a production-style, extensible system.&lt;/p&gt;

&lt;p&gt;🚀 What It Does&lt;/p&gt;

&lt;p&gt;Live webcam streaming (chunked WebM ingestion)&lt;/p&gt;

&lt;p&gt;YOLOv8 object detection + pose estimation&lt;/p&gt;

&lt;p&gt;Real-time rep counting &amp;amp; posture correction&lt;/p&gt;

&lt;p&gt;Speech-to-text processing&lt;/p&gt;

&lt;p&gt;Fast deterministic response (instant reply)&lt;/p&gt;

&lt;p&gt;Polished long-form LLM response (async cascade)&lt;/p&gt;

&lt;p&gt;Tool/function registry for structured actions&lt;/p&gt;

&lt;p&gt;SSE-based live UI updates&lt;/p&gt;

&lt;p&gt;Deployment-ready with Docker configs&lt;/p&gt;

&lt;p&gt;Demo: &lt;a href="https://youtube.com/shorts/2F8jyKPJwTs?feature=share" rel="noopener noreferrer"&gt;https://youtube.com/shorts/2F8jyKPJwTs?feature=share&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Repo: &lt;a href="https://github.com/rupac4530-creator/vision-agent" rel="noopener noreferrer"&gt;https://github.com/rupac4530-creator/vision-agent&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;🧠 Architecture Overview&lt;/p&gt;

&lt;p&gt;Browser&lt;br&gt;
→ 1–2s WebM chunks&lt;br&gt;
→ FastAPI backend&lt;br&gt;
→ Frame extraction&lt;br&gt;
→ YOLOv8 vision pipeline&lt;br&gt;
→ STT transcription&lt;br&gt;
→ FastReply (deterministic, instant)&lt;br&gt;
→ LLM cascade (quality escalation)&lt;br&gt;
→ SSE responses to UI&lt;/p&gt;

&lt;p&gt;The separation between FastReply and PolishReply ensures:&lt;/p&gt;

&lt;p&gt;Instant UX&lt;/p&gt;

&lt;p&gt;Higher-quality reasoning&lt;/p&gt;

&lt;p&gt;Cost control&lt;/p&gt;

&lt;p&gt;Provider fallback reliability&lt;/p&gt;

&lt;p&gt;⚙️ Key Engineering Decisions&lt;br&gt;
Streaming Approach&lt;/p&gt;

&lt;p&gt;Used MediaRecorder chunk uploads instead of WebRTC for simplicity and cross-browser compatibility. This trades a small latency increase for reliability during prototyping.&lt;/p&gt;

&lt;p&gt;LLM Cascade Strategy&lt;/p&gt;

&lt;p&gt;Rather than a single provider:&lt;/p&gt;

&lt;p&gt;Fast, low-cost model replies first&lt;/p&gt;

&lt;p&gt;Higher-tier model refines answers asynchronously&lt;/p&gt;

&lt;p&gt;Automatic fallback on timeouts / quota errors&lt;/p&gt;

&lt;p&gt;Provider health metrics and auto-fallback logic&lt;/p&gt;

&lt;p&gt;Pose Counting Logic&lt;/p&gt;

&lt;p&gt;Implemented robust rep counting using:&lt;/p&gt;

&lt;p&gt;Joint angle thresholds&lt;/p&gt;

&lt;p&gt;State-machine transitions (down → up → down)&lt;/p&gt;

&lt;p&gt;Hysteresis / cooldowns to avoid duplicate counts&lt;/p&gt;

&lt;p&gt;Per-exercise detectors (squat, pushup, curl)&lt;/p&gt;

&lt;p&gt;🧪 Practical Use Cases&lt;/p&gt;

&lt;p&gt;AI fitness coach (real-time posture feedback)&lt;/p&gt;

&lt;p&gt;Security monitoring assistant (live alerts)&lt;/p&gt;

&lt;p&gt;Accessibility tool (scene descriptions + captions)&lt;/p&gt;

&lt;p&gt;Smart classroom assistant (lecture summarization)&lt;/p&gt;

&lt;p&gt;Live gaming companion / strategy hints&lt;/p&gt;

&lt;p&gt;🛠 Tech Stack&lt;/p&gt;

&lt;p&gt;Python + FastAPI (backend)&lt;/p&gt;

&lt;p&gt;Ultralytics YOLOv8 (vision)&lt;/p&gt;

&lt;p&gt;Whisper-style STT (transcription)&lt;/p&gt;

&lt;p&gt;Multi-provider LLM cascade (Gemini / OpenAI / others)&lt;/p&gt;

&lt;p&gt;SSE for real-time UI updates&lt;/p&gt;

&lt;p&gt;Docker + Render / Railway deployment configs&lt;/p&gt;

&lt;p&gt;🏗 Lessons Learned&lt;/p&gt;

&lt;p&gt;Streaming video pipelines are complex — chunk continuity matters.&lt;/p&gt;

&lt;p&gt;Deploy build limits (e.g., PyTorch size) require careful dependency choices.&lt;/p&gt;

&lt;p&gt;Design for failure: timeouts, retries, and graceful fallbacks are essential.&lt;/p&gt;

&lt;p&gt;UX speed (instant replies) beats slow perfect responses for interactive apps.&lt;br&gt;
Built for the Vision Possible hackathon by WeMakeDevs.&lt;br&gt;
Inspired by Vision-Agents and realtime ideas from Stream.&lt;/p&gt;

&lt;p&gt;Open source: &lt;a href="https://github.com/rupac4530-creator/vision-agent" rel="noopener noreferrer"&gt;https://github.com/rupac4530-creator/vision-agent&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Feedback, issues, and contributions welcome — drop a PR or open an issue.&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>wemakedevs</category>
      <category>hackathon</category>
      <category>visionagents</category>
    </item>
  </channel>
</rss>
