Amaar A.

Posted on May 19

Supervising AI Coding Agents from Anywhere on Earth: Why I'm Building Cipher

#ai #programming #productivity #agents

I Decided to Build AR Glasses for AI Coding Because I Was Trapped at My Desk

The 45-Minute Problem

I was coding at my desk when my AI agent hit a blocker.

I wanted to walk away. Grab coffee. Stretch. Literally anything but sit there staring at a progress bar.

I couldn't.

So I sat there for 45 minutes, babysitting it.

This happens to developers constantly. AI made coding faster. But it trapped us at our desks.

The Productivity Trap

AI coding tools solved the speed problem.

They created a new one: the productivity trap.

The agent runs. You're stuck watching it. You can't leave. You can't multitask. You just... wait.

Remote Supervision Exists (But It's Broken)

Here's the thing: remote AI supervision already exists.

Claude Code launched mobile capability. You can technically supervise AI agents from your phone.

The capability is real. The experience is punishment.

Why?

📱 Tiny screens – Squinting at code on a phone is not supervision. It's torture.

⌨️ No keyboard – Typing corrections on mobile kills every productivity gain.

🔲 No visual context – Text-only output strips the spatial understanding developers rely on.

Enter: Cipher

So I decided to build Cipher – AR glasses for supervising AI coding agents from anywhere.

Here's how it works:

👓 AR Glasses

See code in context, not just raw text on a tiny display. Visual overlays. Spatial context. The way you'd see it on a monitor.

🎙️ Voice Control

Speak instructions. No keyboard. No friction. Natural language commands to your agent team.

🤖 Multi-Agent Team

Not one generalist AI. 7 specialist agents running in parallel:

🏛️ ATLAS – Architecture specialist
🔐 CYPHER – Security expert
⚒️ FORGE – Code generation
🔬 SAGE – Testing & QA
🚀 SWIFT – DevOps
🗄️ ORACLE – Data & SQL
🎨 PIXEL – Frontend & UI

Each agent has deep domain expertise. They cross-validate each other. The goal is higher-quality output than a single generalist produces.
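To make the pattern concrete, here is a minimal sketch of "run in parallel, then cross-validate". It is an illustration under stated assumptions, not Cipher's actual orchestration code: the agent names come from the list above, while `run_specialist`, `review`, `supervise`, and the ring-style review order are hypothetical stand-ins for real model calls.

```python
import asyncio
from dataclasses import dataclass

# Agent names are from the list above; every function here is a hypothetical stub.
SPECIALISTS = ["ATLAS", "CYPHER", "FORGE", "SAGE", "SWIFT", "ORACLE", "PIXEL"]


@dataclass
class Review:
    reviewer: str
    author: str
    approved: bool


async def run_specialist(name: str, task: str) -> str:
    """Stand-in for a real model call: returns a labeled draft."""
    await asyncio.sleep(0)  # yield to the event loop, as a network call would
    return f"[{name}] draft for: {task}"


async def review(reviewer: str, author: str, draft: str) -> Review:
    """Stand-in for a peer review: approves any non-empty draft."""
    await asyncio.sleep(0)
    return Review(reviewer=reviewer, author=author, approved=bool(draft.strip()))


async def supervise(task: str) -> list[Review]:
    # Step 1: every specialist works on the task concurrently.
    drafts = await asyncio.gather(*(run_specialist(n, task) for n in SPECIALISTS))
    # Step 2: each draft is checked by the next specialist in the ring,
    # so no agent reviews its own work.
    checks = []
    for i, (author, draft) in enumerate(zip(SPECIALISTS, drafts)):
        reviewer = SPECIALISTS[(i + 1) % len(SPECIALISTS)]
        checks.append(review(reviewer, author, draft))
    return await asyncio.gather(*checks)


reviews = asyncio.run(supervise("add a login endpoint"))
for r in reviews:
    print(f"{r.reviewer} reviewed {r.author}: approved={r.approved}")
```

A ring is just the simplest pairing that guarantees nobody signs off on their own draft. A real system might instead route security-sensitive drafts to the security agent, or have several reviewers per draft.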

🌍 Hands-Free

Supervise while walking. Coffee shop. Park bench. Anywhere. Same capability. 10× better experience.

What's Working Today

I have a working proof-of-concept (phone app):

✅ 7 specialist AI agents orchestrated in real time (CrewAI coordinating agents that run on Groq-hosted models)

✅ Sub-1.5-second latency globally over the public internet

✅ Real coding tasks completed successfully end-to-end

✅ Daily personal use – I run this on my own projects
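For anyone who wants to check a latency figure like the one above against their own setup, here is a rough sketch of timing round trips against a budget. The 1.5-second budget is the number from the list. `measure_latency`, `within_budget`, and `fake_agent_call` are hypothetical names, and the fake call only sleeps, so swap in a real request to get meaningful numbers.

```python
import statistics
import time
from typing import Callable

LATENCY_BUDGET_S = 1.5  # the sub-1.5-second figure quoted above


def measure_latency(call: Callable[[], object], runs: int = 5) -> list[float]:
    """Time `runs` sequential invocations of `call`, in seconds."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        call()
        samples.append(time.perf_counter() - start)
    return samples


def within_budget(samples: list[float], budget_s: float = LATENCY_BUDGET_S) -> bool:
    """True only if even the slowest sample beats the budget."""
    return max(samples) < budget_s


def fake_agent_call() -> str:
    """Hypothetical stand-in for a real agent request; it just sleeps briefly."""
    time.sleep(0.01)
    return "ok"


samples = measure_latency(fake_agent_call)
print(f"median={statistics.median(samples):.3f}s worst={max(samples):.3f}s")
print("within budget:", within_budget(samples))
```

Checking the worst sample rather than the average is the stricter choice: a supervision loop feels slow when any single response lags, not only when the mean does.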

The backend is proven. The agent system is functional. The hard engineering is behind us.

The glasses are a better interface on top of a working foundation.

The Roadmap

Phase 1 – Phone App Beta (Next few months)

Launch to waitlist users. Pay-as-you-go API pricing. Validate demand. Gather usage data.

Phase 2 – Hardware Development (Next 8-10 months)

Build AR glasses prototype:

Display: 0.39" 1920×1080 OLED (5644 PPI)
Chipset: ESP32-S3
Camera: OV5640 5MP
ODM partner: Devomech Solutions ($50K quote received)
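As a quick sanity check on the display spec above, pixel density is the diagonal pixel count divided by the diagonal size in inches. The sketch below is plain arithmetic on the listed numbers, and `pixels_per_inch` is a helper name made up for this example.

```python
import math


def pixels_per_inch(width_px: int, height_px: int, diagonal_in: float) -> float:
    """Pixel density: diagonal resolution in pixels over diagonal size in inches."""
    return math.hypot(width_px, height_px) / diagonal_in


ppi = pixels_per_inch(1920, 1080, 0.39)
print(round(ppi))  # 5648

# Working backwards: the diagonal that would give exactly 5644 PPI.
implied_diagonal = math.hypot(1920, 1080) / 5644
print(round(implied_diagonal, 4))  # 0.3903
```

The result lands within a few pixels per inch of the listed 5644, and the small gap is what you would expect if the panel's true diagonal is slightly over 0.39" and was rounded to two decimals.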

Phase 3 – Kickstarter Launch (If we find a US-based co-founder)

Pre-orders from phone app users. Validate hardware demand.

Phase 4 – Manufacturing & Scale

Deliver to backers. Subscription model for glasses users.

The Contingency

Hardware is hard. If the glasses take longer than expected, I'm not stuck.

Pivot path: Ship the phone app as a standalone product.

Multi-agent supervision on mobile is still valuable and differentiated.

The glasses are the 10× experience. The phone app is the 3× experience.

Both are defensible.

About Me

I'm 17. I'm a full-stack developer. I started coding at 14. I had an ML engineering internship at 15 (NYC-based company, remote).

I AM the target customer. I built Cipher because I needed it.

The phone app comes first.

Join the Waitlist

I'm launching the phone app beta in a few months.

AR glasses prototype in early 2027.

Join the waitlist for early access: https://tally.so/r/rjGk9R

Building in public. Feedback welcome.

Questions for the Community

Does this problem resonate with you? Do you feel stuck babysitting AI agents?
Would you use a phone app for this, or do you think glasses are essential?
What's your biggest concern with the multi-agent approach?

Drop your thoughts in the comments. All feedback welcome – especially critical feedback.

Let's build this together.
