DEV Community

Prateek YJ
Prateek YJ

Posted on

πŸš€ We Open-Sourced XAI's Macrohard: Meet Open Computer Use - Autonomous AI Agents That Actually Control Computers

Remember when XAI teased "Macrohard" - their vision of AI agents that could actually control computers? Well, the open-source community just said: "Hold our coffee." β˜•

Introducing Open Computer Use 🎯

Today, we're thrilled to share Open Computer Use - a fully open-source platform that gives AI agents real computer control. Not just chat. Not just suggestions. Actual automation.

What Can It Do? πŸš€

βœ… Browse the web like a human - search, click, fill forms, extract data
βœ… Run terminal commands - file management, script execution, package installation

βœ… Control desktop apps - full UI automation with computer vision
βœ… Multi-agent orchestration - break down complex tasks across specialized agents
βœ… Real-time streaming - watch your agents work with live feedback
βœ… 100% open-source - Apache 2.0 license, self-hostable, fully auditable

Think Anthropic's Claude Computer Use capabilities, but completely open, extensible, and production-ready.

Why This Matters πŸ’‘

For too long, "computer use" capabilities have been locked behind closed APIs and proprietary systems. Open Computer Use changes that:

  • πŸ”“ Truly Open: Apache 2.0 licensed - fork it, modify it, deploy it anywhere
  • πŸ”’ Safe by Design: Isolated Docker VMs, sandboxed execution, no data persistence
  • 🎯 Production Ready: Real-time streaming, multi-provider AI support (OpenAI, Anthropic, Google, xAI, Mistral, and more)
  • πŸ› οΈ Developer First: Built with Next.js 15, FastAPI, TypeScript - modern stack you already know

See It In Action 🎬

Our agents can:

  • Research and summarize - "Find the latest AI research papers and create a summary report"
  • Automate workflows - "Deploy this app to production and run the test suite"
  • Data extraction - "Scrape competitor pricing and build a comparison dashboard"
  • Complex tasks - "Build a quantitative trading dashboard using QuantConnect data"

All executed autonomously. All streaming in real-time. All running in isolated, secure environments.

The Tech Stack πŸ—οΈ

Frontend: Next.js 15 (App Router), React 19, TypeScript, Tailwind CSS 4
Backend: FastAPI, Python 3.10+, asyncio, websockets
AI Providers: OpenAI, Anthropic, Google, xAI, Mistral, Perplexity, OpenRouter
Infrastructure: Docker, Ubuntu 22.04 + XFCE, Selenium, Playwright
Database: Supabase (Auth + Postgres)

Architecture Highlights 🎯

User Request β†’ AI Planner β†’ Specialized Agents β†’ Isolated VM
                    ↓
            [Browser | Terminal | Desktop]
                    ↓
          Real-time Streaming Feedback
Enter fullscreen mode Exit fullscreen mode

Each session runs in an isolated Docker container with:

  • Sandboxed execution environment
  • Ephemeral containers (no data persistence)
  • Network isolation options
  • Resource limits and monitoring

Quick Start πŸƒβ€β™‚οΈ

# Clone the repo
git clone https://github.com/LLmHub-dev/open-computer-use.git
cd open-computer-use

# Set up Supabase, add your API keys
cp .env.example .env
# Edit .env with your configuration

# Start with Docker
docker-compose up --build

# Access at http://localhost:3000
Enter fullscreen mode Exit fullscreen mode

Bring your own API keys (BYOK) for any AI provider. All keys are encrypted and stored securely.

Use Cases πŸ’Ό

✨ Research & Data Gathering - Web scraping, competitive analysis, market research
✨ Testing & QA - Automated UI testing, E2E test generation, regression testing

✨ DevOps & Automation - Server configuration, deployment automation, log analysis
✨ E-commerce Operations - Price monitoring, product research, inventory tracking
✨ Content Creation - Screenshot documentation, tutorial generation, demo creation

Join the Movement 🌟

This is just the beginning. We're building the future of autonomous computer agents - openly, safely, and together.

What we need from you:

⭐ Star the repo - github.com/LLmHub-dev/open-computer-use
πŸ’¬ Join our Discord - Share ideas, get help, contribute
πŸ”§ Contribute - PRs welcome! Check out our Contributing Guide
πŸ“’ Spread the word - Help us show what open-source can do

Roadmap Ahead πŸ—ΊοΈ

Q1 2026:

  • Multi-VM orchestration (parallel agents)
  • Visual workflow builder
  • Windows and macOS VM support
  • Mobile apps (iOS/Android)

Q2 2026:

  • Plugin system for custom tools
  • Collaborative agent sessions
  • Enterprise SSO support
  • Advanced analytics dashboard

The Big Picture 🌍

"Computer use" capabilities shouldn't be locked behind proprietary APIs. They should be:

  • βœ… Open - Auditable, modifiable, yours to control
  • βœ… Safe - Isolated, sandboxed, transparent
  • βœ… Accessible - Self-hostable, BYOK, no vendor lock-in
  • βœ… Collaborative - Built by the community, for the community

We're not just building an alternative to closed-source solutions. We're building the foundation for a new era of autonomous agents that developers can trust, extend, and deploy anywhere.


Ready to Give Your AI Agents Real Power? πŸ’ͺ

⭐ Star us on GitHub: LLmHub-dev/open-computer-use
πŸ’¬ Join Discord: discord.gg/llmhub

🐦 Follow on X: @llmhub_dev
πŸ“§ Contact: prateek@llmhub.dev


Built with ❀️ by the Open Computer Use community. Apache 2.0 licensed. 97 stars (and counting!)

Let's show the world what open-source can do. Star the repo, join the Discord, and help us shape the future of autonomous computer agents! πŸš€

Top comments (0)