For decades, we've taught computers through explicit instructions. Write code. Define selectors. Handle every edge case. But what if we could teach AI agents the same way we teach humans—by showing them what to do?
The Problem with Traditional Automation
Traditional browser automation relies on brittle selectors and rigid scripts. When a website updates its CSS classes, your automation breaks. When a button moves, your script fails. You're not describing what you want done—you're prescribing exactly how to do it.
This creates a maintenance nightmare. Every UI change requires code updates. Every new workflow demands engineering time. Domain experts who know the work can't automate it without developer help.
Teaching by Example
What if you could simply record yourself performing a task and have an AI extract the intent, workflow, and decision points?
That's the idea behind SkillForge—a tool that converts screen recordings into structured skill files that any AI agent can execute.
Here's how it works:
- Record: Capture yourself performing any web-based task
- Extract: AI analyzes the recording to identify goals, workflows, and UI elements
- Generate: Output is a human-readable SKILL.md file with context-aware instructions
- Execute: Any compatible agent can perform the skill, adapting to UI changes
Why This Changes Everything
The key insight is separating intent from implementation. When you record yourself booking a flight, you're not saying "click the button with ID #submit-btn"—you're saying "complete the booking."
This means:
- Skills survive UI updates because they describe intent, not selectors
- Domain experts can create automation without writing code
- Skills are portable across different agent frameworks
- The format is human-readable and auditable
The SKILL.md Format
The generated files follow a simple markdown structure:
- Goal: What the agent should accomplish
- Workflow: Step-by-step execution plan
- Context: How to identify elements semantically
- Error handling: Recovery strategies
This format works with AutoGen, LangChain, CrewAI, or any framework that can parse structured instructions.
Looking Ahead
We're moving toward a world where AI agents can learn from observation just like humans do. The ability to capture domain expertise through demonstration rather than code opens up automation to everyone.
Check out SkillForge:
🔗 Product Hunt: https://www.producthunt.com/products/skillforge-2
🌐 Website: https://skillforge.expert
What workflows would you teach an AI agent if you could just show it once?
Top comments (0)