Teaching AI Agents by Example: A New Paradigm in Automation

#showdev #webdev #ai #automation

For decades, we've taught computers through explicit instructions. Write code. Define selectors. Handle every edge case. But what if we could teach AI agents the same way we teach humans—by showing them what to do?

The Problem with Traditional Automation

Traditional browser automation relies on brittle selectors and rigid scripts. When a website updates its CSS classes, your automation breaks. When a button moves, your script fails. You're not describing what you want done—you're prescribing exactly how to do it.

This creates a maintenance nightmare. Every UI change requires code updates. Every new workflow demands engineering time. Domain experts who know the work can't automate it without developer help.

Teaching by Example

What if you could simply record yourself performing a task and have an AI extract the intent, workflow, and decision points?

That's the idea behind SkillForge—a tool that converts screen recordings into structured skill files that any AI agent can execute.

Here's how it works:

Record: Capture yourself performing any web-based task
Extract: AI analyzes the recording to identify goals, workflows, and UI elements
Generate: Output is a human-readable SKILL.md file with context-aware instructions
Execute: Any compatible agent can perform the skill, adapting to UI changes

Why This Changes Everything

The key insight is separating intent from implementation. When you record yourself booking a flight, you're not saying "click the button with ID #submit-btn"—you're saying "complete the booking."

This means:

Skills survive UI updates because they describe intent, not selectors
Domain experts can create automation without writing code
Skills are portable across different agent frameworks
The format is human-readable and auditable

The SKILL.md Format

The generated files follow a simple markdown structure:

Goal: What the agent should accomplish
Workflow: Step-by-step execution plan
Context: How to identify elements semantically
Error handling: Recovery strategies

This format works with AutoGen, LangChain, CrewAI, or any framework that can parse structured instructions.

Looking Ahead

We're moving toward a world where AI agents can learn from observation just like humans do. The ability to capture domain expertise through demonstration rather than code opens up automation to everyone.

Check out SkillForge:

🔗 Product Hunt: https://www.producthunt.com/products/skillforge-2

🌐 Website: https://skillforge.expert

What workflows would you teach an AI agent if you could just show it once?