Hey fellow developers! ๐ I'm excited to share something I've built and I'd love your feedback and contributions. I've created a web automation system that brings together the power of A2A (Agent-to-Agent) protocol and MCP (Model Context Protocol) with Selenium WebDriver. The best part? You can try it live right now!
๐ฎ Try the Live Demo
Want to see it in action before diving in? Here are some things to try (remember, results may vary based on your prompts):
- Run automated web tests (start with simple navigation tasks)
- Capture screenshots (works best with stable pages)
- Execute natural language commands (be specific and clear in your instructions)
- Watch agents communicate in real-time
๐ก Pro Tip: When using the demo, try to:
- Be specific in your instructions
- Start with simple commands and gradually increase complexity
- If something doesn't work, try rephrasing your prompt
- Share what prompts worked best for you!
๐ What I've Built (And You Can Too!)
I've created a web automation agent that's ready for you to use, extend, and improve. Here's what it can do:
โจ Core Features (Try them live!):
- Executes Selenium-based web automation tasks with natural language
- Captures and validates UI elements automatically
- Can Communicates between agents using A2A and MCP protocols using poc java client
- Integrates with your favorite AI models (Gemini, OpenAI, Claude, Grok)
- Provides real-time agent communication and task monitoring
๐ฏ Live Demo Highlights:
- Test web automation scenarios instantly
- Watch agents collaborate in real-time
- Experiment with different AI models
- No setup required - just visit the demo URL!
๐ ๏ธ Technology Stack
- a2ajava: The Swiss Army knife for building agent applications
- Selenium WebDriver: For web automation
- Spring Boot: For the application framework
- AI Integration: Support for multiple LLM platforms
๐ฅ Key Features
-
Multi-Protocol Support
- A2A (Agent-to-Agent) protocol for agent communication
- MCP (Model Context Protocol) for AI model integration
- Seamless interoperability between protocols
-
Multi-Language Support
- Java (primary)
- Kotlin
- Groovy
-
Multi-Platform AI Integration
- Gemini
- OpenAI
- Claude
- Grok
-
Advanced Integration Features
- Selenium automation
- Human-in-the-loop workflows
- Multi-LLM voting for consensus-based decisions
๐ป Get Started in Minutes
Want to join the development? Here's how to get started:
๐ Try it Online First
Visit our live demo and try these sample commands:
- Navigate to a website
- Capture screenshots
- Validate UI elements
- Watch real-time agent communication
๐ง Local Setup for MCP
{
"webbrowsingagent": {
"command": "java",
"args": [
"-jar",
"/work/a2a-mcp-bridge/target/mcp-connector-full.jar",
"http://localhost:7860/"
],
"timeout": 30000
}
}
- Remote Server Connection
{
"webbrowsingagent": {
"command": "java",
"args": [
"-jar",
"/work/a2a-mcp-bridge/target/mcp-connector-full.jar",
"https://vishalmysore-a2amcpselenium.hf.space"
],
"timeout": 30000
}
}
For Setup with A2A add this as remote agent in your a2a client, I have tried to add as much documetation as possible but if something is missing or not working please let me know
Source code is here
โ ๏ธ Work in Progress Notice: This project is under active development and may have bugs or unstable features. The system's behavior is highly dependent on the quality and clarity of prompts provided. I am continuously improving it and welcome your feedback!
Top comments (0)