Web Eval Agent
An MCP server that autonomously evaluates web applications.
What is Web Eval Agent?
Web Eval Agent is a Model Context Protocol (MCP) server that lets AI assistants in clients such as Claude, Cursor, and VS Code autonomously evaluate web applications.
This server falls under the Browser Automation category on MCPgee, the world's largest MCP server directory with 33,000+ servers.
Installation
Manual Installation
npx web-eval-agent
Configuration
Configuration Details
claude_desktop_config.json
How to Set Up and Use Web Eval Agent
Web Eval Agent MCP Server is an autonomous web application evaluation tool that uses AI-powered browser automation to test, debug, and report on web apps directly from within your code editor. Built on BrowserUse and Playwright, it navigates your application like a real user, captures screenshots at each step, records network requests and responses, and collects browser console errors — then delivers a structured report summarising what it found. It integrates with MCP-compatible clients so that AI coding assistants can validate their own generated code against a running application without manual browser testing.
Prerequisites
- An Operative.sh account and API key (available free at operative.sh/mcp)
- The operative.sh CLI installed (includes the MCP server)
- An MCP-compatible client such as Claude Desktop, Claude Code, or Cursor
- A locally running web application or a publicly accessible URL to evaluate
Install the operative.sh CLI
The install script sets up the Web Eval Agent MCP server and all dependencies including BrowserUse and Playwright.
curl -LSf https://operative.sh/install.sh -o install.sh && bash install.sh && rm install.sh
Get your Operative API key
Visit operative.sh/mcp to create a free account and retrieve your OPERATIVE_API_KEY. This key authenticates all browser automation sessions.
Add the server to your MCP client config
Add the following entry to your claude_desktop_config.json (or equivalent config file for your MCP host).
{
  "mcpServers": {
    "web-eval-agent": {
      "command": "npx",
      "args": ["web-eval-agent"],
      "env": {
        "OPERATIVE_API_KEY": "your_operative_api_key"
      }
    }
  }
}
Set up browser authentication (optional)
If your app requires login, use the setup_browser_state tool first. It opens an interactive browser where you can sign in manually; session cookies are persisted for subsequent automated evaluations.
Restart your MCP client and launch your web app
Restart the MCP client. Start your application on a local port. The server exposes two tools: web_eval_agent (automated testing with URL and task inputs) and setup_browser_state (session persistence).
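The config step above can also be scripted. The following is a minimal sketch, not part of the Web Eval Agent tooling: the helper name add_web_eval_agent is hypothetical, and the path to your client's config file is something you must supply yourself. It merges the entry shown earlier into an existing config so that other servers already listed under mcpServers are preserved.

```python
import json
from pathlib import Path


def add_web_eval_agent(config_path: Path, api_key: str) -> dict:
    """Merge the web-eval-agent entry into an MCP client config file.

    Hypothetical helper for illustration; the entry mirrors the JSON
    shown in the setup steps.
    """
    # Start from the existing config (if any) so other servers survive.
    text = config_path.read_text().strip() if config_path.exists() else ""
    config = json.loads(text) if text else {}

    servers = config.setdefault("mcpServers", {})
    servers["web-eval-agent"] = {
        "command": "npx",
        "args": ["web-eval-agent"],
        "env": {"OPERATIVE_API_KEY": api_key},
    }

    # Write the merged config back with readable indentation.
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

For example, calling add_web_eval_agent(Path("claude_desktop_config.json"), "your_operative_api_key") updates a config file in the current directory. Restart the MCP client afterwards, as described above.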
Web Eval Agent Examples
Client configuration
Minimal claude_desktop_config.json for the Web Eval Agent MCP server.
{
  "mcpServers": {
    "web-eval-agent": {
      "command": "npx",
      "args": ["web-eval-agent"],
      "env": {
        "OPERATIVE_API_KEY": "your_operative_api_key"
      }
    }
  }
}
Prompts to try
Sample prompts for autonomous web app evaluation using the web_eval_agent tool.
- "Test my app at http://localhost:3000 — try the full signup flow and report UX issues and console errors"
- "Navigate to http://localhost:5173 and verify that all navigation links work without 404 errors"
- "Evaluate the form validation on http://localhost:3000/contact — submit with empty fields and check error messages appear"
- "Run an end-to-end test of the shopping cart at http://localhost:3000/shop: add two items, update quantities, and report any issues"
Troubleshooting Web Eval Agent
OPERATIVE_API_KEY not found or invalid
Sign up at operative.sh/mcp to get a free API key. Paste it exactly (no extra spaces) into the env block in your MCP config. Fully restart the MCP client after saving the config file.
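The common config mistakes described above (missing key, stray spaces, placeholder left in place) can be caught before restarting the client. This is a rough sketch: the function name api_key_problems is hypothetical, and it only inspects the shape of the config text. It cannot tell whether Operative will actually accept the key.

```python
import json

# Placeholder value used in the example config in this guide.
PLACEHOLDER = "your_operative_api_key"


def api_key_problems(config_text: str, server: str = "web-eval-agent") -> list[str]:
    """Return human-readable problems with the OPERATIVE_API_KEY entry.

    Hypothetical helper; an empty list means no obvious formatting issue.
    """
    config = json.loads(config_text)
    env = config.get("mcpServers", {}).get(server, {}).get("env", {})
    key = env.get("OPERATIVE_API_KEY")

    problems = []
    if key is None:
        problems.append("OPERATIVE_API_KEY is missing from the env block")
    elif not key.strip():
        problems.append("OPERATIVE_API_KEY is empty")
    else:
        if key != key.strip():
            problems.append("OPERATIVE_API_KEY has leading or trailing whitespace")
        if key.strip() == PLACEHOLDER:
            problems.append("OPERATIVE_API_KEY is still the placeholder value")
    return problems
```

Pass it the contents of your config file, for example api_key_problems(open("claude_desktop_config.json").read()), and fix anything it reports before restarting the MCP client.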
Agent cannot access the app at localhost
Make sure your web application is already running and serving on the specified port before prompting the agent. Test with `curl http://localhost:3000` to confirm it is reachable. The agent's browser runs on the same host machine as the MCP server.
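If curl is not available, the same reachability check can be sketched with the Python standard library. The helper name is_reachable is hypothetical. It treats any HTTP response, including 4xx and 5xx, as "reachable", since the point is only to confirm that something is listening on the port. System proxies are bypassed on the assumption that you are checking a local development server.

```python
import urllib.error
import urllib.request


def is_reachable(url: str, timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at url (hypothetical helper)."""
    # An empty ProxyHandler disables system proxies, which would otherwise
    # get in the way of checks against localhost.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server responded with an error status, so it is reachable.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, or timeout.
        return False
```

For example, is_reachable("http://localhost:3000") should return True once your app is up. If it returns False, start the app or correct the port before prompting the agent.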
The evaluation report is vague or misses key interactions
Write more specific task descriptions. Instead of 'test my app', use 'click the Sign Up button, fill in a test email and password, submit the form, and verify the confirmation page loads'. Concrete step-by-step task descriptions produce much more useful evaluation reports.
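To illustrate what a concrete task looks like by the time it reaches the server, here is a sketch that turns a list of steps into a numbered task description and wraps it in an MCP tools/call request. The tools/call envelope follows the MCP specification. The argument names url and task are assumptions based on the "URL and task inputs" mentioned above, and both function names are hypothetical. In normal use your MCP client builds this request from your prompt, so the code is for illustration only.

```python
import json


def build_task(steps: list[str]) -> str:
    """Join individual steps into one numbered task description."""
    return "\n".join(f"{i}. {step}" for i, step in enumerate(steps, start=1))


def tool_call_request(url: str, steps: list[str], request_id: int = 1) -> dict:
    """Build a JSON-RPC tools/call request for the web_eval_agent tool.

    The argument names "url" and "task" are assumed, not confirmed.
    """
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "web_eval_agent",
            "arguments": {"url": url, "task": build_task(steps)},
        },
    }


if __name__ == "__main__":
    request = tool_call_request(
        "http://localhost:3000",
        [
            "Click the Sign Up button",
            "Fill in a test email and password",
            "Submit the form",
            "Verify the confirmation page loads",
        ],
    )
    print(json.dumps(request, indent=2))
```

Writing the steps as a list first makes it easy to see whether the task is specific enough: each item should name one action or one check.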
Frequently Asked Questions about Web Eval Agent
What is Web Eval Agent?
Web Eval Agent is a Model Context Protocol (MCP) server that autonomously evaluates web applications. It connects AI assistants to external tools and data sources through a standardized interface.
How do I install Web Eval Agent?
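Either run the operative.sh install script from the setup steps above, or follow the installation instructions on the Web Eval Agent GitHub repository (clone the repo and install dependencies). In both cases, add the server config to your AI client and restart it.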
Which AI clients work with Web Eval Agent?
Web Eval Agent works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.
Is Web Eval Agent free to use?
Yes, Web Eval Agent is open source and available under the Apache-2.0 license. You can use it freely in both personal and commercial projects.
Web Eval Agent Alternatives — Similar Browser Automation Servers
Looking for alternatives to Web Eval Agent? Here are other popular browser automation servers you can use with Claude, Cursor, and VS Code.
Chrome DevTools MCP
★ 40.6k AI-powered Chrome automation server with natural language element detection. Control Chrome browser through MCP protocol for testing, debugging, and performance analysis. Features 91% accuracy in element location, works with free AI models, and suppo
UI TARS Desktop
★ 34.9k Browser automation capabilities using Puppeteer, supporting both local and remote browser connections.
Playwright
★ 32.8k A production-ready browser automation server that enables AI assistants to interact with web pages using tools for navigation, element interaction, and data extraction. It features a built-in Inspector UI and robust crash recovery for reliable automa
Page Agent
★ 18.0k JavaScript in-page GUI agent. Control web interfaces with natural language.
Chrome
★ 11.7k An extension-based MCP server that enables AI assistants to control your browser, leveraging existing sessions and login states for automation and content analysis. It provides over 20 tools for semantic tab search, interactive element manipulation,
LAMDA
★ 7.8k The most powerful Android RPA agent framework, next generation mobile automation.