Web Eval Agent

Name: Web Eval Agent
Author: refreshdotdev

An MCP server that autonomously evaluates web applications.

by refreshdotdev

Rating

0.0

Votes

score

Downloads

total

Price

Free

API key required

Works With

Claude CodeCursorWindsurfVS CodeDeveloper tool

About

⚠️ PROJECT HAS BEEN SUNSET ⚠️

This project has been discontinued. We're building something new at withrefresh.com

🚀 operative.sh web-eval-agent MCP Server

Let the coding agent debug itself, you've got better things to do.

🔥 Supercharge Your Debugging

operative.sh's MCP Server launches a browser-use powered agent to autonomously execute and debug web apps directly in your code editor.

⚡ Features

🌐 Navigate your webapp using BrowserUse (2x faster with operative backend)
📊 Capture network traffic - requests are intelligently filtered and returned into the context window
🚨 Collect console errors - captures logs & errors
🤖 Autonomous debugging - the Cursor agent calls the web QA agent mcp server to test if the code it wrote works as epected end-to-end.

🧰 MCP Tool Reference

Tool	Purpose
`web_eval_agent`	🤖 Automated UX evaluator that drives the browser, captures screenshots, console & network logs, and returns a rich UX report.
`setup_browser_state`	🔒 Opens an interactive (non-headless) browser so you can sign in once; the saved cookies/local-storage are reused by subsequent `web_eval_agent` runs.

Key arguments

web_eval_agent
url (required) – address of the running app (e.g. http://localhost:3000)
task (required) – natural-language description of what to test ("run through the signup flow and note any UX issues")
headless_browser (optional, default `false`) – set to true to hide the browser window

setup_browser_state
url (optional) – page to open first (handy to land directly on a login screen)

You can trigger these tools straight from your IDE chat, for example:

bash

Evaluate my app at http://localhost:3000 – run web_eval_agent with the task "Try the full signup flow and report UX issues".

🏁 Quick Start

Easy Setup with One-Click Integration

1.Get your API key (free) - when you create your API key, you'll see:

"Add to Cursor" button with a deeplink for instant Cursor installation
Prefilled Claude Code command with your API key automatically included

Manual Setup (macOS/Linux)

Don't lose this

Three weeks from now, you'll want Web Eval Agent again. Will you remember where to find it?

Save it to your library and the next time you need Web Eval Agent, it’s one tap away — from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.

⚡ Pro tip for geeks: add a-gnt 🤵🏻‍♂️ as a custom connector in Claude or a custom GPT in ChatGPT — one click and your library is right there in the chat. Or, if you’re in an editor, install the a-gnt MCP server and say “use my [bench name]” in Claude Code, Cursor, VS Code, or Windsurf.

🤵🏻‍♂️

a-gnt's Take

Our honest review

This plugs directly into your AI and gives it new abilities it didn't have before. An MCP server that autonomously evaluates web applications. . Once connected, just ask your AI to use it. It's completely free and works across most major AI apps.

Tips for getting started

Tap "Get" above, pick your AI app, and follow the steps. Most installs take under 30 seconds.

Heads up: this needs an API key to work. You'll get one from the service's website (usually free). The setup guide tells you exactly where.

Search & Web