- Home
- Search & Web
- Browser Agent
Rating
Votes
0
score
Downloads
0
total
Price
Free
No login needed
Works With
About
π New: Magnitude is state-of-the-art, scoring 94% on WebVoyager!
Magnitude uses vision AI to enable you to control your browser with natural language.
- π§ Navigate - Sees and understands any interface to plan out actions
- π±οΈ Interact - Executes precise actions using mouse and keyboard
- π Extract - Intelligently extracts useful structured data
- β Verify - Built-in test runner with powerful visual assertions
You can use it to automate tasks on the web, integrate between apps without APIs, extract data, test your web apps, or as a building block for your own browser agents.
βοΈ Magnitude in action! βοΈ
// Magnitude can handle high-level tasks
await agent.act('Create a task', {
// Optionally pass data that the agent will use where appropriate
data: {
title: 'Use Magnitude',
description: 'Run "npx create-magnitude-app" and follow the instructions',
},
});
// It can also handle low-level actions
await agent.act('Drag "Use Magnitude" to the top of the in progress column');
// Intelligently extract data based on the DOM content matching a provided zod schema
const tasks = await agent.extract(
'List in progress tasks',
z.array(z.object({
title: z.string(),
description: z.string(),
// Agent can extract existing data or new insights
difficulty: z.number().describe('Rate the difficulty between 1-5')
})),
);Get started
Running your first browser automation
npx create-magnitude-appThis will create a new project and walk you through the steps for setting up Magnitude. It will also create an example script that you can run right away!
Using the test runner
To install the test runner for use in an existing web app, please run:
npm i --save-dev magnitude-test && npx magnitude initThis will create a basic tests directory tests/magnitude with:
magnitude.config.ts: Magnitude test configuration fileexample.mag.ts: An example test file
For information on how to run tests and integrate into CI/CD see here.
!NOTE] Magnitude requires a large **visually grounded** model. We recommend Claude Sonnet 4 for the best performance, but are also compatible with Qwen-2.5VL 72B. See [docs for more information.
Why Magnitude?
β Problem #1: Most browser agents draw numbered boxes around page elements - doesn't generalize well due to complex modern sites
β Solution: Vision-first architecture
- Visually grounded LLM specifies pixel coordinates
- True generalization independent of DOM structure
- Future-proof architecture for desktop apps, VMs, etc.
β Problem #2: Most browser agents follow "high-level prompt + tools = work until done" - works for demos, not production
Don't lose this
Three weeks from now, you'll want Browser Agent again. Will you remember where to find it?
Save it to your library and the next time you need Browser Agent, itβs one tap away β from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.
β‘ Pro tip for geeks: add a-gnt π€΅π»ββοΈ as a custom connector in Claude or a custom GPT in ChatGPT β one click and your library is right there in the chat. Or, if youβre in an editor, install the a-gnt MCP server and say βuse my [bench name]β in Claude Code, Cursor, VS Code, or Windsurf.
a-gnt's Take
Our honest review
Open-source, vision-first browser agent. Best for anyone looking to make their AI assistant more capable in search & web. It's completely free and works across most major AI apps. This one just landed in the catalog β worth trying while it's fresh.
Tips for getting started
Tap "Get" above, pick your AI app, and follow the steps. Most installs take under 30 seconds.
What's New
Imported from GitHub
Ratings & Reviews
0.0
out of 5
0 ratings
No reviews yet. Be the first to share your experience.