Cerebellum

Browser automation system that uses AI-driven planning to navigate web pages and perform goals.

A lightweight browser-using agent that accomplishes user-defined goals on webpages using keyboard and mouse actions.

See It In Action

Goal: Find a USB C to C cable that is 10 feet long and add it to cart

[Demo](https://github.com/user-attachments/assets/7a8500c9-35f4-45d3-bc0c-a765bc4aee6a)

Setup

Please see setup directions for your language:

How It Works

  1. Web browsing is simplified to navigating a directed graph.
  2. Each webpage is a node with visible elements and data.
  3. User actions, such as clicking or typing, are edges that move between nodes.
  4. Cerebellum starts at a webpage and aims to reach a target node that embodies the completed goal.
  5. It uses an LLM to find new nodes by analyzing page content and interactive elements.
  6. The LLM decides the next action based on the current state and past actions.
  7. Cerebellum executes the LLM's planned action and feeds the new state back into the LLM for the next step.
  8. The process ends when the LLM decides the goal has been reached or is unachievable.
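
The loop described in the steps above can be sketched in a few lines of Python. This is an illustrative toy and not Cerebellum's actual API: `Page`, `Action`, `run_agent`, and `toy_planner` are hypothetical names, the pages are an in-memory graph instead of a real browser, and a hard-coded function stands in for the LLM.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Action:
    """An edge in the graph (hypothetical type, not part of Cerebellum)."""
    kind: str          # "click", "success", or "failure"
    target: str = ""   # label of the element acted on


@dataclass
class Page:
    """A node in the graph: a page name plus the actions available on it."""
    name: str
    edges: dict = field(default_factory=dict)  # element label -> next page name


def run_agent(pages, start, planner, max_steps=10):
    """Ask the planner for one action per step until it reports a terminal state."""
    current = pages[start]
    history = []
    for _ in range(max_steps):
        action = planner(current, history)      # decide from state + past actions
        if action.kind in ("success", "failure"):
            return action.kind, history          # planner ended the run
        history.append(action)
        next_name = current.edges.get(action.target)
        if next_name is not None:                # follow the edge to the new node
            current = pages[next_name]
    return "failure", history                    # step budget exhausted


def toy_planner(page, history):
    """Stand-in for the LLM: succeed on the cart page, else take the first edge."""
    if page.name == "cart":
        return Action("success")
    if not page.edges:
        return Action("failure")
    return Action("click", next(iter(page.edges)))


PAGES = {
    "search": Page("search", {"first result": "product"}),
    "product": Page("product", {"add to cart": "cart"}),
    "cart": Page("cart"),
}

outcome, history = run_agent(PAGES, "search", toy_planner)
print(outcome, [a.target for a in history])
```

In a real run the planner would receive the page's content and interactive elements, and the executor would drive a browser; the control flow of plan, act, observe, repeat is the part this sketch is meant to show.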

Currently, Claude 3.5 Sonnet is the only supported LLM.

Features

  • Compatible with any Selenium-supported browser.
  • Fills forms using user-provided JSON data.
  • Accepts runtime instructions to dynamically adjust browsing strategies and actions.
  • TODO: Create training datasets from browsing sessions
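
As a rough illustration of the JSON form data feature, the sketch below pairs user-provided JSON with the field names found on a form. How Cerebellum actually passes this data to its planner is not described here, so `FORM_DATA_JSON`, `plan_form_fill`, and the field names are all hypothetical.

```python
import json

# Hypothetical user-provided data; the keys are illustrative only.
FORM_DATA_JSON = """
{
  "first_name": "Ada",
  "last_name": "Lovelace",
  "email": "ada@example.com"
}
"""


def plan_form_fill(field_names, form_data_json):
    """Return (field, value) pairs to type, plus fields with no matching data."""
    data = json.loads(form_data_json)
    filled = [(name, data[name]) for name in field_names if name in data]
    missing = [name for name in field_names if name not in data]
    return filled, missing


filled, missing = plan_form_fill(["first_name", "email", "phone"], FORM_DATA_JSON)
print(filled)
print(missing)
```

An LLM-driven agent can match fields more loosely than exact key lookup (for example, mapping a "Given name" label to `first_name`); exact matching is used here only to keep the example small.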

Roadmap

  • [x] Integrate Claude 3.5 Sonnet as an ActionPlanner
  • [x] Demonstrate successful BrowserAgent across a variety of tasks
  • [x] Create Python SDK
  • [x] Handle tabbed browsing
  • [ ] Handle data extraction from websites
  • [x] Improve vertical scrolling behavior
  • [ ] Improve horizontal scrolling behavior
  • [x] Improve system prompt performance
  • [x] Improve mouse position marking on screenshots
  • [ ] Add the ability to convert browser sessions into training datasets
  • [ ] Support for additional LLMs as an ActionPlanner
  • [ ] Train a local model
  • [ ] Integrate a local model as an ActionPlanner

Known Issues

  • Claude 3.5 safety refusals:
      • Refuses to solve CAPTCHAs
      • Refuses to navigate when political content is on the page

Contributing

Contributions to Cerebellum are welcome. For details on how to get involved, please refer to our CONTRIBUTING.md.

We appreciate all contributions, whether they're bug reports, feature requests, or code changes.

License

This project is licensed under the MIT License.
