- Home
- Search & Web
- Scrap Mcp
Scrap Mcp
An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selec
Rating
Votes
0
score
Downloads
0
total
Price
Free
No login needed
Works With
About
Web Scraper MCP Server
An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selectors. Built with deno-dom for fast HTML parsing.
Why
Most LLM clients already have some HTTP fetching capabilities, but fetching a page directly often returns a lot of unnecessary content. This not only confuses the LLM, but also quickly fills up the context window.
That's where this MCP comes in—it enables targeted scraping using CSS selectors, so you only extract the content you actually need.
Example from Zed
See ZedExample.md for a real-world usage example.
Features
- 🌐 Fetch any publicly accessible web page by URL
- 🔍 Parse HTML content using the fast deno-dom library
- 📋 Extract text content using standard CSS selectors
- 🎯 Support for complex selectors (classes, IDs, attributes, pseudo-selectors)
- ⚡ Built-in error handling for network issues and parsing failures
- 🛡️ Safe execution with minimal required permissions
Prerequisites
- Deno installed on your system
- Network access for fetching web pages
Running the Server
deno run --allow-net jsr:@sigma/scrap-mcpYou can also run this with Bun and Node.js using bunx and npx respectively:
bunx rjsr @sigma/scrap-mcpnpx rjsr @sigma/scrap-mcpMCP Tool Reference
scrape_page
The main tool for scraping web pages and extracting content.
Parameters:
url(string, required): The URL of the page to scrapequery_selector(string, required): CSS selector to query elements
Return Format:
Found X elements matching selector "SELECTOR" on URL:
Element 1: TEXT_CONTENT
Element 2: TEXT_CONTENT
...Usage Examples
Basic Selectors
- 1.Extract all headings:
```json
{ "url": "https://example.com", "query_selector": "h1, h2, h3" }
```
- 1.Extract all paragraphs:
```json
{ "url": "https://example.com", "query_selector": "p" }
```
- 1.Extract content from specific classes:
```json
{ "url": "https://news.ycombinator.com", "query_selector": ".titleline > a" }
```
- 1.Extract all links:
```json
{ "url": "https://example.com", "query_selector": "a" }
```
Advanced Selectors
- 1.Extract navigation items:
```json
{ "url": "https://deno.land", "query_selector": "nav a" }
```
- 1.Extract elements with specific attributes:
```json
{ "url": "https://example.com", "query_selector": "a[href^='https://']" }
```
- 1.Extract form inputs:
```json
{ "url": "https://example.com", "query_selector": "input[type='text'], input[type='email']" }
```
CSS Selector Reference
Basic Selectors
h1- All H1 headings.className- All elements with class "className"#elementId- Element with ID "elementId"*- All elements
Combinators
Don't lose this
Three weeks from now, you'll want Scrap Mcp again. Will you remember where to find it?
Save it to your library and the next time you need Scrap Mcp, it’s one tap away — from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.
⚡ Pro tip for geeks: add a-gnt 🤵🏻♂️ as a custom connector in Claude or a custom GPT in ChatGPT — one click and your library is right there in the chat. Or, if you’re in an editor, install the a-gnt MCP server and say “use my [bench name]” in Claude Code, Cursor, VS Code, or Windsurf.
a-gnt's Take
Our honest review
This plugs directly into your AI and gives it new abilities it didn't have before. An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selec. Once connected, just ask your AI to use it. It's completely free and works across most major AI apps. This one just landed in the catalog — worth trying while it's fresh.
Tips for getting started
Tap "Get" above, pick your AI app, and follow the steps. Most installs take under 30 seconds.
What's New
Imported from GitHub
Ratings & Reviews
0.0
out of 5
0 ratings
No reviews yet. Be the first to share your experience.