Skip to main content
0
S

Scrap Mcp

An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selec

Rating

0.0

Votes

0

score

Downloads

0

total

Price

Free

No login needed

Works With

Claude CodeCursorWindsurfVS CodeDeveloper tool

About

Web Scraper MCP Server

An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selectors. Built with deno-dom for fast HTML parsing.

Why

Most LLM clients already have some HTTP fetching capabilities, but fetching a page directly often returns a lot of unnecessary content. This not only confuses the LLM, but also quickly fills up the context window.

That's where this MCP comes in—it enables targeted scraping using CSS selectors, so you only extract the content you actually need.

Example from Zed

See ZedExample.md for a real-world usage example.

Features

  • 🌐 Fetch any publicly accessible web page by URL
  • 🔍 Parse HTML content using the fast deno-dom library
  • 📋 Extract text content using standard CSS selectors
  • 🎯 Support for complex selectors (classes, IDs, attributes, pseudo-selectors)
  • ⚡ Built-in error handling for network issues and parsing failures
  • 🛡️ Safe execution with minimal required permissions

Prerequisites

  • Deno installed on your system
  • Network access for fetching web pages

Running the Server

bash
deno run --allow-net jsr:@sigma/scrap-mcp

You can also run this with Bun and Node.js using bunx and npx respectively:

bash
bunx rjsr @sigma/scrap-mcp
bash
npx rjsr @sigma/scrap-mcp

MCP Tool Reference

scrape_page

The main tool for scraping web pages and extracting content.

Parameters:

  • url (string, required): The URL of the page to scrape
  • query_selector (string, required): CSS selector to query elements

Return Format:

Found X elements matching selector "SELECTOR" on URL:

Element 1: TEXT_CONTENT

Element 2: TEXT_CONTENT
...

Usage Examples

Basic Selectors

  1. 1.Extract all headings:

```json

{ "url": "https://example.com", "query_selector": "h1, h2, h3" }

```

  1. 1.Extract all paragraphs:

```json

{ "url": "https://example.com", "query_selector": "p" }

```

  1. 1.Extract content from specific classes:

```json

{ "url": "https://news.ycombinator.com", "query_selector": ".titleline > a" }

```

  1. 1.Extract all links:

```json

{ "url": "https://example.com", "query_selector": "a" }

```

Advanced Selectors

  1. 1.Extract navigation items:

```json

{ "url": "https://deno.land", "query_selector": "nav a" }

```

  1. 1.Extract elements with specific attributes:

```json

{ "url": "https://example.com", "query_selector": "a[href^='https://']" }

```

  1. 1.Extract form inputs:

```json

{ "url": "https://example.com", "query_selector": "input[type='text'], input[type='email']" }

```

CSS Selector Reference

Basic Selectors

  • h1 - All H1 headings
  • .className - All elements with class "className"
  • #elementId - Element with ID "elementId"
  • * - All elements

Combinators

Don't lose this

Three weeks from now, you'll want Scrap Mcp again. Will you remember where to find it?

Save it to your library and the next time you need Scrap Mcp, it’s one tap away — from any AI app you use. Group it into a bench with the rest of the team for that kind of task and you can pull the whole stack at once.

⚡ Pro tip for geeks: add a-gnt 🤵🏻‍♂️ as a custom connector in Claude or a custom GPT in ChatGPT — one click and your library is right there in the chat. Or, if you’re in an editor, install the a-gnt MCP server and say “use my [bench name]” in Claude Code, Cursor, VS Code, or Windsurf.

🤵🏻‍♂️

a-gnt's Take

Our honest review

This plugs directly into your AI and gives it new abilities it didn't have before. An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selec. Once connected, just ask your AI to use it. It's completely free and works across most major AI apps. This one just landed in the catalog — worth trying while it's fresh.

Tips for getting started

1

Tap "Get" above, pick your AI app, and follow the steps. Most installs take under 30 seconds.

What's New

Version 1.0.06 days ago

Imported from GitHub

Ratings & Reviews

0.0

out of 5

0 ratings

No reviews yet. Be the first to share your experience.