Crawlbase

v1.0.0Search & Data Extractionstable

Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and Windsurf integrations with battle-tested web scraping, JavaScript rendering, and anti-bot protection enabling structured, live data inside your AI

ai-agentscrawlbasecrawlbase-mcpllm-integrationmcp
Share:
55
Stars
0
Downloads
0
Weekly
0/5

What is Crawlbase?

Crawlbase is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to crawlbase mcp server connects ai agents and llms with real-time web data. it powers claude, cursor, and windsurf integrations with battle-tested web scraping, javascript rendering, and anti-bot protec...

Crawlbase MCP Server connects AI agents and LLMs with real-time web data. It powers Claude, Cursor, and Windsurf integrations with battle-tested web scraping, JavaScript rendering, and anti-bot protection enabling structured, live data inside your AI

This server falls under the Search & Data Extraction category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

  • Crawlbase MCP Server connects AI agents and LLMs with real-t

Use Cases

Real-time web data extraction
JavaScript rendering and anti-bot protection
crawlbase

Maintainer

LicenseMIT
Languagejavascript
Versionv1.0.0
UpdatedMay 10, 2026
Statushealthy
Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

Installation

Manual Installation

npx crawlbase

Configuration

Configuration Details

Config File

claude_desktop_config.json

Performance

Response Metrics

Response Time< 200ms
ThroughputMedium

Resource Usage

Memory UsageLow
CPU UsageLow

How to Set Up and Use Crawlbase

The Crawlbase MCP Server gives AI agents and LLMs real-time access to live web data by connecting them to the Crawlbase scraping infrastructure, which handles JavaScript rendering, proxy rotation, and anti-bot evasion automatically. It exposes three core tools—fetching raw HTML, extracting clean Markdown, and capturing screenshots—so Claude, Cursor, and Windsurf can retrieve accurate, up-to-date information from any public webpage without hitting bot-detection walls. Development teams and researchers use it when they need structured live data inside AI workflows without building and maintaining their own scraping stack.

Prerequisites

  • Node.js 18 or higher with npx available
  • A Crawlbase account with a Normal Token (for standard HTML) and a JavaScript Token (for JS-rendered pages) — sign up at crawlbase.com
  • An MCP-compatible client such as Claude Desktop, Cursor IDE, or Windsurf
1

Sign up for Crawlbase and retrieve your API tokens

Create a free Crawlbase account at crawlbase.com. After signup, navigate to your dashboard to copy your Normal Token (for standard HTML pages) and JavaScript Token (for JavaScript-rendered pages and screenshots).

2

Locate your MCP client configuration file

Find the configuration file for your MCP client. Claude Desktop uses claude_desktop_config.json at ~/Library/Application Support/Claude/ on macOS or %APPDATA%\Claude\ on Windows.

3

Add Crawlbase to your MCP configuration

Insert the Crawlbase server block into your MCP config, providing both API tokens as environment variables.

{
  "mcpServers": {
    "crawlbase": {
      "type": "stdio",
      "command": "npx",
      "args": ["@crawlbase/mcp@latest"],
      "env": {
        "CRAWLBASE_TOKEN": "your_normal_token_here",
        "CRAWLBASE_JS_TOKEN": "your_js_token_here"
      }
    }
  }
}
4

Restart your MCP client

Quit and reopen Claude Desktop (or reload your IDE) to load the Crawlbase server. The crawl, crawl_markdown, and crawl_screenshot tools will appear in the available tools list.

5

Test the integration with a sample crawl

Ask your AI assistant to crawl a webpage. The server will route the request through Crawlbase's infrastructure and return the content.

Crawlbase Examples

Client configuration

Complete claude_desktop_config.json entry for the Crawlbase MCP server with both tokens configured.

{
  "mcpServers": {
    "crawlbase": {
      "type": "stdio",
      "command": "npx",
      "args": ["@crawlbase/mcp@latest"],
      "env": {
        "CRAWLBASE_TOKEN": "your_normal_token_here",
        "CRAWLBASE_JS_TOKEN": "your_js_token_here"
      }
    }
  }
}

Prompts to try

After connecting, use these prompts to extract live web data inside your AI assistant.

- "Crawl Hacker News and return the top 10 stories in markdown format"
- "Fetch the Tesla investor relations page as raw HTML and extract all press release titles"
- "Take a screenshot of the TechCrunch homepage"
- "Get the current pricing page for Vercel as clean markdown"
- "Crawl the GitHub trending page and list the top 5 repositories with their descriptions"

Troubleshooting Crawlbase

crawl_screenshot returns an error or blank image

Screenshots require the JavaScript Token (CRAWLBASE_JS_TOKEN), not the Normal Token. Verify that CRAWLBASE_JS_TOKEN is set correctly in the env block and that you have JS crawl credits remaining in your Crawlbase account.

Pages with JavaScript-heavy content return incomplete or empty HTML

Use the crawl_markdown or crawl_screenshot tool (which use CRAWLBASE_JS_TOKEN) instead of crawl. JS-rendered pages require the JavaScript Token to execute scripts before returning content.

Server fails to start with 'npx: command not found'

Ensure Node.js 18+ is installed and npx is on your PATH. Run `npx --version` in a terminal to confirm. On macOS, install Node.js via Homebrew: `brew install node`.

Frequently Asked Questions about Crawlbase

What is Crawlbase?

Crawlbase is a Model Context Protocol (MCP) server that crawlbase mcp server connects ai agents and llms with real-time web data. it powers claude, cursor, and windsurf integrations with battle-tested web scraping, javascript rendering, and anti-bot protection enabling structured, live data inside your ai It connects AI assistants to external tools and data sources through a standardized interface.

How do I install Crawlbase?

Follow the installation instructions on the Crawlbase GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.

Which AI clients work with Crawlbase?

Crawlbase works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is Crawlbase free to use?

Yes, Crawlbase is open source and available under the MIT license. You can use it freely in both personal and commercial projects.

Browse More Search & Data Extraction MCP Servers

Explore all search & data extraction servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.

Quick Config Preview

{ "mcpServers": { "crawlbase": { "command": "npx", "args": ["-y", "crawlbase"] } } }

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use Crawlbase?

Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

33,000+ ServersFree & Open SourceStep-by-Step Guides