Anansi

v1.0.0Search & Data Extractionstable

A self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and Chrome TLS fingerprinting evades bot detection. Ships with an MCP server so any LLM can drive a full crawl through conversati

adaptive-scrapingai-agentanti-botcrawlerdata-extraction
Share:
86
Stars
0
Downloads
0
Weekly
0/5

What is Anansi?

Anansi is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and chrome tls fingerprinting evades bot detection. ships with an mcp server so a...

A self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and Chrome TLS fingerprinting evades bot detection. Ships with an MCP server so any LLM can drive a full crawl through conversati

This server falls under the Search & Data Extraction category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

  • A self-healing web scraper built for hostile sites: selector

Use Cases

Self-healing web scraper for resistant sites
Anti-bot detection evasion
Adaptive selector repair and rendering
mdowis

Maintainer

LicenseApache-2.0
Languagepython
Versionv1.0.0
UpdatedMay 21, 2026
Statushealthy
Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

Installation

Manual Installation

npx anansi

Configuration

Configuration Details

Config File

claude_desktop_config.json

Performance

Response Metrics

Response Time< 200ms
ThroughputMedium

Resource Usage

Memory UsageLow
CPU UsageLow

How to Set Up and Use Anansi

Anansi is a self-healing web scraper MCP server designed to extract structured data from hostile and bot-protected websites. It repairs broken CSS selectors automatically using confidence scoring, upgrades to headless browser rendering when JavaScript is required, and mimics Chrome TLS fingerprints to evade Cloudflare and similar defenses. With 17 exposed MCP tools covering single-page fetching, full-site crawling, selector training, and screenshot capture, any LLM can drive complete data extraction pipelines through conversation.

Prerequisites

  • Python 3.10 or later with pip installed
  • Playwright and Chromium for browser-based rendering (optional but recommended for JS-heavy sites)
  • curl-cffi for TLS fingerprint mimicry (install with the [tls] extra)
  • An MCP client such as Claude Desktop or Claude Code
1

Install Anansi from GitHub

Install the core package. For sites protected by Cloudflare or Akamai, also install the TLS fingerprinting extra.

# Core install
pip install "git+https://github.com/mdowis/anansi"

# With TLS fingerprint mimicry (recommended for bot-protected sites)
pip install "anansi-scraper[tls] @ git+https://github.com/mdowis/anansi"
2

Install Playwright and Chromium for browser rendering

Install Playwright's Chromium binary to enable automatic browser upgrade when JS rendering is detected on a page.

pip install playwright
playwright install chromium
3

Configure operator environment variables (optional)

Set environment variables before starting the MCP server to control security and anti-bot behavior. These cannot be changed by the LLM client at runtime.

# Allow scraping internal/private network addresses (off by default)
export ANANSI_ALLOW_PRIVATE_NETWORKS=true

# Set default TLS fingerprint target (e.g. chrome124)
export ANANSI_IMPERSONATE=chrome124

# Disable anti-bot evasion if not needed
# export ANANSI_DISABLE_ANTIBOT=true
4

Register the MCP server with your client

Add anansi-mcp as the server command in your Claude Desktop or Claude Code configuration.

{
  "mcpServers": {
    "anansi": {
      "command": "anansi-mcp",
      "args": [],
      "env": {
        "ANANSI_IMPERSONATE": "chrome124"
      }
    }
  }
}
5

For Claude Code, use the CLI shortcut

If you use Claude Code, register the server directly with the mcp add command.

claude mcp add anansi -- anansi-mcp
6

Restart the client and test the connection

Restart your MCP client. Ask it to fetch a webpage or start a crawl to confirm the 17 Anansi tools are available.

Anansi Examples

Client configuration

Claude Desktop JSON config for Anansi with TLS fingerprint impersonation enabled.

{
  "mcpServers": {
    "anansi": {
      "command": "anansi-mcp",
      "args": [],
      "env": {
        "ANANSI_IMPERSONATE": "chrome124"
      }
    }
  }
}

Prompts to try

Example prompts that use Anansi's scraping and crawling tools.

- "Fetch https://example.com/products and extract all product names and prices as JSON."
- "Crawl https://news.ycombinator.com and collect the top 30 story titles and URLs."
- "Take a screenshot of https://example.com and show me what it looks like."
- "Start a crawl of https://shop.example.com/category/shoes and export results as CSV when done."
- "Check the selector health for the URL pattern https://example.com/products/*."

Troubleshooting Anansi

Scraping fails on Cloudflare-protected sites even with the package installed

Ensure you installed the TLS extra: 'pip install anansi-scraper[tls] @ git+https://github.com/mdowis/anansi'. Then set ANANSI_IMPERSONATE=chrome124 in the server environment. Playwright Chromium must also be installed for sites that serve a JS challenge page.

anansi-mcp command not found after installation

The entry point is installed into your Python environment's bin directory. If using a virtual environment, activate it first. If using pip with --user, ensure ~/.local/bin is in your PATH.

Browser rendering not triggering for JavaScript-heavy pages

Anansi auto-upgrades to browser rendering when it detects SPA markers, but you can force it by passing use_browser=true in the fetch_url tool call. Confirm that 'playwright install chromium' completed without errors.

Frequently Asked Questions about Anansi

What is Anansi?

Anansi is a Model Context Protocol (MCP) server that self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and chrome tls fingerprinting evades bot detection. ships with an mcp server so any llm can drive a full crawl through conversati It connects AI assistants to external tools and data sources through a standardized interface.

How do I install Anansi?

Follow the installation instructions on the Anansi GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.

Which AI clients work with Anansi?

Anansi works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is Anansi free to use?

Yes, Anansi is open source and available under the Apache-2.0 license. You can use it freely in both personal and commercial projects.

Browse More Search & Data Extraction MCP Servers

Explore all search & data extraction servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.

Quick Config Preview

{ "mcpServers": { "anansi": { "command": "npx", "args": ["-y", "anansi"] } } }

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use Anansi?

Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

33,000+ ServersFree & Open SourceStep-by-Step Guides