MediaCrawler MCP Server

v1.0.0Search & Data Extractionstable

Enables AI assistants to crawl and extract data from Chinese social media platforms like Bilibili, Xiaohongshu, and Douyin. Provides search, content detail retrieval, and creator information tools with persistent browser sessions and QR code login su

mediacrawler-mcp-servermcpai-integration
Share:
37
Stars
0
Downloads
0
Weekly
0/5

What is MediaCrawler MCP Server?

MediaCrawler MCP Server is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to enables ai assistants to crawl and extract data from chinese social media platforms like bilibili, xiaohongshu, and douyin. provides search, content detail retrieval, and creator information tools wit...

Enables AI assistants to crawl and extract data from Chinese social media platforms like Bilibili, Xiaohongshu, and Douyin. Provides search, content detail retrieval, and creator information tools with persistent browser sessions and QR code login su

This server falls under the Search & Data Extraction category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

  • Enables AI assistants to crawl and extract data from Chinese

Use Cases

Extract data from Chinese social media platforms like Bilibili, Xiaohongshu, and Douyin. Search content and retrieve creator information with persistent sessions.
mcp-service

Maintainer

LicenseMIT
Languagepython
Versionv1.0.0
UpdatedMay 5, 2026
Statushealthy
Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

Installation

Manual Installation

npx mediacrawler-mcp-server

Configuration

Configuration Details

Config File

claude_desktop_config.json

Performance

Response Metrics

Response Time< 200ms
ThroughputMedium

Resource Usage

Memory UsageLow
CPU UsageLow

How to Set Up and Use MediaCrawler MCP Server

MediaCrawler MCP Server is a Python-based service that enables AI assistants to search and extract data from major Chinese social media platforms — including Bilibili, Xiaohongshu (Little Red Book), and Douyin — through a persistent browser session managed by Playwright. It provides platform-specific tools for keyword search, content detail retrieval, creator profile lookup, and comment thread extraction, with QR code login support for platforms that require authentication. Researchers, marketers, and developers who need to gather structured data from Chinese social platforms as part of AI workflows can use this server as an automated extraction backend.

Prerequisites

  • Python 3.10 or later with Poetry package manager
  • Redis server running locally (used for session and job management)
  • Playwright with Chromium browser installed
  • Platform accounts for Bilibili, Xiaohongshu, or Douyin (QR code login is supported)
  • An MCP-compatible client or SSE-capable integration
1

Clone the repository and install Python dependencies

Clone the MediaCrawler MCP service repository and install all dependencies using Poetry.

git clone https://github.com/mcp-service/media-crawler-mcp-service.git
cd media-crawler-mcp-service
poetry install
2

Install the Playwright Chromium browser

MediaCrawler uses Playwright to drive a real browser for platform interactions. Install the Chromium browser binary.

poetry run playwright install chromium
3

Configure the environment

Copy the example environment file and adjust settings such as the service port (default 9090), Redis connection parameters, and which platforms to enable.

cp .env.example .env
4

Start Redis and launch the service

Start your Redis server, then launch the MediaCrawler MCP service. The service will be available on port 9090 by default.

redis-server &
poetry run python main.py
5

Log in to platforms via QR code

Visit the admin UI at http://localhost:9090/admin to initiate QR code login sessions for the platforms you want to crawl. Sessions are persisted across restarts.

6

Configure your MCP client to connect via SSE

Point your MCP client at the SSE endpoint. The server uses Server-Sent Events transport rather than stdio.

MediaCrawler MCP Server Examples

Client configuration

Configure your MCP client to connect to the MediaCrawler service via its SSE endpoint.

{
  "mcpServers": {
    "mediacrawler": {
      "url": "http://localhost:9090/mcp",
      "transport": "sse"
    }
  }
}

Prompts to try

Once connected, use these prompts to search and extract data from Chinese social media platforms.

- "Search Bilibili for videos about Python machine learning and show me the top 5 results"
- "Get the details and stats for Bilibili video BV1234567890"
- "Search Xiaohongshu for posts about skincare routines with the keyword 保湿"
- "Get the creator profile and recent posts for this Xiaohongshu user"
- "Show me the comment thread on this Bilibili video"

Troubleshooting MediaCrawler MCP Server

Service fails to start with Redis connection error

Ensure Redis is running before starting main.py. Run redis-server in a separate terminal or as a background service. Check the .env file for the correct Redis host and port settings.

Xiaohongshu detail requests fail or return empty results

The xhs_crawler_detail tool requires a valid xsec_token and xsec_source from a prior search result. Always use the token returned by xhs_search when fetching post details.

Platform login session expires or QR code scan fails

Visit http://localhost:9090/admin to reinitiate a fresh QR code login session for the affected platform. Persistent sessions are stored but can expire if the platform revokes them.

Frequently Asked Questions about MediaCrawler MCP Server

What is MediaCrawler MCP Server?

MediaCrawler MCP Server is a Model Context Protocol (MCP) server that enables ai assistants to crawl and extract data from chinese social media platforms like bilibili, xiaohongshu, and douyin. provides search, content detail retrieval, and creator information tools with persistent browser sessions and qr code login su It connects AI assistants to external tools and data sources through a standardized interface.

How do I install MediaCrawler MCP Server?

Follow the installation instructions on the MediaCrawler MCP Server GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.

Which AI clients work with MediaCrawler MCP Server?

MediaCrawler MCP Server works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is MediaCrawler MCP Server free to use?

Yes, MediaCrawler MCP Server is open source and available under the MIT license. You can use it freely in both personal and commercial projects.

Browse More Search & Data Extraction MCP Servers

Explore all search & data extraction servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.

Quick Config Preview

{ "mcpServers": { "mediacrawler-mcp-server": { "command": "npx", "args": ["-y", "mediacrawler-mcp-server"] } } }

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use MediaCrawler MCP Server?

Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

33,000+ ServersFree & Open SourceStep-by-Step Guides