MCPTube
Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Works as an MCP server for Claude, Copilot & more.
What is MCPTube?
MCPTube is a Model Context Protocol (MCP) server that lets AI assistants such as Claude, Cursor, and VS Code turn YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search.
This server falls under the Search & Data Extraction category on MCPgee, the world's largest MCP server directory with 33,000+ servers.
Features
- Transform YouTube videos into a compounding knowledge base w
Installation
Manual Installation
npx mcptube
How to Set Up and Use MCPTube
MCPTube (mcptube-vision) is a Python MCP server that transforms YouTube videos into a persistent, structured knowledge base using both transcripts and visual frame analysis. Unlike simple transcript extractors, it applies an LLM-powered WikiEngine that extracts typed knowledge objects (entities, topics, concepts, and video summaries), merges them across videos, and stores them in a local SQLite FTS5 index plus JSON files on disk. The result is a compounding knowledge base that grows more useful with every video you ingest: when you ask a question, the agent reasons over compiled wiki pages rather than re-searching raw chunks from scratch.
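To make the idea of merging knowledge objects across videos concrete, here is a minimal sketch. It is not MCPTube's actual WikiEngine: the `KnowledgeObject` class, its fields, and the `merge` function are hypothetical names chosen for illustration, and the real merge logic (which is LLM-driven) is likely more involved.

```python
from dataclasses import dataclass, field


@dataclass
class KnowledgeObject:
    """Hypothetical typed knowledge object; not MCPTube's real schema."""
    kind: str                                          # e.g. "entity", "topic", "concept"
    name: str
    notes: list[str] = field(default_factory=list)     # facts extracted from videos
    sources: set[str] = field(default_factory=set)     # IDs of contributing videos


def merge(store: dict, obj: KnowledgeObject) -> None:
    """Fold obj into store, combining it with an existing object of the same kind and name."""
    key = (obj.kind, obj.name.casefold())
    existing = store.get(key)
    if existing is None:
        store[key] = obj
        return
    # Append only notes that are not already recorded, preserving order.
    for note in obj.notes:
        if note not in existing.notes:
            existing.notes.append(note)
    existing.sources |= obj.sources


store = {}
merge(store, KnowledgeObject("entity", "Andrej Karpathy", ["Explains backpropagation"], {"vid_a"}))
merge(store, KnowledgeObject("entity", "andrej karpathy", ["Discusses tokenization"], {"vid_b"}))
```

After the two calls, `store` holds a single entry for the entity with notes from both videos, which is the sense in which the knowledge base compounds rather than accumulating isolated per-video records.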
Prerequisites
- Python 3.10 or higher
- pip or uv package manager
- ffmpeg installed and on PATH (required for scene-change frame extraction)
- An API key for a vision-capable LLM: OpenAI (GPT-4o), Anthropic (Claude), or Google (Gemini) — auto-detected by priority
- An MCP-compatible client: Claude Desktop, GitHub Copilot in VSCode, or any stdio MCP client
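The prerequisites above can be checked with a short preflight script. This is a sketch, not part of MCPTube: the `preflight` and `detect_provider` helpers are hypothetical, and the priority order (OpenAI, then Anthropic, then Google) is an assumption based on the order in which the providers are listed here.

```python
import os
import shutil
import sys

# Assumed priority order; MCPTube's real detection order may differ.
PROVIDER_KEYS = [
    ("openai", "OPENAI_API_KEY"),
    ("anthropic", "ANTHROPIC_API_KEY"),
    ("google", "GOOGLE_API_KEY"),
]


def detect_provider(env):
    """Return the first provider whose API key variable is set and non-empty, else None."""
    for provider, variable in PROVIDER_KEYS:
        if env.get(variable):
            return provider
    return None


def preflight(env=None):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    env = os.environ if env is None else env
    problems = []
    if sys.version_info < (3, 10):
        problems.append("Python 3.10 or higher is required")
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    if detect_provider(env) is None:
        problems.append("no LLM API key is set")
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("MISSING:", problem)
```

Running the script before registering the server prints one line per unmet prerequisite and nothing when everything is in place.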
Install mcptube and ffmpeg
Install the mcptube package via pip and ensure ffmpeg is available on your system for scene-change frame detection.
pip install mcptube
# macOS: brew install ffmpeg
# Ubuntu/Debian: sudo apt install ffmpeg
# Windows: download from ffmpeg.org and add to PATH
Set your LLM API key
MCPTube auto-detects which LLM provider to use based on which API key is set. Set at least one of the following environment variables. In MCP client config mode, set these in the env block.
export OPENAI_API_KEY=your-openai-api-key
# or: export ANTHROPIC_API_KEY=your-anthropic-api-key
# or: export GOOGLE_API_KEY=your-google-api-key
Add MCPTube to your MCP client configuration
Register mcptube as an stdio MCP server in your client configuration. Pass your API key via the env block.
{
"mcpServers": {
"mcptube": {
"command": "python",
"args": ["-m", "mcptube"],
"env": {
"OPENAI_API_KEY": "your-openai-api-key"
}
}
}
}
Ingest your first YouTube video
Ask your AI client to add a YouTube video to the knowledge base. The server will extract the transcript, detect scene changes with ffmpeg, run vision analysis on key frames, and write wiki pages to disk.
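For a sense of what scene-change frame extraction with ffmpeg can look like, the sketch below builds a command around ffmpeg's `select` filter and its `scene` score. It is an illustration only: the `scene_frame_command` helper, the 0.4 threshold, and the output file pattern are assumptions, and MCPTube's actual ffmpeg invocation may differ.

```python
from pathlib import Path


def scene_frame_command(video_path, out_dir, threshold=0.4):
    """Build an ffmpeg argument list that writes one JPEG per detected scene change."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must be between 0 and 1")
    pattern = str(Path(out_dir) / "frame_%04d.jpg")
    return [
        "ffmpeg",
        "-i", str(video_path),
        # Keep only frames whose scene-change score exceeds the threshold.
        "-vf", f"select='gt(scene,{threshold})'",
        # Emit frames at a variable rate; older ffmpeg builds use "-vsync vfr" instead.
        "-fps_mode", "vfr",
        pattern,
    ]


command = scene_frame_command("lecture.mp4", "frames", threshold=0.3)
```

The list can be passed to `subprocess.run(command, check=True)` once the output directory exists. A lower threshold yields more frames, which raises the cost of the vision analysis step.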
Query the knowledge base
After ingesting videos, ask questions. The agent uses FTS5 search to find relevant wiki pages, then reasons over them with full structural context — citations point back to specific videos and timestamps.
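The search step can be sketched with Python's built-in `sqlite3` module, assuming the interpreter's SQLite build includes the FTS5 extension. The table name `wiki`, its columns, and the sample rows are hypothetical and do not reflect MCPTube's real schema.

```python
import sqlite3


def build_index(pages):
    """Create an in-memory FTS5 index from (title, body, video_id, timestamp) tuples."""
    conn = sqlite3.connect(":memory:")
    # video_id and start_time are stored for citations but excluded from full-text matching.
    conn.execute(
        "CREATE VIRTUAL TABLE wiki USING fts5("
        "title, body, video_id UNINDEXED, start_time UNINDEXED)"
    )
    conn.executemany("INSERT INTO wiki VALUES (?, ?, ?, ?)", pages)
    return conn


def search(conn, query, limit=5):
    """Return (title, video_id, start_time) rows matching query, best match first."""
    rows = conn.execute(
        "SELECT title, video_id, start_time FROM wiki "
        "WHERE wiki MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


conn = build_index([
    ("Attention mechanisms", "Self-attention lets each token weigh other tokens.", "vid_a", "00:12:30"),
    ("RLHF", "Reinforcement learning from human feedback fine-tunes a model.", "vid_b", "00:03:10"),
])
hits = search(conn, "attention")
```

Each hit carries the video ID and timestamp alongside the page title, which is what lets an answer cite a specific moment in a specific video. Free-form user input containing quotes or hyphens would need escaping before being passed as an FTS5 query.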
MCPTube Examples
Client configuration (Claude Desktop)
Add this block to your claude_desktop_config.json to enable MCPTube with an OpenAI API key for vision analysis.
{
"mcpServers": {
"mcptube": {
"command": "python",
"args": ["-m", "mcptube"],
"env": {
"OPENAI_API_KEY": "your-openai-api-key"
}
}
}
}
Prompts to try
Use these prompts to ingest videos and query the compounding knowledge base MCPTube builds.
- "Add this YouTube video to the knowledge base: https://www.youtube.com/watch?v=dQw4w9WgXcQ"
- "What did the speaker say about attention mechanisms in the videos I've added?"
- "Summarize all the content about reinforcement learning from human feedback (RLHF) across my video library"
- "Which videos in the knowledge base mention Andrej Karpathy?"
- "What visual content (diagrams, code on screen) appeared in the machine learning lecture I ingested?"
Troubleshooting MCPTube
ffmpeg not found or scene extraction fails
MCPTube requires ffmpeg on your system PATH for scene-change frame detection. Install it via brew install ffmpeg (macOS), sudo apt install ffmpeg (Linux), or download from ffmpeg.org (Windows). Verify with ffmpeg -version in your terminal.
Transcript extraction fails for a YouTube video
MCPTube uses youtube-transcript-api which requires the video to have captions enabled (auto-generated or manual). Videos with captions disabled or age-restricted videos cannot be transcribed. Try a different video, or check if captions are available in the YouTube video settings.
LLM API errors during wiki extraction or vision analysis
Verify your API key environment variable is correctly set in the MCP client config env block. GPT-4o vision requires an OpenAI account with billing enabled. If you hit rate limits during ingest, wait and retry — already-processed stages are not repeated for the same video.
Frequently Asked Questions about MCPTube
What is MCPTube?
MCPTube is a Model Context Protocol (MCP) server that transforms YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. It works with Claude, Copilot, and other MCP clients, connecting AI assistants to external tools and data sources through a standardized interface.
How do I install MCPTube?
Install the package with pip install mcptube, make sure ffmpeg is on your PATH, and add the server config to your AI client. The MCPTube GitHub repository has the full installation instructions.
Which AI clients work with MCPTube?
MCPTube works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.
Is MCPTube free to use?
Yes, MCPTube is open source and available under the MIT license. You can use it freely in both personal and commercial projects.
MCPTube Alternatives — Similar Search & Data Extraction Servers
Looking for alternatives to MCPTube? Here are other popular search & data extraction servers you can use with Claude, Cursor, and VS Code.
TrendRadar
★ 58.0k
A real-time hotspot monitoring and news aggregation assistant that provides AI-powered analysis of trending topics across multiple platforms via the Model Context Protocol. It enables users to track news and receive automated notifications through va
Scrapling
★ 52.7k
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
PDF Math Translate
★ 33.9k
[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/MCP/Docker/Zotero
GPT Researcher
★ 27.2k
An autonomous agent that conducts deep research on any data using any LLM providers
Agent Reach
★ 20.1k
Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu: one CLI, zero API fees.
Xiaohongshu
★ 13.7k
MCP for xiaohongshu.com