MCPTube

v1.0.0Search & Data Extractionstable

Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Works as an MCP server for Claude, Copilot & more.

aiclaudecodexgeminimcp
Share:
109
Stars
0
Downloads
0
Weekly
0/5

What is MCPTube?

MCPTube is a Model Context Protocol (MCP) server that allows AI assistants like Claude, Cursor, and VS Code to transform youtube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. works as an mcp server for claude, copilot & more.

Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Works as an MCP server for Claude, Copilot & more.

This server falls under the Search & Data Extraction category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Features

  • Transform YouTube videos into a compounding knowledge base w

Use Cases

YouTube video transcript extraction
Vision-based video analysis
Knowledge base building
0xchamin

Maintainer

LicenseMIT
Languagepython
Versionv1.0.0
UpdatedMay 19, 2026
Statushealthy
Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

Installation

Manual Installation

npx mcptube

Configuration

Configuration Details

Config File

claude_desktop_config.json

Performance

Response Metrics

Response Time< 200ms
ThroughputMedium

Resource Usage

Memory UsageLow
CPU UsageLow

How to Set Up and Use MCPTube

MCPTube (mcptube-vision) is a Python MCP server that transforms YouTube videos into a persistent, structured knowledge base using both transcripts and visual frame analysis. Unlike simple transcript extractors, it applies a LLM-powered WikiEngine that extracts typed knowledge objects (entities, topics, concepts, and video summaries), merges them across videos, and stores them in a local SQLite FTS5 index plus JSON files on disk. The result is a compounding knowledge base that gets smarter with every video you ingest — you can then ask questions and the agent reasons over compiled wiki pages rather than re-searching raw chunks from scratch.

Prerequisites

  • Python 3.10 or higher
  • pip or uv package manager
  • ffmpeg installed and on PATH (required for scene-change frame extraction)
  • An API key for a vision-capable LLM: OpenAI (GPT-4o), Anthropic (Claude), or Google (Gemini) — auto-detected by priority
  • An MCP-compatible client: Claude Desktop, GitHub Copilot in VSCode, or any stdio MCP client
1

Install mcptube and ffmpeg

Install the mcptube package via pip and ensure ffmpeg is available on your system for scene-change frame detection.

pip install mcptube
# macOS: brew install ffmpeg
# Ubuntu/Debian: sudo apt install ffmpeg
# Windows: download from ffmpeg.org and add to PATH
2

Set your LLM API key

MCPTube auto-detects which LLM provider to use based on which API key is set. Set at least one of the following environment variables. In MCP client config mode, set these in the env block.

export OPENAI_API_KEY=your-openai-api-key
# or: export ANTHROPIC_API_KEY=your-anthropic-api-key
# or: export GOOGLE_API_KEY=your-google-api-key
3

Add MCPTube to your MCP client configuration

Register mcptube as an stdio MCP server in your client configuration. Pass your API key via the env block.

{
  "mcpServers": {
    "mcptube": {
      "command": "python",
      "args": ["-m", "mcptube"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}
4

Ingest your first YouTube video

Ask your AI client to add a YouTube video to the knowledge base. The server will extract the transcript, detect scene changes with ffmpeg, run vision analysis on key frames, and write wiki pages to disk.

5

Query the knowledge base

After ingesting videos, ask questions. The agent uses FTS5 search to find relevant wiki pages, then reasons over them with full structural context — citations point back to specific videos and timestamps.

MCPTube Examples

Client configuration (Claude Desktop)

Add this block to your claude_desktop_config.json to enable MCPTube with an OpenAI API key for vision analysis.

{
  "mcpServers": {
    "mcptube": {
      "command": "python",
      "args": ["-m", "mcptube"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}

Prompts to try

Use these prompts to ingest videos and query the compounding knowledge base MCPTube builds.

- "Add this YouTube video to the knowledge base: https://www.youtube.com/watch?v=dQw4w9WgXcQ"
- "What did the speaker say about attention mechanisms in the videos I've added?"
- "Summarize all the content about reinforcement learning from human feedback (RLHF) across my video library"
- "Which videos in the knowledge base mention Andrej Karpathy?"
- "What visual content (diagrams, code on screen) appeared in the machine learning lecture I ingested?"

Troubleshooting MCPTube

ffmpeg not found or scene extraction fails

MCPTube requires ffmpeg on your system PATH for scene-change frame detection. Install it via brew install ffmpeg (macOS), sudo apt install ffmpeg (Linux), or download from ffmpeg.org (Windows). Verify with ffmpeg -version in your terminal.

Transcript extraction fails for a YouTube video

MCPTube uses youtube-transcript-api which requires the video to have captions enabled (auto-generated or manual). Videos with captions disabled or age-restricted videos cannot be transcribed. Try a different video, or check if captions are available in the YouTube video settings.

LLM API errors during wiki extraction or vision analysis

Verify your API key environment variable is correctly set in the MCP client config env block. GPT-4o vision requires an OpenAI account with billing enabled. If you hit rate limits during ingest, wait and retry — already-processed stages are not repeated for the same video.

Frequently Asked Questions about MCPTube

What is MCPTube?

MCPTube is a Model Context Protocol (MCP) server that transform youtube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. works as an mcp server for claude, copilot & more. It connects AI assistants to external tools and data sources through a standardized interface.

How do I install MCPTube?

Follow the installation instructions on the MCPTube GitHub repository. Clone the repo, install dependencies, and add the server config to your AI client.

Which AI clients work with MCPTube?

MCPTube works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is MCPTube free to use?

Yes, MCPTube is open source and available under the MIT license. You can use it freely in both personal and commercial projects.

Browse More Search & Data Extraction MCP Servers

Explore all search & data extraction servers available in the MCPgee directory. Each server includes setup guides for Claude, Cursor, and VS Code.

Quick Config Preview

{ "mcpServers": { "mcptube": { "command": "npx", "args": ["-y", "mcptube"] } } }

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use MCPTube?

Browse our complete directory of 33,000+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

33,000+ ServersFree & Open SourceStep-by-Step Guides