MCPTube

Name: MCPTube MCP Server
Author: 0xchamin

v1.0.0 • Search & Data Extraction • stable

Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Works as an MCP server for Claude, Copilot & more.

Tags: ai, claude, codex, gemini, mcp

Stars: 109

What is MCPTube?

MCPTube is a Model Context Protocol (MCP) server that lets AI assistants like Claude, Cursor, and VS Code transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search.

This server falls under the Search & Data Extraction category on MCPgee, the world's largest MCP server directory with 33,000+ servers.

Use Cases

YouTube video transcript extraction

Vision-based video analysis

Knowledge base building

Maintainer: 0xchamin
License: MIT
Language: Python
Version: v1.0.0
Updated: May 19, 2026
Status: healthy
Maintenance: active
Works with: Claude, OpenAI, Windows, macOS, Linux

Installation

Manual Installation

pip install mcptube

Configuration

Config file: claude_desktop_config.json

Performance

Response time: < 200 ms
Throughput: Medium
Memory usage: Low
CPU usage: Low

How to Set Up and Use MCPTube

MCPTube (mcptube-vision) is a Python MCP server that transforms YouTube videos into a persistent, structured knowledge base using both transcripts and visual frame analysis. Unlike simple transcript extractors, it applies an LLM-powered WikiEngine that extracts typed knowledge objects (entities, topics, concepts, and video summaries), merges them across videos, and stores them in a local SQLite FTS5 index plus JSON files on disk. The result is a compounding knowledge base that becomes more useful with every video you ingest: when you ask a question, the agent reasons over compiled wiki pages rather than re-searching raw chunks from scratch.
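To make the idea of merging typed knowledge objects across videos concrete, here is a minimal sketch. The `WikiPage` class, its fields, and `merge_page` are hypothetical names chosen for illustration; they are not mcptube's actual schema or API.

```python
from dataclasses import dataclass, field


@dataclass
class WikiPage:
    """One compiled knowledge object (hypothetical shape, not mcptube's schema)."""
    kind: str                                        # "entity", "topic", "concept", or "video_summary"
    name: str
    facts: list[str] = field(default_factory=list)
    sources: set[str] = field(default_factory=set)   # IDs of videos that mention it


def merge_page(wiki, kind, name, facts, video_id):
    """Fold newly extracted facts into an existing page, or create the page."""
    key = (kind, name.casefold())                    # case-insensitive identity
    page = wiki.setdefault(key, WikiPage(kind=kind, name=name))
    for fact in facts:
        if fact not in page.facts:                   # skip facts already recorded
            page.facts.append(fact)
    page.sources.add(video_id)
    return page


wiki = {}
merge_page(wiki, "concept", "Attention",
           ["Weights tokens by relevance"], "video_a")
merge_page(wiki, "concept", "attention",
           ["Weights tokens by relevance", "Scales quadratically with length"], "video_b")

page = wiki[("concept", "attention")]
print(len(wiki), page.facts, sorted(page.sources))
```

Both videos end up on a single page that lists each fact once and records both sources, which is the sense in which the knowledge base compounds rather than duplicating content per video.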

Prerequisites

Python 3.10 or higher
pip or uv package manager
ffmpeg installed and on PATH (required for scene-change frame extraction)
An API key for a vision-capable LLM: OpenAI (GPT-4o), Anthropic (Claude), or Google (Gemini); the provider is auto-detected by priority
An MCP-compatible client: Claude Desktop, GitHub Copilot in VS Code, or any stdio MCP client
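The prerequisites above can be checked before installation with a short script. This is a sketch, not part of mcptube: `preflight` is a hypothetical helper, and its `env`, `which`, and `version` parameters exist only so the check can be exercised without touching the real system.

```python
import os
import shutil
import sys

# Environment variables named in this guide; at least one must be set.
API_KEY_VARS = ("OPENAI_API_KEY", "ANTHROPIC_API_KEY", "GOOGLE_API_KEY")


def preflight(env=None, which=shutil.which, version=None):
    """Return a list of unmet prerequisites (an empty list means ready)."""
    env = os.environ if env is None else env
    version = sys.version_info if version is None else version
    problems = []
    if tuple(version[:2]) < (3, 10):
        problems.append("Python 3.10 or higher is required")
    if which("ffmpeg") is None:
        problems.append("ffmpeg was not found on PATH")
    if not any(env.get(name) for name in API_KEY_VARS):
        problems.append("no LLM API key is set")
    return problems


if __name__ == "__main__":
    for problem in preflight():
        print("MISSING:", problem)
```

Running the file directly prints one line per missing prerequisite and nothing when everything is in place.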

Install mcptube and ffmpeg

Install the mcptube package via pip and ensure ffmpeg is available on your system for scene-change frame detection.

pip install mcptube
# macOS: brew install ffmpeg
# Ubuntu/Debian: sudo apt install ffmpeg
# Windows: download from ffmpeg.org and add to PATH

Set your LLM API key

MCPTube auto-detects which LLM provider to use based on which API key is set. Set at least one of the following environment variables. When MCPTube is launched by an MCP client, set them in the env block of the client configuration instead.

export OPENAI_API_KEY=your-openai-api-key
# or: export ANTHROPIC_API_KEY=your-anthropic-api-key
# or: export GOOGLE_API_KEY=your-google-api-key
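Priority-based auto-detection can be pictured as walking an ordered list of environment variables and picking the first one that is set. The sketch below assumes the priority follows the order in which the providers are listed in this guide (OpenAI, then Anthropic, then Google); the actual order used by mcptube is not confirmed here, and `detect_provider` is a hypothetical name.

```python
import os

# Assumed priority order: the order the providers are listed in this guide.
PROVIDER_PRIORITY = (
    ("openai", "OPENAI_API_KEY"),
    ("anthropic", "ANTHROPIC_API_KEY"),
    ("google", "GOOGLE_API_KEY"),
)


def detect_provider(env=None):
    """Return the first provider whose API key is set, or None if none is."""
    env = os.environ if env is None else env
    for provider, var in PROVIDER_PRIORITY:
        if env.get(var, "").strip():      # treat empty or blank values as unset
            return provider
    return None


print(detect_provider({"GOOGLE_API_KEY": "g", "ANTHROPIC_API_KEY": "a"}))
```

If several keys are set and you want a specific provider, the simplest approach under this assumption is to leave only that provider's key in the environment.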

Add MCPTube to your MCP client configuration

{
  "mcpServers": {
    "mcptube": {
      "command": "python",
      "args": ["-m", "mcptube"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}
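If the client already has a configuration file with other servers in it, the entry above should be merged in rather than pasted over the whole file. The following sketch does that with the standard library only. `add_mcptube` is a hypothetical helper, and the path to the configuration file is left to the caller because it differs by client and operating system.

```python
import json
from pathlib import Path


def add_mcptube(config_path, api_key_var="OPENAI_API_KEY",
                api_key="your-openai-api-key"):
    """Add or update the mcptube entry while keeping other servers intact."""
    path = Path(config_path)
    # Start from the existing config if there is one, else from an empty one.
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    servers = config.setdefault("mcpServers", {})
    servers["mcptube"] = {
        "command": "python",
        "args": ["-m", "mcptube"],
        "env": {api_key_var: api_key},
    }
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Call it with the location of your client's configuration file, for example `add_mcptube("claude_desktop_config.json")`, then restart the client so it picks up the change.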

Ingest your first YouTube video

Ask your AI client to add a YouTube video to the knowledge base. The server will extract the transcript, detect scene changes with ffmpeg, run vision analysis on key frames, and write wiki pages to disk.
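For readers curious about the scene-change step, ffmpeg's `select` filter exposes a `scene` score between 0 and 1 that can be used to keep only frames where the picture changes sharply. The sketch below builds such a command; it illustrates the general technique and is not necessarily the command mcptube runs. The 0.4 threshold, the file names, and `scene_frame_command` are assumptions.

```python
import shlex


def scene_frame_command(video_path, out_dir, threshold=0.4):
    """Build an ffmpeg command that saves one JPEG per detected scene change."""
    return [
        "ffmpeg",
        "-i", video_path,
        # Keep only frames whose scene-change score exceeds the threshold.
        "-vf", f"select='gt(scene,{threshold})'",
        # Emit frames at a variable rate so dropped frames are not duplicated.
        # ffmpeg builds older than 5.1 use "-vsync vfr" instead.
        "-fps_mode", "vfr",
        f"{out_dir}/frame_%04d.jpg",
    ]


cmd = scene_frame_command("lecture.mp4", "frames")
print(shlex.join(cmd))   # copy-pasteable shell form of the command
```

The list can be passed to `subprocess.run(cmd, check=True)` once the output directory exists. A lower threshold yields more frames, and therefore more vision API calls, per video.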

Query the knowledge base

After ingesting videos, ask questions. The agent uses FTS5 search to find relevant wiki pages, then reasons over them with full structural context. Citations point back to specific videos and timestamps.
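The search step can be sketched with Python's built-in `sqlite3` module, assuming the bundled SQLite was compiled with the FTS5 extension (most builds are). The `pages` table, its columns, and the sample rows are hypothetical and are not mcptube's real schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# video_id and timestamp are stored for citations but not full-text indexed.
conn.execute(
    "CREATE VIRTUAL TABLE pages USING fts5("
    "title, body, video_id UNINDEXED, timestamp UNINDEXED)"
)
conn.executemany(
    "INSERT INTO pages VALUES (?, ?, ?, ?)",
    [
        ("Attention", "Self-attention weights each token by relevance.",
         "video_a", "00:12:30"),
        ("RLHF", "Human feedback trains a reward model that tunes the policy.",
         "video_b", "00:03:05"),
    ],
)


def search(conn, query, limit=5):
    """Return (title, video_id, timestamp) for the best-matching wiki pages."""
    rows = conn.execute(
        "SELECT title, video_id, timestamp FROM pages "
        "WHERE pages MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


print(search(conn, "attention"))
```

Each hit carries the video ID and timestamp, which is what allows an answer to cite the exact moment in the source video.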

MCPTube Examples

Client configuration (Claude Desktop)

Add this block to your claude_desktop_config.json to enable MCPTube with an OpenAI API key for vision analysis.

{
  "mcpServers": {
    "mcptube": {
      "command": "python",
      "args": ["-m", "mcptube"],
      "env": {
        "OPENAI_API_KEY": "your-openai-api-key"
      }
    }
  }
}

Prompts to try

Use these prompts to ingest videos and query the compounding knowledge base MCPTube builds.

- "Add this YouTube video to the knowledge base: https://www.youtube.com/watch?v=dQw4w9WgXcQ"
- "What did the speaker say about attention mechanisms in the videos I've added?"
- "Summarize all the content about reinforcement learning from human feedback (RLHF) across my video library"
- "Which videos in the knowledge base mention Andrej Karpathy?"
- "What visual content (diagrams, code on screen) appeared in the machine learning lecture I ingested?"

Troubleshooting MCPTube

ffmpeg not found or scene extraction fails

MCPTube requires ffmpeg on your system PATH for scene-change frame detection. Install it via brew install ffmpeg (macOS), sudo apt install ffmpeg (Linux), or download from ffmpeg.org (Windows). Verify with ffmpeg -version in your terminal.

Transcript extraction fails for a YouTube video

MCPTube uses youtube-transcript-api, which requires the video to have captions enabled (auto-generated or manual). Videos with captions disabled and age-restricted videos cannot be transcribed. Try a different video, or check whether captions are available in the YouTube player settings for that video.

LLM API errors during wiki extraction or vision analysis

Verify that your API key environment variable is set correctly in the env block of the MCP client config. GPT-4o vision requires an OpenAI account with billing enabled. If you hit rate limits during ingest, wait and retry. Stages that have already been processed are not repeated for the same video.
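The behavior described above, retrying after a rate limit without redoing finished stages, can be sketched as a small checkpointed pipeline. Everything here is hypothetical (`RateLimitError`, `run_stages`, the JSON state file, and the stage names); it shows the pattern, not mcptube's internals.

```python
import json
import time
from pathlib import Path


class RateLimitError(Exception):
    """Stand-in for whatever error an LLM client raises when throttled."""


def run_stages(video_id, stages, state_path, retries=3, base_delay=1.0,
               sleep=time.sleep):
    """Run (name, func) stages in order, skipping ones already recorded as done."""
    path = Path(state_path)
    state = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    done = state.setdefault(video_id, [])
    for name, func in stages:
        if name in done:
            continue                              # finished on an earlier run
        for attempt in range(retries):
            try:
                func(video_id)
                break
            except RateLimitError:
                if attempt == retries - 1:
                    raise                         # give up; progress is saved
                sleep(base_delay * 2 ** attempt)  # exponential backoff
        done.append(name)
        # Persist after every stage so a crash loses at most one stage.
        path.write_text(json.dumps(state, indent=2), encoding="utf-8")
    return done
```

Because the state file is written after every stage, a second call for the same video resumes at the first unfinished stage instead of starting over.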

Frequently Asked Questions about MCPTube

What is MCPTube?

MCPTube is a Model Context Protocol (MCP) server that transforms YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Like other MCP servers, it connects AI assistants to external tools and data sources through a standardized interface.

How do I install MCPTube?

Install the package with pip install mcptube, make sure ffmpeg is on your PATH, set an LLM API key, and add the server config to your AI client. The MCPTube GitHub repository has the full installation instructions.

Which AI clients work with MCPTube?

MCPTube works with all major MCP-compatible AI clients including Claude Desktop, Claude Code, Cursor, VS Code (GitHub Copilot), Windsurf, and Cline.

Is MCPTube free to use?

Yes, MCPTube is open source and available under the MIT license. You can use it freely in both personal and commercial projects.

MCPTube Alternatives — Similar Search & Data Extraction Servers

Looking for alternatives to MCPTube? Here are other popular search & data extraction servers you can use with Claude, Cursor, and VS Code.

TrendRadar

★ 58.0k

A real-time hotspot monitoring and news aggregation assistant that provides AI-powered analysis of trending topics across multiple platforms via the Model Context Protocol. It enables users to track news and receive automated notifications through va

Scrapling

★ 52.7k

🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!

PDF Math Translate

★ 33.9k

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats. AI-based full-text bilingual translation of PDF documents that fully preserves layout; supports services such as Google, DeepL, Ollama, and OpenAI, and provides CLI/GUI/MCP/Docker/Zotero

GPT Researcher

★ 27.2k

An autonomous agent that conducts deep research on any data using any LLM providers

Agent Reach

★ 20.1k

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

Xiaohongshu

★ 13.7k

MCP for xiaohongshu.com
