by nabid-pf
Extracts YouTube video captions, subtitles, and metadata to supply structured information for AI assistants to generate concise video summaries.
A lightweight MCP server that pulls subtitles, captions, and key metadata (title, description, duration) from any YouTube URL and presents the data in a format readily consumable by AI assistants for summarization tasks.
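Accepting any YouTube URL means the link has to be reduced to a video ID at some point. The sketch below shows one way this could be done with the standard `URL` class; `extractVideoId` is a hypothetical helper written for illustration and is not taken from the server's source, which may handle more URL forms or work differently.

```typescript
// Hypothetical helper (an assumption, not the server's actual code):
// pulls the video ID out of common YouTube link formats.
function extractVideoId(link: string): string | null {
  let url: URL;
  try {
    url = new URL(link);
  } catch {
    return null; // the input is not a parseable URL
  }

  const host = url.hostname.replace(/^www\./, "");

  // Short links: https://youtu.be/<id>
  if (host === "youtu.be") {
    const id = url.pathname.split("/")[1] ?? "";
    return id.length > 0 ? id : null;
  }

  // Watch links: https://youtube.com/watch?v=<id>
  if (host === "youtube.com" || host === "m.youtube.com") {
    return url.searchParams.get("v");
  }

  return null; // not a recognized YouTube host
}
```

Returning `null` for unrecognized input lets a caller report a bad link before any extraction is attempted.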
Run npm install -g youtube-video-summarizer-mcp, or use the provided MCP configuration with npx.
{
  "mcpServers": {
    "youtube-video-summarizer": {
      "command": "youtube-video-summarizer",
      "args": []
    }
  }
}
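The two setups described on this page (a global install and npx) differ only in the `command` and `args` of the server entry. As a rough sketch, the snippet below generates either variant; `buildConfig` and its type names are hypothetical and exist only for this example, while the command names and package name are the ones shown in the configurations on this page.

```typescript
// Hypothetical types and helper, introduced for illustration only.
type ServerEntry = { command: string; args: string[]; env?: Record<string, string> };
type McpConfig = { mcpServers: Record<string, ServerEntry> };

function buildConfig(useNpx: boolean): McpConfig {
  const entry: ServerEntry = useNpx
    ? // npx variant: no global install needed
      { command: "npx", args: ["-y", "youtube-video-summarizer-mcp"], env: {} }
    : // global variant: requires npm install -g youtube-video-summarizer-mcp
      { command: "youtube-video-summarizer", args: [] };
  return { mcpServers: { "youtube-video-summarizer": entry } };
}

// Print the npx variant as JSON, ready to paste into an MCP client config.
console.log(JSON.stringify(buildConfig(true), null, 2));
```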
Call get-video-info-for-summary-from-url with a YouTube link.
Caption extraction relies on youtube-caption-extractor.
Available commands: get-video-info-for-summary-from-url, get-video-captions, get-video-metadata.
Q: Do I need a YouTube API key?
A: No. The server extracts publicly available captions and metadata without requiring an API key.
Q: Which languages are supported for caption extraction?
A: Any language for which the video provides subtitles; the underlying extractor will fetch all available tracks.
Q: Can I run the server locally without global installation?
A: Yes. Use npx -y youtube-video-summarizer-mcp as shown in the serverConfig below.
Q: What if a video has no captions?
A: The get-video-captions command will return an empty result, while metadata commands still work.
Q: Is the project open source?
A: Yes, it is released under the MIT License.
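Because a video without captions yields an empty caption result while metadata remains available, a client may want to fall back to the description when building text for summarization. The following is a minimal sketch of that idea; the `Caption` and `VideoMetadata` shapes and the `buildSummaryInput` function are assumptions made for this example, and the server's real response format may differ.

```typescript
// Assumed response shapes (hypothetical; check the server's actual output).
interface Caption {
  start: number; // start time in seconds
  text: string;
}

interface VideoMetadata {
  title: string;
  description: string;
}

// Builds the text handed to the assistant, falling back to the
// description when no captions were returned.
function buildSummaryInput(meta: VideoMetadata, captions: Caption[]): string {
  const header = `Title: ${meta.title}`;
  if (captions.length === 0) {
    return `${header}\nDescription: ${meta.description}`;
  }
  const transcript = captions.map((c) => c.text.trim()).join(" ");
  return `${header}\nTranscript: ${transcript}`;
}
```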
An MCP (Model Context Protocol) server that enables AI assistants to analyze and summarize YouTube videos by extracting captions, descriptions, and metadata.
To add the MCP server to your MCP client:
npm install -g youtube-video-summarizer-mcp
{
  "mcpServers": {
    "youtube-video-summarizer": {
      "command": "youtube-video-summarizer",
      "args": []
    }
  }
}
When integrated with an MCP client, the following commands become available:
get-video-info-for-summary-from-url
get-video-captions
get-video-metadata
Once integrated with your MCP client, you can use natural language to request video summaries:
"Can you summarize this YouTube video: https://youtube.com/watch?v=VIDEO_ID"
"What are the main points from this video's captions?"
"Extract the key information from this YouTube link"
npm install -g youtube-video-summarizer-mcp
git clone https://github.com/nabid-pf/youtube-video-summarizer-mcp.git
cd youtube-video-summarizer-mcp
npm install
npm run build
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
{
  "mcpServers": {
    "youtube-video-summarizer": {
      "command": "npx",
      "args": [
        "-y",
        "youtube-video-summarizer-mcp"
      ],
      "env": {}
    }
  }
}
claude mcp add youtube-video-summarizer npx -y youtube-video-summarizer-mcp
Explore related MCPs that share similar capabilities and solve comparable challenges
by burningion
Upload, edit, search, and generate videos by leveraging LLM capabilities together with Video Jungle's media library.
by mamertofabian
Generate speech audio from text via ElevenLabs API and manage voice generation tasks through a Model Context Protocol server with a companion SvelteKit web client.
by Flyworks-AI
Create fast, free lip‑sync videos for digital avatars by providing audio or text, with optional avatar generation from images or videos.
by mberg
Generates spoken audio from text, outputting MP3 files locally and optionally uploading them to Amazon S3.
by allvoicelab
Generate natural speech, translate and dub videos, clone voices, remove hardcoded subtitles, and extract subtitles using powerful AI APIs.
by omergocmen
Provides video generation and status checking via the json2video API for seamless integration with LLMs, agents, and other MCP‑compatible clients.
by cartesia-ai
Provides clients such as Cursor, Claude Desktop, and OpenAI agents with capabilities to localize speech, convert text to audio, and infill voice clips via Cartesia's API.
by TSavo
Provides an enterprise‑grade MCP server that exposes 12 AI video generation tools, enabling AI assistants to create avatar videos, URL‑to‑video conversions, short videos, scripts, custom avatars, advanced lip‑sync, and more through natural language interactions.
by netdata
Delivers real‑time, per‑second infrastructure monitoring with zero‑configuration agents, on‑edge machine‑learning anomaly detection, and built‑in dashboards.