by allvoicelab
Generate natural speech, translate and dub videos, clone voices, remove hardcoded subtitles, and extract subtitles using powerful AI APIs.
The server provides a set of AI‑driven tools that enable text‑to‑speech, speech‑to‑speech voice conversion, rapid voice cloning, video translation & dubbing, subtitle extraction, subtitle removal, and plain text translation. It is designed to be invoked by MCP‑compatible clients such as Claude Desktop, Cursor, Windsurf, and OpenAI Agents.
uv):
curl -LsSf https://astral.sh/uv/install.sh | sh
uvx allvoicelab-mcp
ALLVOICELAB_API_KEY – your personal API key.ALLVOICELAB_API_DOMAIN – https://api.allvoicelab.com (global) or https://api.allvoicelab.cn (mainland).ALLVOICELAB_BASE_PATH (optional) – directory for storing generated files.https://api.allvoicelab.com) for worldwide keys, or the mainland domain (https://api.allvoicelab.cn) for Chinese keys.C:\Users\<Username>\.mcp\allvoicelab_mcp.log
macOS/Linux: ~/.mcp/allvoicelab_mcp.logALLVOICELAB_BASE_PATH to the desired folder path.
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs. Enables MCP clients like Claude Desktop, Cursor, Windsurf, OpenAI Agents to generate speech, translate videos, and perform intelligent voice conversion. Serves scenarios such as short drama localization for global markets, AI-Generated audiobooks, AI-Powered production of film/TV narration.
Multi-engine technology unlocks infinite possibilities for voice: With simple text input, you can access video generation, speech synthesis, voice cloning, and more.
AI Voice Generator (TTS): Natural voice generation in 30+ languages with ultra-high realism
Voice Changer: Real-time voice conversion, ideal for gaming, live streaming, and privacy protection
Multilingual Dubbing: One-click translation and dubbing for short videos/films, preserving emotional tone and rhythm
Speech-to-Text (STT): AI-powered multilingual subtitle generation with over 98% accuracy
Subtitle Removal: Seamless hard subtitle erasure, even on complex backgrounds
Voice Cloning: 3-Second Ultra-Fast Cloning with Human-like Voice Synthesis
uv (Python package manager), install with curl -LsSf https://astral.sh/uv/install.sh | sh| Region | Global | Mainland | 
|---|---|---|
| ALLVOICELAB_API_KEY | go get from AllVoiceLab | go get from AllVoiceLab | 
| ALLVOICELAB_API_DOMAIN | https://api.allvoicelab.com | https://api.allvoicelab.cn | 
Go to Claude > Settings > Developer > Edit Config > claude_desktop_config.json to include the following:
{
  "mcpServers": {
    "AllVoiceLab": {
      "command": "uvx",
      "args": ["allvoicelab-mcp"],
      "env": {
        "ALLVOICELAB_API_KEY": "<insert-your-api-key-here>",
        "ALLVOICELAB_API_DOMAIN": "<insert-api-domain-here>",
        "ALLVOICELAB_BASE_PATH":"optional, default is user home directory.This is uesd to store the output files."
      }
    }
  }
}
If you're using Windows, you will have to enable "Developer Mode" in Claude Desktop to use the MCP server. Click "Help" in the hamburger menu in the top left and select "Enable Developer Mode".
Go to Cursor -> Preferences -> Cursor Settings -> MCP -> Add new global MCP Server to add above config.
That's it. Your MCP client can now interact with AllVoiceLab.
| Methods | Brief description | 
|---|---|
| text_to_speech | Convert text to speech | 
| speech_to_speech | Convert audio to another voice while preserving the speech content | 
| clone_voice | Create a custom voice profile by cloning from an audio sample | 
| remove_subtitle | Remove hardcoded subtitles from a video using OCR | 
| video_translation_dubbing | Translate and dub video speech into different languages  | 
| text_translation | Translate a text file into another language | 
| subtitle_extraction | Extract subtitles from a video using OCR | 
⚠️ Warning: AllVoiceLab credits are needed to use these tools.
Try asking: Convert "At All Voice Lab, we’re reshaping the future of audio workflows with AI-powered solutions, making authentic voices accessible to creators everywhere." into voice.

After generating the audio from the previous example, select the audio file and ask: Convert this to a male voice.

Select an audio file with a single voice and ask: Clone this voice.

Select a video file (English) and ask: Translate this video to japanese.

Original video:

After translation:

Select a video with subtitles and ask: Remove the subtitles from this video.

Original video:

After the task is completed:

Select a long text (for example, "The Foolish Old Man Removes the Mountains") and ask: Translate this text to japanese. If no language is specified, it will be translated to English by default.

Select a video with subtitles and ask: Extract the subtitles from this video.

After the task is completed, you will get an SRT file as shown below:

Logs can be found at:
Please contact us by email(tech@allvoicelab.com) with log files
Please log in to share your review and rating for this MCP.
Explore related MCPs that share similar capabilities and solve comparable challenges
by burningion
Upload, edit, search, and generate videos by leveraging LLM capabilities together with Video Jungle's media library.
by mamertofabian
Generate speech audio from text via ElevenLabs API and manage voice generation tasks through a Model Context Protocol server with a companion SvelteKit web client.
by Flyworks-AI
Create fast, free lip‑sync videos for digital avatars by providing audio or text, with optional avatar generation from images or videos.
by mberg
Generates spoken audio from text, outputting MP3 files locally and optionally uploading them to Amazon S3.
by nabid-pf
Extracts YouTube video captions, subtitles, and metadata to supply structured information for AI assistants to generate concise video summaries.
by omergocmen
Provides video generation and status checking via the json2video API for seamless integration with LLMs, agents, and other MCP‑compatible clients.
by cartesia-ai
Provides clients such as Cursor, Claude Desktop, and OpenAI agents with capabilities to localize speech, convert text to audio, and infill voice clips via Cartesia's API.
by TSavo
Provides an enterprise‑grade MCP server that exposes 12 AI video generation tools, enabling AI assistants to create avatar videos, URL‑to‑video conversions, short videos, scripts, custom avatars, advanced lip‑sync, and more through natural language interactions.
by netdata
Delivers real‑time, per‑second infrastructure monitoring with zero‑configuration agents, on‑edge machine‑learning anomaly detection, and built‑in dashboards.