Loading...

ElevenLabs MCP Server - MCP Server Space

ElevenLabs MCP Server

ElevenLabs MCP Server

by ElevenLabs MCP Server mamertofabian

Generate speech audio from text via ElevenLabs API and manage voice generation tasks through a Model Context Protocol server with a companion SvelteKit web client.

PythonMIT Licenseaudiotext-to-speechelevenlabssveltekitsqlite

115stars

Updated12 days ago

ElevenLabs MCP Server Overview

ElevenLabs MCP Server

What is ElevenLabs MCP Server about?

Provides a Model Context Protocol (MCP) server that converts text to speech using the ElevenLabs API, stores job history in SQLite, and offers a sample SvelteKit client for interacting with the server.

How to use ElevenLabs MCP Server?

Installation – Prefer the uvx approach: no manual install needed, just configure the MCP settings JSON with the command uvx and the appropriate environment variables.
Running the server – The MCP client (e.g., Claude Desktop) invokes uvx elevenlabs-mcp-server based on the configuration.
Using the web client – Navigate to clients/web-ui, install dependencies with pnpm, copy .env.example to .env, then start the UI via pnpm dev and open http://localhost:5174.
Available tools – Call tools like generate_audio_simple, generate_audio_script, delete_job, list_voices, etc., through MCP.

Key features of ElevenLabs MCP Server

Audio generation from plain text or structured scripts.
Support for multiple voices, actors, and voice‑over parameters (stability, similarity boost, style).
Persistent job history stored in SQLite.
Sample SvelteKit client for text‑to‑speech conversion, script management, history playback, and audio download.
Simple configuration via environment variables.

Use cases of ElevenLabs MCP Server

Building voice‑over pipelines for videos, podcasts, or e‑learning content.
Integrating real‑time speech synthesis into AI assistants or chatbots.
Managing multi‑speaker dialogues for interactive storytelling.
Archiving generated audio for later review or reuse.

FAQ from the ElevenLabs MCP Server

Q: Which API key is required? A: An ElevenLabs API key (ELEVENLABS_API_KEY).

Q: How to choose a voice? A: Set ELEVENLABS_VOICE_ID in the environment or use the list_voices tool to retrieve available IDs.

Q: Can I change the synthesis model? A: Yes, adjust ELEVENLABS_MODEL_ID (default eleven_flash_v2).

Q: Where are audio files saved? A: To the directory defined by ELEVENLABS_OUTPUT_DIR (default output).

Q: How is job history accessed? A: Via the get_voiceover_history tool or the resource URI voiceover://history/{job_id}.

ElevenLabs MCP Server's README

ElevenLabs MCP Server

A Model Context Protocol (MCP) server that integrates with ElevenLabs text-to-speech API, featuring both a server component and a sample web-based MCP Client (SvelteKit) for managing voice generation tasks.

Features

Generate audio from text using ElevenLabs API
Support for multiple voices and script parts
SQLite database for persistent history storage
Sample SvelteKit MCP Client for:
- Simple text-to-speech conversion
- Multi-part script management
- Voice history tracking and playback
- Audio file downloads

Installation

Installing via Smithery

To install ElevenLabs MCP Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install elevenlabs-mcp-server --client claude

Using uvx (recommended)

When using uvx, no specific installation is needed.

Add the following configuration to your MCP settings file (e.g., cline_mcp_settings.json for Claude Desktop):

{
  "mcpServers": {
    "elevenlabs": {
      "command": "uvx",
      "args": ["elevenlabs-mcp-server"],
      "env": {
        "ELEVENLABS_API_KEY": "your-api-key",
        "ELEVENLABS_VOICE_ID": "your-voice-id",
        "ELEVENLABS_MODEL_ID": "eleven_flash_v2",
        "ELEVENLABS_STABILITY": "0.5",
        "ELEVENLABS_SIMILARITY_BOOST": "0.75",
        "ELEVENLABS_STYLE": "0.1",
        "ELEVENLABS_OUTPUT_DIR": "output"
      }
    }
  }
}

Development Installation

Clone this repository
Install dependencies:
```
uv venv
```
Copy .env.example to .env and fill in your ElevenLabs credentials

{
  "mcpServers": {
    "elevenlabs": {
      "command": "uv",
      "args": [
        "--directory",
        "path/to/elevenlabs-mcp-server",
        "run",
        "elevenlabs-mcp-server"
      ],
      "env": {
        "ELEVENLABS_API_KEY": "your-api-key",
        "ELEVENLABS_VOICE_ID": "your-voice-id",
        "ELEVENLABS_MODEL_ID": "eleven_flash_v2",
        "ELEVENLABS_STABILITY": "0.5",
        "ELEVENLABS_SIMILARITY_BOOST": "0.75",
        "ELEVENLABS_STYLE": "0.1",
        "ELEVENLABS_OUTPUT_DIR": "output"
      }
    }
  }
}

Using the Sample SvelteKit MCP Client

Navigate to the web UI directory:
```
cd clients/web-ui
```
Install dependencies:
```
pnpm install
```
Copy .env.example to .env and configure as needed
Run the web UI:
```
pnpm dev
```
Open http://localhost:5174 in your browser

Available Tools

generate_audio_simple: Generate audio from plain text using default voice settings
generate_audio_script: Generate audio from a structured script with multiple voices and actors
delete_job: Delete a job by its ID
get_audio_file: Get the audio file by its ID
list_voices: List all available voices
get_voiceover_history: Get voiceover job history. Optionally specify a job ID for a specific job.

Available Resources

voiceover://history/{job_id}: Get the audio file by its ID
voiceover://voices: List all available voices

License

This project is licensed under the MIT License - see the LICENSE file for details.

ElevenLabs MCP Server Reviews

Share Your Experience

Login Required

Please log in to share your review and rating for this MCP.

Similar MCP Servers like ElevenLabs MCP Server

Explore related MCPs that share similar capabilities and solve comparable challenges

MiniMax MCP

by MiniMax-AI

Enables interaction with powerful text‑to‑speech, image generation and video generation APIs through a Model Context Protocol server.

Video Editor MCP

by burningion

Upload, edit, search, and generate videos by leveraging LLM capabilities together with Video Jungle's media library.

Flyworks MCP

by Flyworks-AI

Create fast, free lip‑sync videos for digital avatars by providing audio or text, with optional avatar generation from images or videos.

Kokoro Text to Speech

by mberg

Generates spoken audio from text, outputting MP3 files locally and optionally uploading them to Amazon S3.

AllVoiceLab MCP Server

by allvoicelab

Generate natural speech, translate and dub videos, clone voices, remove hardcoded subtitles, and extract subtitles using powerful AI APIs.

Youtube Video Summarizer MCP Server

by nabid-pf

Extracts YouTube video captions, subtitles, and metadata to supply structured information for AI assistants to generate concise video summaries.

Json2Video MCP Server

by omergocmen

Provides video generation and status checking via the json2video API for seamless integration with LLMs, agents, and other MCP‑compatible clients.

Creatify MCP Server

by TSavo

Provides an enterprise‑grade MCP server that exposes 12 AI video generation tools, enabling AI assistants to create avatar videos, URL‑to‑video conversions, short videos, scripts, custom avatars, advanced lip‑sync, and more through natural language interactions.

Cartesia MCP Server

by cartesia-ai

Provides clients such as Cursor, Claude Desktop, and OpenAI agents with capabilities to localize speech, convert text to audio, and infill voice clips via Cartesia's API.