by cartesia-ai
Provides clients such as Cursor, Claude Desktop, and OpenAI agents with capabilities to localize speech, convert text to audio, and infill voice clips via Cartesia's API.
The server acts as a bridge between local client applications and Cartesia's cloud API, enabling operations like voice list retrieval, text‑to‑speech synthesis, speech localization to different languages, and audio segment infilling.
pip install cartesia-mcp
which cartesia-mcp
CARTESIA_API_KEY
: your Cartesia API keyOUTPUT_DIRECTORY
(optional): directory where generated audio files will be savedQ: Do I need a paid Cartesia plan? A: No. The free tier provides 20,000 credits per month, sufficient for most development and testing scenarios.
Q: Which environment variable stores the API key?
A: CARTESIA_API_KEY
.
Q: Where are generated audio files saved?
A: By default they are written to the current working directory; you can set OUTPUT_DIRECTORY
to change the location.
Q: Can I run the server on Windows?
A: Yes, as long as Python and the cartesia-mcp
package are installed and the executable is on the system PATH.
Q: How do I integrate with Cursor?
A: Create a .cursor/mcp.json
(project‑level) or ~/.cursor/mcp.json
(global) containing the same configuration used for Claude Desktop.
The Cartesia MCP server provides a way for clients such as Cursor, Claude Desktop, and OpenAI agents to interact with Cartesia's API. Users can localize speech, convert text to audio, infill voice clips etc.
Ensure that you have created an account on Cartesia, there is a free tier with 20,000 credits per month. Once in the Cartesia playground, create an API key under API Keys --> New.
pip install cartesia-mcp
which cartesia-mcp # absolute path to executable
Add the following to claude_desktop_config.json
which can be found through Settings --> Developer --> Edit Config.
{
"mcpServers": {
"cartesia-mcp": {
"command": "<absolute-path-to-executable>",
"env": {
"CARTESIA_API_KEY": "<insert-your-api-key-here>",
"OUTPUT_DIRECTORY": // directory to store generated files (optional)
}
}
}
}
Try asking Claude to
Create either a .cursor/mcp.json
in your project or a global ~/.cursor/mcp.json
. The same config as for Claude can be used.
Please log in to share your review and rating for this MCP.
Explore related MCPs that share similar capabilities and solve comparable challenges
by burningion
Upload, edit, search, and generate videos by leveraging LLM capabilities together with Video Jungle's media library.
by mamertofabian
Generate speech audio from text via ElevenLabs API and manage voice generation tasks through a Model Context Protocol server with a companion SvelteKit web client.
by Flyworks-AI
Create fast, free lip‑sync videos for digital avatars by providing audio or text, with optional avatar generation from images or videos.
by mberg
Generates spoken audio from text, outputting MP3 files locally and optionally uploading them to Amazon S3.
by allvoicelab
Generate natural speech, translate and dub videos, clone voices, remove hardcoded subtitles, and extract subtitles using powerful AI APIs.
by nabid-pf
Extracts YouTube video captions, subtitles, and metadata to supply structured information for AI assistants to generate concise video summaries.
by omergocmen
Provides video generation and status checking via the json2video API for seamless integration with LLMs, agents, and other MCP‑compatible clients.
by TSavo
Provides an enterprise‑grade MCP server that exposes 12 AI video generation tools, enabling AI assistants to create avatar videos, URL‑to‑video conversions, short videos, scripts, custom avatars, advanced lip‑sync, and more through natural language interactions.
by zed-industries
A high‑performance, multiplayer code editor designed for speed and collaboration.