Kokoro TTS MCP
Converts text to MP3 using the open-weight Kokoro TTS models locally, with optional S3 upload support.
Delv Safety Grade: C
Score 58/100 · assessed 2026-04-28
Kokoro TTS MCP is a solo-maintained community server that runs open-weight text-to-speech models locally. The maintainer (mberg) appears to be an individual developer with limited public profile. Installation requires cloning the repository and running via uv, with no package registry distribution. The server writes MP3 files to the local filesystem and optionally uploads to S3, giving it moderate filesystem and network permissions. Transparency is reasonable with open source code and clear documentation of the Kokoro model integration. The local-first approach avoids sending text to third parties, which is a privacy positive. However, supply chain risk is elevated due to manual installation, no dependency pinning visible in the repository, and lack of versioned releases. No security incidents are known. Suitable for users comfortable evaluating Python dependencies and accepting solo-maintainer risk for non-critical audio generation tasks.
Lethal Trifecta (prompt-injection exposure)
CLEARLocal TTS. No I/O.
Green flags
- Runs models locally, no third-party API calls for TTS
- Open source with clear README and usage examples
- Uses established Kokoro TTS models
- Optional S3 upload keeps core functionality local-only
Red flags
- Solo maintainer with minimal public track record
- No package registry distribution, clone-and-run only
- S3 upload feature requires AWS credentials in environment
- No visible dependency pinning or lock file
- No versioned releases or changelog
Permissions requested
Install
uv run mcp-tts.py
Review
Install this if you need offline text-to-speech for drafting, privacy, or air-gapped workflows. Skip it if you want real-time responses or studio-quality voices. It's a solid local option that respects your data and your budget.
Good at
- Runs entirely offline, so your text never leaves your machine.
- No API costs or subscription fees, just local compute.
- Supports multiple Kokoro voices out of the box, enough variety for most drafting needs.
- Optional S3 upload for archiving or sharing output without manual file handling.
Watch out
- Generation speed depends on your hardware, and longer passages can take noticeable time.
- Voice quality is decent but not competitive with commercial APIs like ElevenLabs for final production.
- S3 upload setup isn't documented in the repo, so you'll need to configure AWS credentials yourself.
- Limited to the voices Kokoro provides, no custom voice cloning or fine-tuning.
Use cases
- audiobook generation
- voiceovers
- podcast drafting
- offline voice workflows
Getting started
Works with
Similar MCPs
- DaVinci Resolve MCPFull coverage of the DaVinci Resolve scripting API so agents can drive timelines, edits, colour grading, and media management via Claude.
- Free Will MCPExperimental tools that let an AI give itself prompts, ignore user requests, or go to sleep, for studying autonomy.
- Godot MCPInteracts with the Godot game engine for scene editing, running, debugging, and project management.
- QGIS MCPConnects QGIS Desktop to Claude for prompt-assisted project creation, layer loading, and code execution.