ScriptsApr 2, 2026·3 min read
Coqui TTS — Deep Learning Text-to-Speech Engine
Generate speech in 1100+ languages with voice cloning. XTTS v2 streams with under 200ms latency. 44K+ GitHub stars.
TO
TokRepo精选 · Community
Quick Use
Use it first, then decide how deep to go
This block should tell both the user and the agent what to copy, install, and apply first.
```bash
pip install TTS
```
```bash
# List available models
tts --list_models
# Generate speech from text (English)
tts --text "Hello, welcome to TokRepo." --out_path output.wav
# Use XTTS v2 for multilingual + voice cloning
tts --model_name tts_models/multilingual/multi-dataset/xtts_v2 \
--text "你好,欢迎来到TokRepo。" \
--speaker_wav reference_voice.wav \
--language_idx zh-cn \
--out_path output_zh.wav
```
```python
from TTS.api import TTS
# Initialize XTTS v2
tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2").to("cuda")
# Generate speech with voice cloning
tts.tts_to_file(
text="Welcome to the future of AI voice.",
speaker_wav="my_voice.wav",
language="en",
file_path="output.wav"
)
```
---
Intro
Coqui TTS is the most comprehensive open-source text-to-speech library with 44,900+ GitHub stars, supporting 1,100+ languages via pretrained models. Its flagship XTTS v2 model delivers production-quality multilingual speech with voice cloning in just 6 seconds of reference audio and under 200ms streaming latency. The library implements every major TTS architecture — VITS, Tacotron 2, Glow-TTS, Bark, Tortoise — with a unified Python API and CLI. While Coqui the company closed in 2023, the open-source project remains the go-to TTS toolkit for developers worldwide.
Works with: Python, CUDA GPUs, CPU (slower), any application via CLI or Python API. Best for developers adding voice to AI agents, chatbots, accessibility tools, or content creation pipelines. Setup time: under 3 minutes.
---
🙏
Source & Thanks
> Created by [Coqui AI](https://github.com/coqui-ai). Licensed under MPL-2.0.
>
> [TTS](https://github.com/coqui-ai/TTS) — ⭐ 44,900+
Thanks to the Coqui AI team and community for building the most comprehensive open-source TTS toolkit.
Discussion
Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.
Related Assets
OpenLIT — OpenTelemetry LLM Observability
Monitor LLM costs, latency, and quality with OpenTelemetry-native tracing. GPU monitoring and guardrails built in. 2.3K+ stars.
TokRepo精选
Agenta — Open-Source LLMOps Platform
Prompt playground, evaluation, and observability in one platform. Compare prompts, run evals, trace production calls. 4K+ stars.
TokRepo精选
Rerun — Visualize Multimodal AI Data in Real-Time
SDK for logging, storing, and visualizing 3D, images, time series, and text in real-time. Built for robotics and AI. 10K+ stars.
TokRepo精选