Key Features
- 200K+ hours training data
- Zero-shot voice cloning
- 5 languages (EN, JA, ZH, FR, DE)
- Rate, pitch, emotion, quality control
- ~2x realtime on RTX 4090
- Gradio UI and Docker
FAQ
Q: What is Zonos? A: Open-weight TTS with 7.2K+ stars. 200K+ hours training, voice cloning, 5 languages, emotion control. Apache 2.0.
Q: How do I install Zonos? A: Clone repo, pip install -e . Requires Linux/macOS with 6GB+ GPU.