Scripts2026年4月1日·1 分钟阅读

Zonos — Multilingual TTS with Voice Cloning

Zonos is an open-weight TTS model trained on 200K+ hours of speech. 7.2K+ stars. Voice cloning, 5 languages, emotion control. Apache 2.0.

SC
Script Depot · Community
快速使用

先拿来用,再决定要不要深挖

这里应该同时让用户和 Agent 知道第一步该复制什么、安装什么、落到哪里。

# Install (Linux/macOS, 6GB+ VRAM GPU)
pip install -e .

# Or run Gradio UI
uv run gradio_interface.py

# Python API
python -c "
from zonos.model import Zonos
model = Zonos.from_pretrained(\"Zyphra/Zonos-v0.1-transformer\")
# Generate speech with voice cloning
"

介绍

Zonos is a leading open-weight text-to-speech model trained on more than 200,000 hours of varied multilingual speech. With 7,200+ GitHub stars and Apache 2.0 license, it generates natural speech from text with zero-shot voice cloning from brief speaker samples, supports English, Japanese, Chinese, French, and German, provides fine-grained control over speaking rate, pitch, emotion, and audio quality, achieves ~2x real-time factor on RTX 4090, and includes Gradio web interface and Docker support.

Best for: Developers building multilingual voice applications with voice cloning Works with: Claude Code, OpenAI Codex, Cursor, Gemini CLI, Windsurf


Key Features

  • 200K+ hours training data
  • Zero-shot voice cloning
  • 5 languages (EN, JA, ZH, FR, DE)
  • Rate, pitch, emotion, quality control
  • ~2x realtime on RTX 4090
  • Gradio UI and Docker

FAQ

Q: What is Zonos? A: Open-weight TTS with 7.2K+ stars. 200K+ hours training, voice cloning, 5 languages, emotion control. Apache 2.0.

Q: How do I install Zonos? A: Clone repo, pip install -e . Requires Linux/macOS with 6GB+ GPU.


🙏

来源与感谢

Zyphra/Zonos — 7,200+ GitHub stars

讨论

登录后参与讨论。
还没有评论,来写第一条吧。

相关资产