May 19, 2026 · 1 min read

CosyVoice — Multilingual Voice Generation with LLM-Based TTS

CosyVoice is an open-source, LLM-based text-to-speech system from Alibaba's FunAudioLLM team. It supports 9 languages and 18+ Chinese dialects, with zero-shot voice cloning, streaming synthesis, and fine-grained prosody control.

Agent ready

Agents can read and install this asset directly.

TokRepo exposes a universal CLI command, install contract, metadata JSON, adapter-aware plan, and raw content links so agents can judge fit, risk, and next actions.

0/100 · Policy: pending
Agent surface: Any MCP/CLI agent
Kind: General
Install: Bundle
Trust: Community asset
Entrypoint: SKILL.md
Universal CLI install command
npx tokrepo install 7141df5f-537e-11f1-9bc6-00163e2b0d79
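
For agents or scripts that drive the install from Python, the command above could be wrapped as in the following minimal sketch. The helper names `build_install_command` and `install` are hypothetical and are not part of TokRepo; the only thing taken from this page is the `npx tokrepo install <id>` command and its asset ID. The sketch assumes Node.js (which provides `npx`) is on the PATH.

```python
import shutil
import subprocess

# Asset ID copied verbatim from the install command on this page.
ASSET_ID = "7141df5f-537e-11f1-9bc6-00163e2b0d79"


def build_install_command(asset_id: str) -> list[str]:
    """Return the argv list for the install command shown on this page."""
    return ["npx", "tokrepo", "install", asset_id]


def install(asset_id: str) -> int:
    """Run the install command and return its exit code.

    Hypothetical helper: it only shells out to the command above and makes
    no assumptions about what the CLI prints or where it writes files.
    """
    if shutil.which("npx") is None:
        raise RuntimeError("npx not found on PATH; install Node.js first")
    # Passing a list (not a shell string) avoids shell-quoting issues.
    return subprocess.run(build_install_command(asset_id), check=False).returncode


# Show the exact command without running it.
print(" ".join(build_install_command(ASSET_ID)))
```

Calling `install(ASSET_ID)` would actually run the command; the sketch only prints it, so the command can be reviewed before anything is installed.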
