Esta página se muestra en inglés. Una traducción al español está en curso.
KnowledgeMay 11, 2026·5 min de lectura

ElevenLabs Voice Design — Generate Voices from Prompts

ElevenLabs Voice Design generates new voices from text prompts. Describe age, accent, tone — get a voice you own and reuse via TTS API.

Listo para agents

Staging seguro para este activo

Este activo primero queda en staging. El prompt copiado pide inspeccionar los archivos staged antes de activar scripts, config MCP o config global.

Stage only · 27/100Política: staging
Superficie agent
Cualquier agent MCP/CLI
Tipo
Knowledge
Instalación
Stage only
Confianza
Confianza: Community
Entrada
Asset
Comando de staging seguro
npx -y tokrepo@latest install ad11ab44-64e2-4ece-9d8a-2a072fda98e3 --target codex

Primero deja archivos en staging; la activación requiere revisar el README y el plan staged.

Introducción

ElevenLabs Voice Design generates entirely new synthesized voices from a text prompt — describe the age, accent, gender, energy, and timbre, get back a unique voice you own and can reuse across the TTS API. No source audio required, no clone consent issues. Best for: characters in audio fiction, podcast hosts, app personas where you don't have a real reference voice, multilingual voices in languages with thin voice libraries. Works with: ElevenLabs Voice Lab dashboard or API. Setup time: 5 minutes.


Generate a voice via API

import requests, json

resp = requests.post(
    "https://api.elevenlabs.io/v1/voice-generation/generate-voice",
    headers={"xi-api-key": os.environ["ELEVENLABS_API_KEY"]},
    json={
        "voice_description": "An elderly Scottish man with a warm, smoky voice — like a retired sea captain telling stories by a fireplace",
        "text": "Aye, that was the night the lighthouse went dark, and we were three miles off the rocks.",
    },
)
generated_voice_id = resp.headers["voice_id"]
with open("preview.mp3", "wb") as f:
    f.write(resp.content)

Save the design to your voice library

requests.post(
    "https://api.elevenlabs.io/v1/voice-generation/create-voice",
    headers={"xi-api-key": API_KEY},
    json={
        "voice_name": "Captain Hamish",
        "voice_description": "Elderly Scottish sea captain, warm and smoky",
        "generated_voice_id": generated_voice_id,
        "labels": {"accent": "Scottish", "age": "elderly", "use": "narration"},
    },
)

Use the saved voice in TTS

from elevenlabs.client import ElevenLabs
from elevenlabs import play

client = ElevenLabs()
audio = client.text_to_speech.convert(
    voice_id="Captain Hamish",        # name or ID from your library
    model_id="eleven_turbo_v2_5",
    text="Ah, you wouldn't believe what we saw that morning on Skye.",
)
play(audio)

Prompt patterns that work

Goal Prompt template
Narrator "Calm, mid-40s, neutral American accent, warm timbre"
Energetic host "High-energy mid-20s podcast host, slight rasp, fast pace"
Authority figure "Deep-voiced 50s news anchor, RP British accent, measured"
Child character "8-year-old curious child, light pitch, occasional giggle"
Villain "Cold, controlled, low-pitched, slight whisper, mid-30s"

Cost

Voice Design generation costs ~1,000 credits per call (preview); saving a voice is free; TTS usage is the standard per-character rate. Starter plan ($5/mo) covers ~50 designs.


FAQ

Q: Are these voices royalty-free for commercial use? A: Yes — voices you generate via Voice Design are yours under the ElevenLabs commercial license tied to your plan. Read the current Terms — Creator plan and above explicitly allow commercial publishing. Free plan is non-commercial only.

Q: Can I tweak a voice after generating? A: Yes — saved voices have stability and similarity sliders. For more dramatic changes, re-prompt Voice Design with refined description text. The platform doesn't let you literally re-mix two designed voices, but iterating on prompts gets close.

Q: How does this compare to cloning a real voice? A: Voice Cloning needs 1-3 minutes of source audio and consent; outputs sound very close to the source. Voice Design needs only a prompt; outputs are novel synthetic voices. Use Design when you don't have a source or for ethically clean characters.


Quick Use

  1. POST /v1/voice-generation/generate-voice with description + preview text
  2. POST /v1/voice-generation/create-voice to save to library
  3. Use the saved voice_id in standard TTS calls

Intro

ElevenLabs Voice Design generates entirely new synthesized voices from a text prompt — describe the age, accent, gender, energy, and timbre, get back a unique voice you own and can reuse across the TTS API. No source audio required, no clone consent issues. Best for: characters in audio fiction, podcast hosts, app personas where you don't have a real reference voice, multilingual voices in languages with thin voice libraries. Works with: ElevenLabs Voice Lab dashboard or API. Setup time: 5 minutes.


Generate a voice via API

import requests, json

resp = requests.post(
    "https://api.elevenlabs.io/v1/voice-generation/generate-voice",
    headers={"xi-api-key": os.environ["ELEVENLABS_API_KEY"]},
    json={
        "voice_description": "An elderly Scottish man with a warm, smoky voice — like a retired sea captain telling stories by a fireplace",
        "text": "Aye, that was the night the lighthouse went dark, and we were three miles off the rocks.",
    },
)
generated_voice_id = resp.headers["voice_id"]
with open("preview.mp3", "wb") as f:
    f.write(resp.content)

Save the design to your voice library

requests.post(
    "https://api.elevenlabs.io/v1/voice-generation/create-voice",
    headers={"xi-api-key": API_KEY},
    json={
        "voice_name": "Captain Hamish",
        "voice_description": "Elderly Scottish sea captain, warm and smoky",
        "generated_voice_id": generated_voice_id,
        "labels": {"accent": "Scottish", "age": "elderly", "use": "narration"},
    },
)

Use the saved voice in TTS

from elevenlabs.client import ElevenLabs
from elevenlabs import play

client = ElevenLabs()
audio = client.text_to_speech.convert(
    voice_id="Captain Hamish",        # name or ID from your library
    model_id="eleven_turbo_v2_5",
    text="Ah, you wouldn't believe what we saw that morning on Skye.",
)
play(audio)

Prompt patterns that work

Goal Prompt template
Narrator "Calm, mid-40s, neutral American accent, warm timbre"
Energetic host "High-energy mid-20s podcast host, slight rasp, fast pace"
Authority figure "Deep-voiced 50s news anchor, RP British accent, measured"
Child character "8-year-old curious child, light pitch, occasional giggle"
Villain "Cold, controlled, low-pitched, slight whisper, mid-30s"

Cost

Voice Design generation costs ~1,000 credits per call (preview); saving a voice is free; TTS usage is the standard per-character rate. Starter plan ($5/mo) covers ~50 designs.


FAQ

Q: Are these voices royalty-free for commercial use? A: Yes — voices you generate via Voice Design are yours under the ElevenLabs commercial license tied to your plan. Read the current Terms — Creator plan and above explicitly allow commercial publishing. Free plan is non-commercial only.

Q: Can I tweak a voice after generating? A: Yes — saved voices have stability and similarity sliders. For more dramatic changes, re-prompt Voice Design with refined description text. The platform doesn't let you literally re-mix two designed voices, but iterating on prompts gets close.

Q: How does this compare to cloning a real voice? A: Voice Cloning needs 1-3 minutes of source audio and consent; outputs sound very close to the source. Voice Design needs only a prompt; outputs are novel synthetic voices. Use Design when you don't have a source or for ethically clean characters.


Source & Thanks

Built by ElevenLabs. Voice Design docs at elevenlabs.io/docs/voices/voice-lab.

elevenlabs/elevenlabs-python — official SDK

🙏

Fuente y agradecimientos

Built by ElevenLabs. Voice Design docs at elevenlabs.io/docs/voices/voice-lab.

elevenlabs/elevenlabs-python — official SDK

Discusión

Inicia sesión para unirte a la discusión.
Aún no hay comentarios. Sé el primero en compartir tus ideas.

Activos relacionados