SkillsApr 8, 2026·1 min read

Together AI Audio TTS/STT Skill for Claude Code

Skill that teaches Claude Code Together AI's audio API. Covers text-to-speech (REST and WebSocket streaming), speech-to-text transcription, and realtime voice interaction.

Together AI · Community

Agent ready

Ready-to-run agent install

This asset can be installed after the agent chooses its runtime, checks the plan, and runs the matching command.

Native · 98/100Policy: allow

Agent surface

Any MCP/CLI agent

Kind

Skill

Install

Single

Trust

Trust: Community

Entrypoint

Together AI Audio TTS/STT Skill for Claude Code

Direct install command

npx -y tokrepo@latest install 1342ba0e-6098-4da8-8d88-dc8ac94389f0 --target codex

Run after dry-run confirms the install plan.

TL;DR

A Claude Code skill covering Together AI's audio API: text-to-speech, streaming TTS, speech-to-text, and realtime voice.

§01

What it is

This skill teaches Claude Code how to use Together AI's audio API. It covers text-to-speech (REST and WebSocket streaming), speech-to-text transcription, and realtime voice interaction through Together AI's endpoints.

The skill targets developers building voice-enabled applications who want their AI coding assistant to generate working audio API integration code. Once installed, Claude Code can scaffold TTS/STT pipelines using Together AI's infrastructure.

The project is actively maintained and suitable for both individual developers and teams looking to integrate it into their existing toolchain. Documentation and community support are available for onboarding.

§02

How it saves time or tokens

Instead of reading Together AI's audio API documentation and writing boilerplate, Claude Code generates correct API calls after installing this skill. It knows the endpoint formats, authentication patterns, and streaming protocols for both TTS and STT. The estimated token budget is around 2,500 tokens.

For teams evaluating multiple tools in the same category, the clear documentation and active community reduce the time spent on research and troubleshooting. Getting started takes minutes rather than hours of configuration.

§03

How to use

Install the skill in your Claude Code environment by adding the skill file to your project.
Ask Claude Code to generate TTS code using Together AI (e.g., 'Generate a script that converts text to speech using Together AI').
Claude Code produces working code with proper authentication, endpoint URLs, and response handling.
Run the generated code with your Together AI API key.

§04

Example

import requests

# Together AI Text-to-Speech (REST)
response = requests.post(
    'https://api.together.ai/v1/audio/speech',
    headers={
        'Authorization': 'Bearer YOUR_TOGETHER_API_KEY',
        'Content-Type': 'application/json'
    },
    json={
        'model': 'together-tts-v1',
        'input': 'Hello, this is a test of Together AI text to speech.',
        'voice': 'alloy',
        'response_format': 'mp3'
    }
)

with open('output.mp3', 'wb') as f:
    f.write(response.content)

§05

Related on TokRepo

AI Tools for Voice — Text-to-speech and voice synthesis tools.
Prompt Library — Prompt templates for AI coding skills.

§06

Common pitfalls

Not setting the response_format correctly. Together AI supports mp3, opus, and wav. Choose based on your playback requirements and file size constraints.
Using REST for real-time voice applications. The WebSocket streaming API delivers audio chunks with lower latency than the REST endpoint. Use WebSocket for interactive voice.
Hardcoding API keys in generated code. Always use environment variables for API key management, even in prototype code.
Applying the skill without reading the documentation first. Each skill has specific prerequisites and configuration requirements that affect the quality of results.

Frequently Asked Questions

What audio formats does Together AI TTS support?+

Together AI's TTS endpoint supports MP3, Opus, and WAV output formats. MP3 is the most common for web applications. Opus offers better compression for streaming scenarios.

Does the skill support real-time voice interaction?+

Yes. The skill covers Together AI's WebSocket streaming API for real-time audio. This enables voice assistants and interactive applications with low-latency audio output.

Can I use this skill with other AI coding assistants?+

This skill is designed for Claude Code. The underlying API patterns work with any coding assistant, but the skill file format is specific to Claude Code's skill system.

What is the cost of Together AI's audio API?+

Together AI charges per character for TTS and per second for STT. Check the Together AI pricing page for current rates. The free tier includes limited usage for testing.

Does the STT component support multiple languages?+

Together AI's speech-to-text supports multiple languages. The exact language list depends on the underlying model. Check the API documentation for supported language codes.

Citations (3)

Together AI Documentation— Together AI audio API for TTS and STT
Anthropic Docs— Claude Code skills system
Together AI API Reference— WebSocket streaming for real-time audio

Related on TokRepo

Voice tools Prompt library Featured workflows

🙏

Source & Thanks

Part of togethercomputer/skills — MIT licensed.

Discussion

No comments yet. Be the first to share your thoughts.

Related Assets

Together AI Dedicated Containers Skill for Agents

Skill that teaches Claude Code Together AI's container deployment API. Run custom Docker inference workers on managed GPU infrastructure with full environment control.

Skills

Together AI

Together AI Embeddings & Reranking Skill for Agents

Skill that teaches Claude Code Together AI's embeddings and reranking API. Covers dense vector generation, semantic search, RAG pipelines, and result reranking patterns.

Skills

Together AI

Together AI Sandboxes Skill for Claude Code

Skill that teaches Claude Code Together AI's sandbox API. Execute Python code in managed remote sandboxes with stateful sessions, file I/O, and isolated environments.

Skills

Together AI

Together AI Dedicated Endpoints Skill for Agents

Skill that teaches Claude Code Together AI's dedicated endpoints API. Deploy single-tenant GPU inference with autoscaling, no rate limits, and custom model configurations.

Skills

Together AI