Esta página se muestra en inglés. Una traducción al español está en curso.
SkillsApr 8, 2026·1 min de lectura

Together AI Video Generation Skill for Claude Code

Skill that teaches Claude Code Together AI's video generation API. Covers text-to-video, image-to-video, and keyframe control for AI-powered video creation workflows.

Listo para agents

Instalación lista para agent

Este activo puede instalarse después de elegir el runtime, revisar el plan y ejecutar el comando correspondiente.

Native · 98/100Política: permitir
Superficie agent
Cualquier agent MCP/CLI
Tipo
Skill
Instalación
Single
Confianza
Confianza: Community
Entrada
Together AI Video Generation Skill for Claude Code
Comando de instalación directa
npx -y tokrepo@latest install d848ded0-8bff-4424-8ef5-dda71b903327 --target codex

Ejecutar después de confirmar el plan con dry-run.

TL;DR
Claude Code skill for Together AI's video generation API covering text-to-video and image-to-video.
§01

What it is

This skill teaches Claude Code how to use Together AI's video generation API. It covers text-to-video generation, image-to-video conversion, and keyframe control for creating AI-powered video content. With this skill loaded, Claude Code produces correct API calls with proper model names and parameters.

Content creators, developers building video pipelines, and teams prototyping visual content will find this skill useful when they want to generate video programmatically through Claude Code without memorizing API details.

§02

How it saves time or tokens

Together AI's video API has specific model identifiers, resolution constraints, and generation parameters. Without this skill, you would spend tokens describing these details in every prompt or correcting hallucinated parameter names. The skill encodes approximately 2,400 tokens of API knowledge so your prompts stay focused on creative intent.

§03

How to use

  1. Add the Together AI Video Generation skill to your Claude Code project configuration.
  2. Prompt Claude Code to generate video using Together AI's API.
  3. Specify the generation mode: text-to-video, image-to-video, or keyframe-controlled.
§04

Example

import requests

url = 'https://api.together.xyz/v1/video/generations'
headers = {
    'Authorization': 'Bearer YOUR_API_KEY',
    'Content-Type': 'application/json'
}
payload = {
    'model': 'Wan-AI/Wan2.1-T2V-14B',
    'prompt': 'A drone shot over a misty forest at dawn, cinematic',
    'height': 480,
    'width': 832,
    'num_frames': 81,
    'num_steps': 30
}
response = requests.post(url, headers=headers, json=payload)
video_url = response.json()['data'][0]['url']
print(video_url)
§05

Related on TokRepo

§06

Common pitfalls

  • Requesting resolutions the model does not support. Each video model has fixed resolution options; arbitrary dimensions will fail.
  • Expecting real-time generation. Video generation takes seconds to minutes depending on frame count and resolution.
  • Not handling the asynchronous response pattern. Some video generation endpoints return a job ID that requires polling for completion.

Preguntas frecuentes

What video generation modes does this skill cover?+

The skill covers three modes: text-to-video (generate from a text prompt), image-to-video (animate a static image), and keyframe control (specify start and end frames to guide the video direction).

How long are the generated videos?+

Video length depends on the model and num_frames parameter. Typical outputs range from 2 to 6 seconds. Longer videos require chaining multiple generations or using models that support extended durations.

Do I need a Together AI account?+

Yes. A Together AI API key is required to make video generation requests. The skill teaches Claude Code the correct API patterns but does not provide authentication.

Can I use this for commercial projects?+

Check Together AI's terms of service and the specific model's license for commercial usage rights. Some models have open licenses while others may have restrictions.

How does this differ from the image generation skill?+

The image generation skill covers FLUX and Kontext models for still images. This video skill covers video-specific models with parameters like num_frames, frame rate, and temporal consistency settings.

Referencias (3)
🙏

Fuente y agradecimientos

Part of togethercomputer/skills — MIT licensed.

Discusión

Inicia sesión para unirte a la discusión.
Aún no hay comentarios. Sé el primero en compartir tus ideas.

Activos relacionados