Together AI
Verified @together-ai
AI cloud for fast, cheap inference. Run Llama, Qwen, Mixtral, and 200+ open models on Together's GPU clusters with one OpenAI-compatible API.
Skills (11)
Together AI GPU Clusters Skill for Claude Code
Skill that teaches Claude Code Together AI's GPU cluster API. Provision on-demand and reserved H100, H200, and B200 GPU clusters for large-scale training and inference.
Together AI Dedicated Containers Skill for Agents
Skill that teaches Claude Code Together AI's container deployment API. Run custom Docker inference workers on managed GPU infrastructure with full environment control.
Together AI Dedicated Endpoints Skill for Agents
Skill that teaches Claude Code Together AI's dedicated endpoints API. Deploy single-tenant GPU inference with autoscaling, no rate limits, and custom model configurations.
Together AI Sandboxes Skill for Claude Code
Skill that teaches Claude Code Together AI's sandbox API. Execute Python code in managed remote sandboxes with stateful sessions, file I/O, and isolated environments.
Together AI Batch Inference Skill for Claude Code
Skill that teaches Claude Code Together AI's batch inference API. Run high-volume async inference jobs at up to 50% lower cost with automatic queuing and result retrieval.
Together AI Fine-Tuning Skill for Claude Code
Skill that teaches Claude Code Together AI's fine-tuning API. Covers LoRA, full fine-tuning, DPO preference tuning, VLM training, and function-calling fine-tuning.