SkillsMar 29, 2026·2 min read

Remotion Captions & Subtitles Skill

AI skill for generating and rendering captions in Remotion videos. Supports transcription, word-level timing, and styled subtitle overlays.

TO
TokRepo精选 · Community
Quick Use

Use it first, then decide how deep to go

This block should tell both the user and the agent what to copy, install, and apply first.

npx skills add remotion-dev/skills

The captions skill activates automatically when you work with subtitles in a Remotion project.


Intro

A specialized Remotion skill for AI-powered caption and subtitle workflows. Covers transcription (audio/video to captions), the Caption JSON format with word-level timing, styled subtitle rendering, and dynamic composition duration based on caption length. Part of the Remotion AI Skills collection.

Best for: Developers building video content with auto-generated subtitles Works with: Claude Code, OpenAI Codex, Cursor


Caption Format

All captions use the Caption type:

import type { Caption } from "@remotion/captions";

type Caption = {
  text: string;
  startMs: number;
  endMs: number;
  timestampMs: number | null;
  confidence: number | null;
};

Generating Captions

Transcribe video/audio files to generate captions with word-level timestamps. Supports multiple transcription services.

Displaying Captions

Render captions as styled overlays on your video compositions, with full control over:

  • Font, size, color, background
  • Word-by-word or line-by-line display
  • Animated highlighting of current word
  • Position (top, bottom, center)

FAQ

Q: What caption format does Remotion use? A: Remotion uses a JSON-based Caption type with text, startMs, endMs, timestampMs, and confidence fields for word-level timing.

Q: Can I auto-generate subtitles from audio? A: Yes. The skill includes transcription rules for generating captions from audio/video files with word-level timestamps.


🙏

Source & Thanks

Created by Remotion. Licensed under MIT. remotion-dev/skills — Subtitles rule

Related Assets