Caption Format
All captions use the Caption type:
import type { Caption } from "@remotion/captions";
type Caption = {
  text: string;
  startMs: number;
  endMs: number;
  timestampMs: number | null;
  confidence: number | null;
};
Generating Captions
Transcribe video or audio files to generate captions with word-level timestamps. Multiple transcription services are supported.
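As a rough sketch of what this step produces, the snippet below maps word-level output from a transcription service into Caption objects. The `TranscribedWord` shape (times in seconds, optional `probability`) and the `toCaptions` helper are hypothetical and not tied to any particular service. The local `Caption` type mirrors the one shown above so the snippet stands alone; in a project it would be imported from "@remotion/captions".

```typescript
// Local copy of the Caption shape so this sketch is self-contained.
type Caption = {
  text: string;
  startMs: number;
  endMs: number;
  timestampMs: number | null;
  confidence: number | null;
};

// Hypothetical word-level result from a transcription service.
// Times are assumed to be in seconds; real services may differ.
type TranscribedWord = {
  word: string;
  start: number;
  end: number;
  probability?: number;
};

// Convert each transcribed word into one Caption with millisecond timing.
const toCaptions = (words: TranscribedWord[]): Caption[] =>
  words.map((w) => ({
    text: w.word,
    startMs: Math.round(w.start * 1000),
    endMs: Math.round(w.end * 1000),
    // Assumption: this service reports no single-point timestamp.
    timestampMs: null,
    // Keep the confidence if the service reports one, otherwise null.
    confidence: w.probability ?? null,
  }));
```

Rounding to whole milliseconds keeps adjacent words aligned when one word's end time equals the next word's start time.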
Displaying Captions
Render captions as styled overlays on your video compositions, with full control over:
- Font, size, color, background
- Word-by-word or line-by-line display
- Animated highlighting of current word
- Position (top, bottom, center)
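The word-highlighting option above comes down to finding which caption is active at the current playback time. The following is a minimal sketch of that lookup as plain functions, with no rendering code. The helper names (`frameToMs`, `findActiveCaptionIndex`, `highlightLine`) are hypothetical, and treating a caption as active on the interval from `startMs` up to but not including `endMs` is an assumption. The `Caption` type is repeated locally so the snippet stands alone.

```typescript
// Local copy of the Caption shape so this sketch is self-contained.
type Caption = {
  text: string;
  startMs: number;
  endMs: number;
  timestampMs: number | null;
  confidence: number | null;
};

// Convert a frame number to milliseconds for a given frame rate.
const frameToMs = (frame: number, fps: number): number =>
  (frame / fps) * 1000;

// Return the index of the caption active at timeMs, or -1 if none is.
// Assumption: a caption is active on [startMs, endMs).
const findActiveCaptionIndex = (captions: Caption[], timeMs: number): number =>
  captions.findIndex((c) => timeMs >= c.startMs && timeMs < c.endMs);

// A word plus a flag a renderer could use to style the current word.
type HighlightedWord = { text: string; active: boolean };

// Mark which word in a line is being spoken at timeMs.
const highlightLine = (line: Caption[], timeMs: number): HighlightedWord[] => {
  const activeIndex = findActiveCaptionIndex(line, timeMs);
  return line.map((c, i) => ({ text: c.text, active: i === activeIndex }));
};
```

In a composition, the current frame and frame rate would supply the arguments to `frameToMs`, and the `active` flag would select the highlight style for each word.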
FAQ
Q: What caption format does Remotion use?
A: Remotion uses a JSON-based Caption type with text, startMs, endMs, timestampMs, and confidence fields for word-level timing.
Q: Can I auto-generate subtitles from audio?
A: Yes. The skill includes transcription rules for generating captions from audio/video files with word-level timestamps.