Cette page est affichée en anglais. Une traduction française est en cours.

SkillsMar 29, 2026·1 min de lecture

Remotion AI Voiceover Skill — ElevenLabs TTS

AI skill for adding ElevenLabs text-to-speech voiceover to Remotion videos. Auto-sizes composition duration to match generated audio.

ElevenLabs · Community

Prêt pour agents

Staging sûr pour cet actif

Cet actif est d'abord staged. Le prompt copié demande à l'agent d'inspecter les fichiers staged avant d'activer scripts, config MCP ou config globale.

Stage only · 29/100Policy : staging

Surface agent

Tout agent MCP/CLI

Type

Skill

Installation

Stage only

Confiance

Confiance : Community

Point d'entrée

Remotion AI Voiceover Skill — ElevenLabs TTS

Commande de staging sûr

npx -y tokrepo@latest install ff8cbccc-09d0-4b0e-a3e2-d58268ca2d39 --target codex

Stage les fichiers d'abord; l'activation exige la revue du README et du plan staged.

TL;DR

A Remotion skill that generates ElevenLabs TTS voiceover and auto-sizes video composition duration to match the audio length.

§01

What it is

This is a Remotion skill for adding AI-generated voiceover to programmatic videos using ElevenLabs text-to-speech. The skill generates speech audio per scene and uses Remotion's calculateMetadata function to dynamically size the video composition duration to match the generated audio. This ensures that the video and voiceover are always synchronized.

The skill targets developers creating automated video content with Remotion who want professional-quality voiceover without recording audio manually. It integrates ElevenLabs' voice synthesis directly into the Remotion build pipeline.

§02

How it saves time or tokens

Without this skill, adding voiceover to Remotion videos requires manually generating audio files, calculating their duration, and adjusting video composition length. This skill automates the entire flow: write text, generate audio, and the video adapts automatically. AI agents using this skill can produce narrated videos in a single pass.

§03

How to use

Install the Remotion skills: npx skills add remotion-dev/skills.
Set your ElevenLabs API key: export ELEVENLABS_API_KEY=your_key.
Use the voiceover pattern in your Remotion compositions with calculateMetadata for dynamic duration.

§04

Example

# Install Remotion skills
npx skills add remotion-dev/skills

# Set ElevenLabs API key
export ELEVENLABS_API_KEY=your_key

import { Audio, useCurrentFrame } from 'remotion';

// calculateMetadata generates TTS and returns duration
export const calculateMetadata = async ({ props }) => {
  const audio = await generateVoiceover(props.script);
  const durationInFrames = Math.ceil(audio.duration * 30);
  return {
    durationInFrames,
    props: { ...props, audioUrl: audio.url },
  };
};

export const NarratedScene = ({ audioUrl, script }) => {
  return (
    <>
      <Audio src={audioUrl} />
      <div style={{ fontSize: 24, padding: 40 }}>
        {script}
      </div>
    </>
  );
};

§05

Related on TokRepo

Video AI Tools — Video production tools
Voice AI Tools — Text-to-speech and voice synthesis

§06

Common pitfalls

ElevenLabs API calls cost money per character of generated speech. Long scripts accumulate costs. Estimate costs before generating voiceover for large video batches.
Audio generation adds build time to Remotion renders. Cache generated audio files to avoid regenerating unchanged voiceovers on every render.
Voice quality depends on the ElevenLabs model and voice selection. Test different voices for your content type before committing to a production workflow.

Questions fréquentes

What is ElevenLabs?+

ElevenLabs is an AI voice synthesis platform that generates realistic text-to-speech audio. It offers multiple voice models with different characteristics. This skill uses ElevenLabs' API to generate voiceover audio for Remotion videos.

How does auto-sizing work?+

The calculateMetadata function generates the voiceover audio, measures its duration, and sets the Remotion composition's durationInFrames accordingly. This ensures the video is exactly as long as the voiceover.

Can I use a custom voice?+

Yes. ElevenLabs supports custom cloned voices and a library of pre-made voices. Configure the voice ID in your skill settings or composition props.

Does this work without ElevenLabs?+

This specific skill requires ElevenLabs. For other TTS providers, you would need to modify the audio generation function to call a different API while keeping the calculateMetadata pattern.

How much does ElevenLabs cost?+

ElevenLabs offers a free tier with limited characters per month. Paid plans start at $5/month for more quota. Pricing is per character of generated speech. Check elevenlabs.io for current pricing.

Sources citées (3)

Remotion Skills GitHub— Remotion AI voiceover skill with ElevenLabs TTS integration
ElevenLabs— ElevenLabs text-to-speech API
Remotion Documentation— Remotion calculateMetadata for dynamic composition sizing

En lien sur TokRepo

Video tools Voice tools Featured workflows

🙏

Source et remerciements

Created by Remotion. Licensed under MIT. remotion-dev/skills — Voiceover rule

Fil de discussion

Connectez-vous pour rejoindre la discussion.

Aucun commentaire pour l'instant. Soyez le premier à partager votre avis.

Actifs similaires

Remotion Rule: Voiceover

Remotion skill rule: Adding AI-generated voiceover to Remotion compositions using TTS. Part of the official Remotion Agent Skill for programmatic video in React.

Skills

Skill Factory

Remotion AI Skill — Programmatic Video in React

Official Remotion Agent Skill for Claude Code and Codex. 30+ rules covering animations, transitions, captions, FFmpeg, audio visualization, voiceover, 3D, and more.

Skills

TokRepo精选

ElevenLabs ConvAI — Full-Duplex Voice Agent Platform

ElevenLabs ConvAI bundles STT, LLM, TTS, VAD, barge-in into one managed voice agent. Define prompt, attach tools, point at Twilio number.

Skills

ElevenLabs

ElevenLabs Python SDK — AI Text-to-Speech

Official ElevenLabs Python SDK for AI voice generation. Create realistic voiceovers with 30+ languages, voice cloning, and streaming support.

SkillsCLI Tools

ElevenLabs