Pipeline
Video → Whisper (transcription) → LLM (correction & segmentation)
→ Translation (99 languages) → Styled subtitle export (SRT/ASS)Features
- Whisper transcription: Multiple backend options for accuracy
- LLM correction: Fixes recognition errors, improves sentence segmentation
- 99 languages: Full multilingual translation support
- Styled output: SRT and ASS format with customizable fonts, colors, positions
- Batch processing: Process multiple videos in queue
- GUI: User-friendly interface, no command-line needed
FAQ
Q: What is VideoCaptioner? A: An AI-powered subtitle generation tool that combines Whisper speech recognition with LLM-based correction and 99-language translation. 13,800+ GitHub stars.
Q: Is VideoCaptioner free? A: Yes, the tool is GPL-3.0 licensed. You need API keys for the LLM correction service.