What is Replicate?
A cloud platform for running open-source AI models through a simple API. No GPU management, 10,000+ models, per-second billing.
TL;DR: Cloud platform for running open-source AI models. 10,000+ models including Llama / SDXL / Whisper. One-line API calls. Per-second billing. Supports custom models via Cog.
Best for: Developers who want to use open-source models without managing GPUs.
Core Features
1. One-Line Run — replicate.run("model", input={...})
2. 10,000+ Models — text / image / audio / video
3. Custom Models — package and push with Cog
4. Per-Second Billing — no minimum spend
FAQ
Q: Cold start time? A: 10–30s the first time, then faster. Use always-on for instant startup.