Scripts · Mar 30, 2026 · 2 min read

LiveKit Agents — Build Real-Time Voice AI Agents

Framework for building real-time voice AI agents. STT, LLM, TTS pipeline with sub-second latency. Supports OpenAI, Anthropic, Deepgram, ElevenLabs. 9.9K+ stars.

Introduction

LiveKit Agents is a framework for building real-time, multimodal voice AI agents. It provides a pipeline architecture for Speech-to-Text (STT), LLM processing, and Text-to-Speech (TTS), with sub-second end-to-end latency as its stated target. It is built on LiveKit's WebRTC infrastructure for production-grade voice communication, and it supports the OpenAI Realtime API, Anthropic, Deepgram, ElevenLabs, and more. 9,900+ GitHub stars.

Best for: Developers building voice assistants, phone agents, video call AI, and conversational interfaces

Works with: OpenAI, Anthropic, Deepgram, ElevenLabs, Azure, Google, Cartesia, AssemblyAI


Key Features

Voice Pipeline

Modular STT → LLM → TTS pipeline with automatic voice activity detection (VAD):

  • STT: OpenAI Whisper, Deepgram, Google, Azure, AssemblyAI
  • LLM: OpenAI (including Realtime), Anthropic, Google, Ollama
  • TTS: OpenAI, ElevenLabs, Cartesia, Azure, Google
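The modular design described above can be illustrated with a minimal sketch in plain Python. This is not the LiveKit API: `VoicePipeline`, the three stage type aliases, and the `fake_*` providers are hypothetical names made up for illustration, and VAD is left out for brevity. The point is only that each stage sits behind a small interface, so a provider can be swapped without touching the others.

```python
import asyncio
from dataclasses import dataclass
from typing import Awaitable, Callable

# Hypothetical stage signatures for illustration; these are not LiveKit APIs.
SpeechToText = Callable[[bytes], Awaitable[str]]
LanguageModel = Callable[[str], Awaitable[str]]
TextToSpeech = Callable[[str], Awaitable[bytes]]


@dataclass
class VoicePipeline:
    """Chains three swappable stages: audio in -> text -> reply -> audio out."""

    stt: SpeechToText
    llm: LanguageModel
    tts: TextToSpeech

    async def respond(self, audio: bytes) -> bytes:
        transcript = await self.stt(audio)
        reply = await self.llm(transcript)
        return await self.tts(reply)


# Stub providers standing in for real STT/LLM/TTS services.
async def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")  # pretend the audio bytes are the transcript


async def fake_llm(prompt: str) -> str:
    return f"You said: {prompt}"


async def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")  # pretend the encoded text is synthesized audio


if __name__ == "__main__":
    pipeline = VoicePipeline(stt=fake_stt, llm=fake_llm, tts=fake_tts)
    print(asyncio.run(pipeline.respond(b"hello")))
```

Swapping a provider in this sketch means passing a different coroutine for one field; the other two stages are unaffected.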

Sub-Second Latency

Optimized for real-time conversation, with streaming at every stage so that each step can start before the previous one has finished. Turn detection and interruption handling are built in.
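As a rough illustration of streaming plus interruption handling, the sketch below uses only the standard library. It does not show how LiveKit implements either feature: `llm_tokens`, `speak`, and `run_turn` are hypothetical names, the token delay is simulated, and the interruption is modelled simply as cancelling the asyncio task that is producing the reply.

```python
import asyncio
from typing import AsyncIterator, List, Optional


async def llm_tokens(prompt: str) -> AsyncIterator[str]:
    """Hypothetical streaming LLM: yields the reply one word at a time."""
    for word in f"Echoing your request: {prompt}".split():
        await asyncio.sleep(0.01)  # simulated generation delay per token
        yield word


async def speak(prompt: str, spoken: List[str]) -> None:
    """Consume tokens as they arrive instead of waiting for the full reply."""
    async for token in llm_tokens(prompt):
        spoken.append(token)  # a real agent would hand the token to TTS here


async def run_turn(prompt: str, interrupt_after: Optional[float] = None) -> List[str]:
    """Run one agent turn; cancel it early if the user starts talking."""
    spoken: List[str] = []
    task = asyncio.create_task(speak(prompt, spoken))
    if interrupt_after is not None:
        await asyncio.sleep(interrupt_after)  # the user barges in at this point
        task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass  # turn was interrupted; keep whatever was already spoken
    return spoken


if __name__ == "__main__":
    print(asyncio.run(run_turn("hi")))
    print(asyncio.run(run_turn("a b c d e f g h i j", interrupt_after=0.025)))
```

Because tokens are consumed as they are produced, an interrupted turn still leaves a partial record of what was said, which an agent could use to resume or rephrase.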

Multimodal

Beyond voice — supports video input, screen sharing, and data channels for rich agent interactions.

Production Infrastructure

Built on LiveKit's WebRTC platform — handles scaling, room management, recording, and telephony (SIP/PSTN).

Function Calling

Agents can use tools mid-conversation:

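from livekit.agents import function_tool  # assumes livekit-agents 1.x

@function_tool
async def check_weather(city: str) -> str:
    """Return the weather for a city (hard-coded here for illustration)."""
    return f"It's 72F and sunny in {city}"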
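To make the mechanics of mid-conversation tool use more concrete, here is a self-contained sketch of what a framework does with such a function: it registers the function under its name and runs it when the LLM emits a tool-call request. The `TOOLS` registry, the `tool` decorator, `dispatch`, and the shape of the `call` dictionary are all assumptions made for illustration, not LiveKit APIs.

```python
import asyncio
from typing import Any, Awaitable, Callable, Dict

# Hypothetical registry for illustration; not part of the LiveKit API.
TOOLS: Dict[str, Callable[..., Awaitable[str]]] = {}


def tool(fn: Callable[..., Awaitable[str]]) -> Callable[..., Awaitable[str]]:
    """Register an async function so it can be looked up by name."""
    TOOLS[fn.__name__] = fn
    return fn


@tool
async def check_weather(city: str) -> str:
    return f"It's 72F and sunny in {city}"


async def dispatch(call: Dict[str, Any]) -> str:
    """Run the tool named in an LLM tool-call request and return its result."""
    fn = TOOLS.get(call["name"])
    if fn is None:
        return f"Unknown tool: {call['name']}"
    return await fn(**call["arguments"])


if __name__ == "__main__":
    request = {"name": "check_weather", "arguments": {"city": "Paris"}}
    print(asyncio.run(dispatch(request)))
```

The string returned by `dispatch` would be fed back to the LLM as the tool result, so the model can phrase the spoken answer.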

FAQ

Q: What is LiveKit Agents?
A: A framework for building real-time voice AI agents with STT/LLM/TTS pipelines and sub-second latency. Built on LiveKit's WebRTC infrastructure. 9.9K+ stars.

Q: Can I build a phone agent with LiveKit Agents?
A: Yes, LiveKit supports SIP/PSTN telephony integration, so you can build agents that answer phone calls.



Source and Acknowledgments

Created by LiveKit. Licensed under Apache 2.0. livekit/agents — 9,900+ GitHub stars
