Configs2026年5月30日·1 分钟阅读

Dograh — Open-Source Self-Hosted Voice AI Platform

Dograh is an open-source voice AI platform for building conversational voice agents with telephony support. It offers a self-hosted alternative to cloud voice APIs with a visual workflow builder, MCP integration, and support for speech-to-speech and LLM-based pipelines.

Agent 就绪

Agent 可直接安装

这个资产可安装;Agent 先选择当前运行时、检查安装计划,再运行匹配命令。

Native · 98/100策略:允许
Agent 入口
任意 MCP/CLI Agent
类型
Skill
安装
Single
信任
信任等级:Established
入口
Dograh
直接安装命令
npx -y tokrepo@latest install 173db922-5c22-11f1-9bc6-00163e2b0d79 --target codex

先 dry-run 确认安装计划,再运行此命令。

Introduction

Dograh provides the infrastructure for building voice AI agents that handle phone calls, voice interactions, and real-time conversations. It runs on your own servers, giving you full control over voice data and eliminating per-minute cloud API costs.

What Dograh Does

  • Processes inbound and outbound voice calls with AI agents
  • Provides a visual no-code workflow builder for designing call flows
  • Supports speech-to-speech and LLM-driven conversation pipelines
  • Integrates with telephony systems via SIP and WebRTC
  • Connects to local LLMs and TTS/STT models for on-premise deployment

Architecture Overview

Dograh is built in Python with a modular pipeline architecture. Voice input flows through configurable stages: speech-to-text, language model processing, and text-to-speech. The visual workflow builder generates pipeline configurations that the runtime engine executes. Telephony integration uses Asterisk ARI for SIP connectivity, while WebRTC handles browser-based voice interactions.

Self-Hosting & Configuration

  • Deploy with Docker Compose for a complete stack setup
  • Configure telephony providers through the web dashboard
  • Connect your own STT, LLM, and TTS models or use integrated defaults
  • Set up SIP trunks for production phone number integration
  • Scale horizontally by adding worker nodes for concurrent call handling

Key Features

  • Visual workflow builder for designing voice agent logic without code
  • Native MCP integration for connecting AI agents to external tools
  • Support for both cloud and local AI model backends
  • Real-time speech-to-speech mode for low-latency conversations
  • Telephony-ready with SIP, WebRTC, and PSTN connectivity

Comparison with Similar Tools

  • Vapi — cloud-based voice AI API with per-minute billing; Dograh is self-hosted with no usage fees
  • Retell AI — managed voice agent platform; Dograh provides equivalent features as open-source software
  • LiveKit Agents — real-time communication focused on WebRTC; Dograh adds telephony and visual workflow design
  • Pipecat — Python framework for voice pipelines; Dograh adds a no-code builder and telephony integration on top

FAQ

Q: Can Dograh handle production phone calls? A: Yes. It integrates with SIP trunks and supports inbound and outbound calling through standard telephony protocols.

Q: What AI models does it support? A: It works with any OpenAI-compatible API, local models via Ollama, and specialized STT/TTS engines like Whisper and Piper.

Q: How many concurrent calls can it handle? A: Concurrency depends on your hardware and model choices. Worker nodes can be scaled horizontally for higher throughput.

Q: Is it suitable for customer service automation? A: Yes. The workflow builder supports conditional logic, knowledge base lookups, and handoff to human agents.

Sources

讨论

登录后参与讨论。
还没有评论,来写第一条吧。

相关资产