Phoenix — Open Source AI Observability
Phoenix is an AI observability platform for tracing, evaluating, and debugging LLM apps. 9.1K+ stars. OpenTelemetry, evals, prompt management.
What it is
Phoenix is an open-source AI observability platform by Arize AI. It traces LLM application calls, evaluates output quality, and helps debug issues in retrieval-augmented generation (RAG) pipelines, agents, and chat applications. It uses OpenTelemetry for instrumentation and provides a web UI for exploring traces.
Phoenix targets ML engineers and developers building production LLM applications who need visibility into what their AI is doing, why it fails, and how to improve it.
How it saves time or tokens
Phoenix shows you exactly which prompts, retrievals, and tool calls happened in each request. When an LLM produces a bad answer, you trace the root cause (wrong documents retrieved, poor prompt, hallucination) without adding debug logging manually.
The evaluation framework lets you score outputs automatically, catching quality regressions before users report them.
How to use
- Install Phoenix:
pip install arize-phoenix - Start the Phoenix server:
phoenix serve - Instrument your LLM application with the Phoenix OpenTelemetry integration
- Open the web UI at
http://localhost:6006to explore traces
Example
import phoenix as px
from phoenix.otel import register
from openinference.instrumentation.openai import OpenAIInstrumentor
# Start Phoenix
px.launch_app()
# Register OpenTelemetry tracer
tracer_provider = register(project_name='my-app')
# Instrument OpenAI calls
OpenAIInstrumentor().instrument(tracer_provider=tracer_provider)
# Your LLM calls are now traced automatically
from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
model='gpt-4o',
messages=[{'role': 'user', 'content': 'What is RAG?'}]
)
Related on TokRepo
- Phoenix on AI gateway -- Phoenix as an observability provider
- Monitoring tools -- Application monitoring and observability
Common pitfalls
- Phoenix traces can grow large for high-throughput apps; configure sampling rates for production
- OpenTelemetry instrumentation adds slight latency; benchmark before deploying to latency-sensitive endpoints
- The evaluation framework requires labeled data or LLM-as-judge setup; plan your eval strategy before instrumenting
Frequently Asked Questions
Both provide LLM tracing and evaluation. Phoenix is open source and self-hosted. LangSmith is a managed service by LangChain. Phoenix uses standard OpenTelemetry; LangSmith uses proprietary instrumentation. Choose Phoenix for self-hosting and vendor independence.
Yes. Phoenix traces retrieval steps including document chunks, similarity scores, and reranking. You can see exactly which documents were retrieved and whether they were relevant to the query, helping debug RAG quality issues.
Yes. Phoenix supports OpenAI, Anthropic, Google, and any provider through OpenTelemetry-compatible instrumentation libraries. The openinference library provides auto-instrumentors for popular frameworks.
By default, Phoenix stores traces in memory for the session. For persistence, configure a database backend like PostgreSQL. The managed Arize platform provides long-term storage and additional features.
Phoenix supports relevance scoring, hallucination detection, toxicity checks, and custom evaluations. You can use LLM-as-judge evaluations where a model scores the output, or write custom evaluation functions.
Citations (3)
- Phoenix GitHub— Phoenix is an AI observability platform with 9.1K+ GitHub stars
- Phoenix Docs— OpenTelemetry-based instrumentation for LLM applications
- OpenTelemetry— OpenTelemetry observability framework
Related on TokRepo
Source & Thanks
Arize-ai/phoenix — 9,100+ GitHub stars
Discussion
Related Assets
HumHub — Open-Source Enterprise Social Network
A flexible, open-source social networking platform built on Yii2 for creating private communities, intranets, and collaboration spaces within organizations.
Dolibarr — Open-Source ERP & CRM for Business Management
A modular open-source ERP and CRM application written in PHP for managing contacts, invoices, orders, inventory, accounting, and more from a single web interface.
PrestaShop — Open-Source PHP E-Commerce Platform
A widely adopted open-source e-commerce platform written in PHP with a rich module marketplace, multi-language support, and a strong European user base.