WorkflowsApr 2, 2026·2 min read
Agenta — Open-Source LLMOps Platform
Prompt playground, evaluation, and observability in one platform. Compare prompts, run evals, trace production calls. 4K+ stars.
TO
TokRepo精选 · Community
Quick Use
Use it first, then decide how deep to go
This block should tell both the user and the agent what to copy, install, and apply first.
```bash
pip install agenta
```
```python
import agenta as ag
ag.init()
@ag.instrument()
def generate_response(prompt: str, model: str = "gpt-4o"):
from openai import OpenAI
client = OpenAI()
response = client.chat.completions.create(
model=model,
messages=[{"role": "user", "content": prompt}]
)
return response.choices[0].message.content
# Calls are automatically traced and logged
result = generate_response("Explain RAG in 2 sentences")
```
Launch the full platform:
```bash
agenta init
agenta serve
```
Open `http://localhost:3000` — the LLMOps dashboard is ready.
---
Intro
Agenta is an open-source LLMOps platform with 4,000+ GitHub stars that combines prompt engineering, evaluation, and observability in a single tool. It provides a visual prompt playground for iterating on prompts, automated evaluation pipelines for measuring quality, A/B testing for comparing prompt variants, and production tracing for monitoring live applications. Instead of juggling separate tools for each stage of the LLM development lifecycle, Agenta unifies them into one self-hostable platform.
Works with: OpenAI, Anthropic, Google, Mistral, local models, LangChain, LlamaIndex. Best for teams iterating on LLM applications who need prompt management + evaluation + observability together. Setup time: under 5 minutes.
---
🙏
Source & Thanks
> Created by [Agenta AI](https://github.com/Agenta-AI). Licensed under Apache-2.0.
>
> [agenta](https://github.com/Agenta-AI/agenta) — ⭐ 4,000+
Discussion
Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.
Related Assets
OpenLIT — OpenTelemetry LLM Observability
Monitor LLM costs, latency, and quality with OpenTelemetry-native tracing. GPU monitoring and guardrails built in. 2.3K+ stars.
TokRepo精选
Rerun — Visualize Multimodal AI Data in Real-Time
SDK for logging, storing, and visualizing 3D, images, time series, and text in real-time. Built for robotics and AI. 10K+ stars.
TokRepo精选
Unstructured — Document ETL for LLM Pipelines
Extract clean data from PDFs, DOCX, HTML, images, and emails for RAG and LLM ingestion. 14K+ GitHub stars.
TokRepo精选