Scripts · May 11, 2026 · 2 min read

AgentEval — .NET Toolkit for Agent Evaluation

Intro

AgentEval is a .NET evaluation toolkit for AI agents that validates tool usage, scores RAG quality, compares models, and exports regression-ready reports.

  • Best for: .NET teams building tool-using agents who want evaluation code that lives next to unit tests
  • Works with: .NET 8+ apps; integrates with agent frameworks and CI pipelines; ships as a NuGet package
  • Setup time: 15 minutes

Practical Notes

  • Setup time: ~15 minutes (add the NuGet package and run one starter eval)
  • Runs alongside your tests: the fastest check is dotnet test with evaluation assertions enabled (see the command below)
  • GitHub stars and forks (verified) are listed under Source & Thanks
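
For example, if your eval tests carry a hypothetical xUnit trait such as [Trait("Category", "AgentEval")] (the trait name is ours, not the library's), you can run just those checks:

```bash
# Run only the evaluation tests; "Category=AgentEval" is an illustrative
# trait you would assign yourself, not an AgentEval built-in.
dotnet test --filter "Category=AgentEval"
```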

AgentEval is most useful when you treat tool usage as a contract. Instead of only judging the final text, assert the following (a test sketch follows the list):

  • The agent called the expected tools (and did not call forbidden ones).
  • The tool inputs are well-formed and minimally scoped.
  • Retrieval answers are grounded (your RAG checks pass consistently).
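
Here is a minimal sketch of the first two checks as a plain xUnit test, assuming you can capture the agent's tool calls as a trace. AgentEval's actual trace types and assertion helpers will differ, and every name below (the tools, the record shape, the bounds) is illustrative:

```csharp
using System.Collections.Generic;
using System.Linq;
using Xunit;

// Hypothetical shape for one captured tool invocation. AgentEval's own
// trace types will differ; this record only illustrates the contract idea.
public record ToolCall(string Name, IReadOnlyDictionary<string, string> Arguments);

public class RefundWorkflowContractTests
{
    [Fact]
    public void Agent_calls_only_allowed_tools_with_scoped_inputs()
    {
        // In practice this trace would be captured from an agent run
        // against a golden conversation, not written by hand.
        var trace = new List<ToolCall>
        {
            new("lookup_order", new Dictionary<string, string> { ["order_id"] = "A-1001" }),
            new("issue_refund", new Dictionary<string, string> { ["order_id"] = "A-1001", ["amount"] = "19.99" }),
        };

        // The expected tools were called, in order.
        Assert.Equal(new[] { "lookup_order", "issue_refund" }, trace.Select(c => c.Name));

        // No forbidden tool was called.
        Assert.DoesNotContain(trace, c => c.Name == "delete_account");

        // Tool inputs are well-formed and minimally scoped.
        var refund = trace.Single(c => c.Name == "issue_refund");
        Assert.True(decimal.TryParse(refund.Arguments["amount"], out var amount));
        Assert.InRange(amount, 0.01m, 500m);
    }
}
```

Because the trace is just data, the same assertions rerun in CI whenever the model or prompt changes, which is what makes the contract regression-ready.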

Because this repo is explicitly labeled as preview/experimental, pin versions in CI and keep an upgrade checklist (baseline scores + golden traces) before bumping.
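
For the pinning itself, NuGet lock files are the standard .NET mechanism. A minimal sketch, assuming the package ID matches the repo name (both the ID and the version below are unverified placeholders):

```xml
<!-- Test project .csproj: generate packages.lock.json and pin the version.
     "AgentEval" and "0.1.0" are placeholders; confirm the real ID on nuget.org. -->
<PropertyGroup>
  <RestorePackagesWithLockFile>true</RestorePackagesWithLockFile>
</PropertyGroup>
<ItemGroup>
  <PackageReference Include="AgentEval" Version="0.1.0" />
</ItemGroup>
```

In CI, run dotnet restore --locked-mode so the build fails whenever the resolved package graph drifts from packages.lock.json, turning every version bump into a deliberate, reviewable change.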

FAQ

Q: Is this production-ready? A: The repo warns it is preview/experimental. Use it in CI with pinned versions and your own validation before shipping.

Q: Can I evaluate tool calls, not just text? A: Yes — tool usage validation is a first-class goal in the project description.

Q: How do I start fast? A: Add the NuGet package, follow the Getting Started guide, and turn one high-risk workflow into an eval test.
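
In command form, assuming the package ID matches the repo name (verify on nuget.org first):

```bash
dotnet add package AgentEval   # package ID assumed from the repo name
dotnet test                    # eval assertions run with the rest of the suite
```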

Source & Thanks

Source: https://github.com/AgentEvalHQ/AgentEval · License: MIT · GitHub stars: 89 · forks: 8
