Prompts · May 11, 2026 · 2 min read

promptfoo-action — Run Prompt Evals in GitHub CI

Add promptfoo-action to GitHub Actions to run prompt/agent evals on PRs or pushes, cache results, and comment a before/after report for safer iteration.

Agent-ready

Safe staging for this asset

This asset is staged first. The copied prompt asks you to inspect the staged files before activating scripts, MCP config, or global config.

Stage only · 27/100 · Policy: staging
Agent surface: Any MCP/CLI agent
Type: Prompt
Installation: Stage only
Trust: Established
Entry: Asset

Safe staging command
npx -y tokrepo@latest install 1eecb87d-ec62-4982-828d-18dd9a031695 --target codex

Files are staged first; activation requires reviewing the README and the staged plan.

Introduction

  • Best for: teams shipping prompts/agents who want CI regression checks and a human-reviewable report in PRs
  • Works with: GitHub Actions, promptfoo configs (YAML/JSON), and optional caching via actions/cache (per repo docs)
  • Setup time: 13 minutes

Quantitative Notes

  • GitHub stars + forks (verified): see Source & Thanks
  • Action writes results to output.json (repo docs)
  • Setup time ~13 minutes (workflow + one config file)

Practical Notes

A minimal workflow runs evals on PRs that touch prompts/** and stores output.json as an artifact. The core steps are:

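# Check out the repository so the config and prompt files are available
- uses: actions/checkout@v4
# Run the promptfoo evals and report the results on the PR
- uses: promptfoo/promptfoo-action@v1
  with:
    github-token: ${{ secrets.GITHUB_TOKEN }}
    config: promptfooconfig.yaml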
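
The steps above are only a fragment. A fuller workflow file might look like the sketch below, which adds the prompts/** path filter and the artifact upload mentioned earlier. The file name, job name, and artifact name are hypothetical, and the upload step assumes output.json is written to the workspace root; check the action's docs for your pinned version.

```yaml
# .github/workflows/prompt-eval.yml (hypothetical file name)
name: Prompt evals
on:
  pull_request:
    paths:
      - 'prompts/**'
      - 'promptfooconfig.yaml'
jobs:
  evaluate:
    runs-on: ubuntu-latest
    permissions:
      contents: read
      pull-requests: write   # needed so the action can comment on the PR
    steps:
      - uses: actions/checkout@v4
      - uses: promptfoo/promptfoo-action@v1
        with:
          github-token: ${{ secrets.GITHUB_TOKEN }}
          config: promptfooconfig.yaml
      # Assumption: output.json lands in the workspace root
      - uses: actions/upload-artifact@v4
        if: always()           # keep the report even when the eval step fails
        with:
          name: promptfoo-output
          path: output.json
          if-no-files-found: warn
```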

Start with a small test set, then expand coverage once the report format fits your review process.
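
As an illustration of a small starting test set, a promptfooconfig.yaml could be as short as the sketch below. The prompt text, provider ID, and test case are made-up placeholders, not taken from the action's docs; swap in your own.

```yaml
# promptfooconfig.yaml (illustrative values only)
description: Summarization smoke test
prompts:
  - 'Summarize in one sentence: {{text}}'
providers:
  - openai:gpt-4o-mini   # assumption: replace with the provider you use
tests:
  - vars:
      text: 'The deploy failed because the database migration timed out.'
    assert:
      # Case-insensitive substring check on the model output
      - type: icontains
        value: 'migration'
```

One or two cheap assertions like this are usually enough to confirm the PR report renders the way reviewers expect before adding more cases.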

Safety note: Treat eval configs like code: review provider keys, red-team prompts, and data files; avoid leaking secrets in logs.
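
One way to keep provider keys out of the repository is to pass them from GitHub secrets, as sketched below. This assumes a repository secret named OPENAI_API_KEY (a name you choose) and that your pinned version of the action accepts an openai-api-key input, as its README describes.

```yaml
- uses: promptfoo/promptfoo-action@v1
  with:
    github-token: ${{ secrets.GITHUB_TOKEN }}
    config: promptfooconfig.yaml
    # Assumption: OPENAI_API_KEY is a repository secret you created;
    # GitHub masks secret values in workflow logs
    openai-api-key: ${{ secrets.OPENAI_API_KEY }}
```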

FAQ

Q: Do I need to host anything? A: No. It runs in GitHub Actions and uses promptfoo under the hood.

Q: Can I gate merges on quality? A: Yes. Use the action's threshold/fail options so the CI job fails when the eval success rate drops below your bar.
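
A hedged sketch of such a gate, assuming the fail-on-threshold input (a success-rate percentage) is available in your pinned version of the action; the value 80 is an arbitrary example:

```yaml
- uses: promptfoo/promptfoo-action@v1
  with:
    github-token: ${{ secrets.GITHUB_TOKEN }}
    config: promptfooconfig.yaml
    # Assumption: fails the step when the success rate is below 80%
    fail-on-threshold: 80
```

To block merges, the job would also need to be marked as a required status check in the branch protection settings.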

Q: How do I keep costs down? A: Cache results and limit concurrency; run evals only on prompt-related paths.
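
A rough sketch of the caching and concurrency settings, assuming the cache-path and max-concurrency inputs exist in your pinned version of the action. The cache directory and key are example values.

```yaml
# Restore and save the promptfoo cache between runs
- uses: actions/cache@v4
  with:
    path: ~/.cache/promptfoo
    key: ${{ runner.os }}-promptfoo-v1
    restore-keys: |
      ${{ runner.os }}-promptfoo-
- uses: promptfoo/promptfoo-action@v1
  with:
    github-token: ${{ secrets.GITHUB_TOKEN }}
    config: promptfooconfig.yaml
    cache-path: ~/.cache/promptfoo   # must match the cached path above
    max-concurrency: 2               # assumption: caps parallel provider calls
```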



Source & Thanks

  • GitHub: https://github.com/promptfoo/promptfoo-action
  • Owner avatar: https://avatars.githubusercontent.com/u/137907881?v=4
  • License (SPDX): MIT
  • GitHub stars (verified via api.github.com/repos/promptfoo/promptfoo-action): 65
  • GitHub forks (verified via api.github.com/repos/promptfoo/promptfoo-action): 31
