Awesome Prompt Engineering — Papers, Tools & Courses
Hand-curated collection of 60+ papers, 50+ tools, benchmarks, and courses for prompt engineering and context engineering. Covers CoT, RAG, agents, security, and multimodal. Apache 2.0.
Instalación lista para agent
Este activo puede instalarse después de elegir el runtime, revisar el plan y ejecutar el comando correspondiente.
npx -y tokrepo@latest install 1b3fa22b-1246-42e5-a6d4-f5211029f6ef --target codexEjecutar después de confirmar el plan con dry-run.
Why curation beats search
Google "prompt engineering" and you get 800 million results. 700 million are repackaged Twitter threads. The remaining 100 million contain the 60 papers that actually matter and a long tail of noise. Awesome Prompt Engineering is the filtered signal — a maintained repo of the work that practitioners actually cite and tools they actually use.
What's inside
| Category | Count | Examples |
|---|---|---|
| Survey papers | 14 | Brown et al. "GPT-3 Few-shot", Wei et al. "CoT" |
| Technique papers | 47 | ReAct, Tree of Thoughts, Self-Consistency, PAL |
| Benchmarks | 10 | MMLU, BIG-Bench, HELM, HumanEval |
| Tools | 52 | PromptFlow, Guidance, DSPy, Instructor |
| Courses | 18 | Andrew Ng Short Course, DeepLearning.AI, Cohere LLM University |
| Jailbreak / defense | 23 | Garak, Rebuff, NeMo Guardrails |
The 5 papers you must read first
- Chain-of-Thought Prompting (Wei et al., 2022) — the foundational technique.
- ReAct: Reasoning and Acting (Yao et al., 2022) — the pattern underlying most agents.
- Tree of Thoughts (Yao et al., 2023) — deliberate search over reasoning paths.
- Constitutional AI (Bai et al., 2022) — how Anthropic shapes model behavior.
- DSPy: Compiling Declarative LM Calls (Khattab et al., 2023) — the modern alternative to string prompts.
Tools by use case
Structured output
- Instructor (Python) — Pydantic-first structured output for any LLM
- Zod Schema (TS) — runtime type validation for OpenAI function calls
Prompt compilation
- DSPy — declarative programs compiled into optimized prompts
- Guidance — templating with control-flow primitives
Evaluation
- Promptfoo — A/B test prompts across models
- DeepEval — pytest-style LLM evals
Red teaming
- Garak — LLM vulnerability scanner
- Rebuff — prompt injection detection library
Context engineering — the next wave
"Context engineering" emerged in 2025 as the umbrella term for everything beyond prompt wording: retrieval relevance, token budget allocation, memory strategies, tool orchestration. The repo added a dedicated section tracking the ~20 papers on context compression, session management, and long-context scaling laws.
How to use the list in practice
- Starting fresh? Read the 5 seed papers above plus one survey per month.
- Building a production app? Pick one tool per axis: structured output, eval, red-team.
- Running an AI team? Use the courses section as onboarding material — each has quizzes.
- Doing research? Follow the papers' citation graph; the repo links arxiv + GitHub repros.
Update cadence and license
The list is updated roughly monthly, with 5-15 new entries per release. Entries are removed when linked projects die or when newer work supersedes them. Apache 2.0 license — fork freely.
Preguntas frecuentes
This list is maintained by active practitioners with quarterly curation reviews. It removes outdated entries and ranks papers by citation counts and practical impact rather than listing everything that exists. Each entry has a short review, not just a title and link.
Chain-of-Thought Prompting by Wei et al. 2022, ReAct by Yao et al. 2022, and Constitutional AI by Bai et al. 2022. These three cover the foundations: reasoning, action, and behavior shaping. All are freely available on arxiv.
Yes. A dedicated section added in 2025 tracks papers on context compression, retrieval relevance, session memory, and long-context scaling. This is where the field is moving beyond single-prompt optimization.
The tools section prioritizes actively maintained libraries with real production use. DSPy, Instructor, Promptfoo, Guardrails AI, and Garak are all battle-tested. Each entry notes maturity and preferred use cases.
Roughly monthly with 5-15 new entries per release. The maintainers run a quarterly curation review that also removes dead projects and superseded papers.
Referencias (3)
- Awesome Prompt Engineering GitHub— 60+ papers, 50+ tools, 10 benchmarks curated
- arxiv— Chain-of-Thought prompting introduced by Wei et al. 2022
- Stanford DSPy— DSPy compiler for declarative LM programs
Relacionados en TokRepo
Fuente y agradecimientos
Maintained by PromtsLab. Licensed under Apache 2.0.
Awesome-Prompt-Engineering — ⭐ 5,700+
Discusión
Activos relacionados
Prompt Engineering Guide — 73K-Star Learning Hub
The most comprehensive open-source prompt engineering resource with guides, papers, notebooks, and courses covering chain-of-thought, RAG, AI agents, and 13 languages. 3M+ learners, MIT.
Prompt Engineering Techniques — 20+ Methods
Comprehensive prompt engineering collection with 20+ techniques. 7.3K+ stars. Chain-of-Thought, ReAct, Tree-of-Thought, runnable notebooks.
Awesome Context Engineering — Prompt to Production
Awesome Context Engineering is a survey of papers, frameworks, and guides bridging prompt engineering to production-grade agent systems.
Prompt Perfect — System Prompt Engineering Templates
Battle-tested system prompt templates for building LLM personas, agents, and workflows. Structured formats for role definition, constraints, and output control. 4,000+ GitHub stars.