Cette page est affichée en anglais. Une traduction française est en cours.

SkillsMar 29, 2026·2 min de lecture

Claude Code Agent: Prompt Engineer — Design & Test Prompts

Claude Code agent for designing, optimizing, and testing LLM prompts. Improves accuracy, reduces token usage, and benchmarks results.

Skill Factory · Community

Prêt pour agents

Installation avec revue préalable

Cet actif nécessite une revue. Le prompt copié demande un dry-run, affiche les écritures, puis continue seulement après confirmation.

Needs Confirmation · 62/100Policy : confirmer

Surface agent

Tout agent MCP/CLI

Type

Agent

Installation

Single

Confiance

Confiance : Established

Point d'entrée

Claude Code Agent: Prompt Engineer

Commande avec revue préalable

npx -y tokrepo@latest install 57eff515-8f7b-4a35-9078-98df47ac2d06 --target codex

Dry-run d'abord, confirmez les écritures, puis lancez cette commande.

TL;DR

A Claude Code agent skill that designs, optimizes, and benchmarks LLM prompts to improve accuracy and reduce token usage.

§01

What it is

This is a Claude Code agent skill that specializes in prompt engineering. It helps developers design, optimize, and test LLM prompts systematically. Instead of manual trial-and-error prompt iteration, this agent applies structured techniques: chain-of-thought decomposition, few-shot example selection, output format specification, and A/B testing against evaluation criteria.

It targets AI application developers, prompt engineers, and teams building LLM-powered features who want to improve prompt quality without spending hours on manual iteration.

§02

How it saves time or tokens

Manual prompt optimization is slow and subjective. This agent automates the iteration loop: write a prompt, test it against sample inputs, measure accuracy, suggest improvements, and re-test. It also identifies opportunities to reduce token usage by tightening instructions, removing redundant context, and restructuring prompts for efficiency.

§03

How to use

Load the Prompt Engineer skill in your Claude Code environment.
Provide your current prompt, the target LLM, and sample inputs with expected outputs.
The agent analyzes the prompt, suggests improvements, and optionally runs benchmarks to compare versions.

§04

Example

# Working with the Prompt Engineer agent:

User: Optimize this prompt for classification accuracy:
'Classify the following customer message as positive, negative, or neutral.'

Agent analysis:
- Missing: output format specification
- Missing: edge case handling (mixed sentiment)
- Suggestion: add few-shot examples
- Suggestion: specify JSON output format

Optimized prompt:
'Classify the customer message sentiment. Return JSON:
{"sentiment": "positive|negative|neutral", "confidence": 0.0-1.0}

Examples:
Input: "Great product, fast shipping" -> {"sentiment": "positive", "confidence": 0.95}
Input: "Item arrived damaged" -> {"sentiment": "negative", "confidence": 0.9}'

§05

Related on TokRepo

Prompt library — Browse reusable prompt templates and patterns
AI coding tools — Tools for AI-assisted development

§06

Common pitfalls

Optimizing for one model does not guarantee improvement on another. Prompts tuned for Claude may behave differently on GPT-4 or Gemini. Test on your target model.
Over-engineering prompts with too many constraints can reduce flexibility and increase token cost without meaningful accuracy gains.
Benchmarks need representative test data. If your evaluation set is too small or biased, optimization may overfit to those specific examples.

Questions fréquentes

What prompt engineering techniques does this agent apply?+

The agent applies chain-of-thought prompting, few-shot example selection, structured output formatting, role specification, constraint definition, and iterative refinement. It also identifies common anti-patterns like ambiguous instructions or missing edge case handling.

Can this agent test prompts against multiple LLM providers?+

The agent can design prompts and suggest optimizations for any LLM. Actual benchmark execution depends on which API keys and integrations are configured in your Claude Code environment.

How does it measure prompt quality?+

The agent evaluates prompts against user-provided test cases with expected outputs. It measures accuracy (correct vs incorrect), consistency (same input producing same output), and token efficiency (input + output token count). Improvement percentages are reported across iterations.

Is this useful for non-technical prompt writers?+

Yes. The agent accepts plain-language descriptions of what you want the prompt to achieve and translates that into optimized prompt text. You do not need to understand prompting techniques; the agent applies them for you.

How does this differ from using Claude directly for prompt help?+

This agent adds structured methodology: systematic evaluation, versioned prompt iterations, quantitative benchmarks, and best-practice pattern application. Raw Claude conversation provides one-off advice; this agent provides a repeatable optimization workflow.

Sources citées (3)

Anthropic Claude Code Docs— Claude Code provides agent skills for specialized tasks
Anthropic Prompt Engineering Guide— Prompt engineering best practices
arXiv: Chain-of-Thought Prompting (Wei et al.)— Chain-of-thought prompting improves reasoning

En lien sur TokRepo

Prompt library AI coding tools Featured workflows

🙏

Source et remerciements

Created by Claude Code Templates by davila7. Licensed under MIT. Install: npx claude-code-templates@latest --agent ai-specialists/prompt-engineer --yes

Fil de discussion

Connectez-vous pour rejoindre la discussion.

Aucun commentaire pour l'instant. Soyez le premier à partager votre avis.

Actifs similaires

Claude Code Agent: API Architect — Design REST & GraphQL APIs

Claude Code agent for API design. REST endpoints, GraphQL schemas, authentication, rate limiting, versioning, and documentation.

Skills

Skill Factory

Claude Code Agent: GraphQL Architect — Schema & Resolver Design

Claude Code agent for GraphQL development. Schema design, resolver patterns, subscriptions, federation, and performance optimization.

Skills

Skill Factory

Claude Code Agent: Game Designer — Mechanics & Balance

Claude Code agent for game design. Game mechanics, level design, balance tuning, economy systems, and player progression.

Skills

Skill Factory

Claude Code Agent: ML Engineer — Model Training & Deployment

Claude Code agent for machine learning. Model training, hyperparameter tuning, experiment tracking, and production deployment pipelines.

Skills

Skill Factory