Prompts · Apr 6, 2026 · 2 min read

AI Scientist — Automated Research Paper Generation

Fully automated AI system that conducts research, runs experiments, and writes complete scientific papers. Generates novel ideas, implements them, and produces LaTeX manuscripts. 12,000+ stars.

Prompt Lab · Community

Agent ready

Review-first install path

This asset needs a review step. The copied prompt tells the agent to dry-run, show the writes, then proceed only after confirmation.

Needs Confirmation · 64/100 · Policy: confirm

Agent surface

Any MCP/CLI agent

Kind

Prompt

Install

Single

Trust

Community

Entrypoint

AI Scientist — Automated Research Paper Generation

Review-first command

npx -y tokrepo@latest install 0a2623ca-92b3-4fba-82e0-fc9a7cda45bd --target codex

Dry-run first, confirm the writes, then run this command.

TL;DR

AI Scientist generates research ideas, runs experiments, analyzes results, and writes complete LaTeX papers autonomously.

§01

What it is

AI Scientist is a fully automated research system by Sakana AI that takes a research template and compute budget, then autonomously generates novel ideas, designs experiments, runs code, analyzes results, and produces complete scientific papers in LaTeX. The output includes literature review, methodology, results, and discussion sections.

This tool targets researchers exploring AI-assisted scientific discovery, labs looking to accelerate hypothesis exploration, and anyone studying the frontier of automated research. It works with Claude, GPT-4, and Gemini as the underlying LLM.

§02

How it saves time or tokens

The traditional research cycle from idea to manuscript takes weeks or months of manual work. AI Scientist compresses the entire pipeline into a single command: it generates multiple candidate ideas, implements each as code, runs the experiments, and writes up the findings, with no human intervention between steps.

The token_estimate for this workflow is approximately 2,800 tokens per run. The system supports batching multiple ideas in a single launch, amortizing setup overhead.

§03

How to use

Clone and install dependencies:

git clone https://github.com/SakanaAI/AI-Scientist.git
cd AI-Scientist
pip install -r requirements.txt

Run the full pipeline with your chosen model and experiment template:

python launch_scientist.py \
  --model claude-sonnet-4-20250514 \
  --experiment nanoGPT \
  --num-ideas 5

Collect the output LaTeX manuscripts from the results directory.
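
One way to collect the manuscripts is a small script that walks the results directory. This is a minimal sketch, not part of AI Scientist: `find_manuscripts` is a hypothetical helper, and the subdirectory layout and the `.pdf`/`.tex` suffixes are assumptions about what a run leaves behind.

```python
from pathlib import Path


def find_manuscripts(results_root, suffixes=(".pdf", ".tex")):
    """Return sorted paths of manuscript-like files under results_root.

    Hypothetical helper: the directory layout and suffixes are assumptions.
    """
    root = Path(results_root)
    if not root.is_dir():
        # Nothing to collect if the run has not produced a results folder.
        return []
    return sorted(
        path for path in root.rglob("*")
        if path.is_file() and path.suffix.lower() in suffixes
    )


if __name__ == "__main__":
    for manuscript in find_manuscripts("results"):
        print(manuscript)
```

Searching recursively avoids hard-coding how each idea's output folder is named.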

§04

Example

The system generates structured research ideas before implementing them:

Input: 'Improve training efficiency of small language models'

Generated Ideas:
1. Adaptive learning rate scheduling based on gradient noise
2. Curriculum learning with dynamic difficulty assessment
3. Sparse attention patterns for resource-constrained training

For each idea, AI Scientist writes experiment code, runs it, and produces a full paper:

# Check generated papers
ls results/nanoGPT/
# idea_1_paper.pdf  idea_2_paper.pdf  idea_3_paper.pdf

§05

Related on TokRepo

AI Tools for Research -- Research automation and discovery tools
Prompt Library -- Reusable prompts for various AI workflows

§06

Common pitfalls

Generated papers should be treated as drafts requiring human review. The system may produce plausible-sounding but incorrect analysis.
Experiment templates constrain the research scope. Without a well-designed template, the system may explore unproductive directions.
API costs can accumulate quickly when generating multiple ideas with large models. Set --num-ideas conservatively for initial runs.
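
To keep early runs cheap, the launch command can be assembled by a wrapper that caps the number of ideas and prints the command for review before anything executes. This is a sketch under stated assumptions: it uses only the flags shown in this post, and `build_launch_command` and its `max_ideas` limit are hypothetical, not AI Scientist options.

```python
import shlex


def build_launch_command(model, experiment, num_ideas, max_ideas=2):
    """Build the launch command, capping num_ideas at max_ideas.

    max_ideas is a hypothetical safety limit, not an AI Scientist option.
    """
    if num_ideas < 1:
        raise ValueError("num_ideas must be at least 1")
    capped = min(num_ideas, max_ideas)
    return [
        "python", "launch_scientist.py",
        "--model", model,
        "--experiment", experiment,
        "--num-ideas", str(capped),
    ]


if __name__ == "__main__":
    command = build_launch_command("claude-sonnet-4-20250514", "nanoGPT", 5)
    # Print the command for review instead of executing it.
    print(shlex.join(command))
```

Returning a list rather than a string means the command can later be passed to `subprocess.run` without shell quoting issues.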

Frequently Asked Questions

What LLMs does AI Scientist support?

AI Scientist works with Claude (Anthropic), GPT-4 (OpenAI), and Gemini (Google). You specify the model via the --model flag when launching the pipeline. The quality and style of the research output vary by model.

Are the generated papers publishable?

The papers are structured like academic manuscripts with proper sections, but they require human review for correctness, novelty claims, and scientific rigor. They are best used as research drafts or starting points for further investigation.

How much does a typical run cost in API tokens?

Cost depends on the model and number of ideas. The workflow estimates approximately 2,800 tokens per run. With 5 ideas and a large model like GPT-4, expect costs in the range of a few dollars per batch.
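
The arithmetic behind such an estimate can be sketched as below. The function is a hypothetical helper, the flat price of 10.0 USD per million tokens is a placeholder rather than any provider's real rate, and the tokens-per-run figure should be replaced with a number measured on your own runs, since it dominates the result.

```python
def estimate_batch_cost(num_ideas, tokens_per_run, usd_per_million_tokens):
    """Estimate the API cost in USD for a batch of ideas.

    Assumes every idea consumes tokens_per_run tokens at one flat price,
    which ignores input/output price differences and retries.
    """
    total_tokens = num_ideas * tokens_per_run
    return total_tokens * usd_per_million_tokens / 1_000_000


if __name__ == "__main__":
    # 10.0 USD per million tokens is a placeholder, not a real price.
    cost = estimate_batch_cost(
        num_ideas=5,
        tokens_per_run=2_800,
        usd_per_million_tokens=10.0,
    )
    print(f"Estimated cost: ${cost:.2f}")
```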

Can I use custom experiment templates?

Yes. AI Scientist uses experiment templates that define the research domain and code structure. You can create custom templates following the existing examples like nanoGPT to target your specific research area.
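
Before launching with a custom template, it can help to check that the expected files are in place. The sketch below is illustrative only: the file list is an assumption modeled on the repository's example templates and may not match the version you cloned, and `missing_template_files` and the `templates/my_template` path are hypothetical.

```python
from pathlib import Path

# Assumed file set, modeled on the repository's example templates.
ASSUMED_TEMPLATE_FILES = (
    "experiment.py",
    "plot.py",
    "prompt.json",
    "seed_ideas.json",
    "latex/template.tex",
)


def missing_template_files(template_dir, required=ASSUMED_TEMPLATE_FILES):
    """Return the required files that are absent from template_dir."""
    root = Path(template_dir)
    return [name for name in required if not (root / name).is_file()]


if __name__ == "__main__":
    # "templates/my_template" is a hypothetical path.
    missing = missing_template_files("templates/my_template")
    print("Missing files:", missing or "none")
```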

Does AI Scientist run actual experiments or just write about them?

It runs actual code. The system generates Python experiment scripts, executes them, collects metrics and plots, then writes the paper based on real results. This goes beyond text generation: it includes code execution and data analysis.
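
The execute-then-read-metrics step described above can be sketched as follows. This is an illustration, not AI Scientist's actual runner: `run_experiment` is a hypothetical helper, and the `--out_dir` argument and the `final_info.json` file name are assumptions about how an experiment script reports its results.

```python
import json
import subprocess
import sys
from pathlib import Path


def run_experiment(script_path, out_dir, timeout_seconds=3600):
    """Run an experiment script and return the metrics it wrote.

    Assumes the script accepts --out_dir and writes final_info.json there;
    both are assumptions for this sketch.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    completed = subprocess.run(
        [sys.executable, str(script_path), "--out_dir", str(out)],
        capture_output=True,
        text=True,
        timeout=timeout_seconds,
    )
    if completed.returncode != 0:
        # Surface the script's stderr so failures are easy to diagnose.
        raise RuntimeError(f"Experiment failed:\n{completed.stderr}")
    with open(out / "final_info.json", encoding="utf-8") as handle:
        return json.load(handle)
```

The timeout and the explicit return-code check matter because generated code can hang or crash, and a paper should only be written from a run that actually finished.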

Source & Thanks

Created by Sakana AI. Licensed under Apache 2.0.

AI-Scientist — ⭐ 12,000+

Thanks to Sakana AI for pushing the boundary of automated scientific discovery.
