TOKREPO · 主题包

稳定

博士研究者的文献 + 复现代码包

十件资产给真在做严肃文献综述、要把论文代码跑通的博士：Zotero、arXiv MCP、GPT Researcher、academic-researcher agent、Marker、Nougat、JupyterLab、Papermill、Overleaf、AI Scientist。文献检索 → 文献管理 → PDF 解析 → 阅读 → 复现 → 写作一条龙。

10 个资产

关于这个主题包

这个 pack 包含什么

这是给已经过了「跟 ChatGPT 聊一聊我的研究方向」阶段、进入真正硬核工作的博士生 / 博士后的工具栈：(a) 认真读 200 篇 paper，(b) 追踪谁引用了谁，(c) 真把作者放出来的代码跑通，(d) 最后写出能在答辩台上守住的东西。每个 pick 都开源、活跃维护、在 pipeline 里占一个明确的位置。

这套 pack 的态度是：拒绝假装 AI 能替你读 methodology。AI 在以下环节进入循环 — 文献分诊、PDF 清洗、初步摘要、复现代码 debug、写作起草。但博士如果不真读 methods，答辩日子不会好过。工具的排布刻意让 AI 不挡在你和论文之间，只在周围打辅助。

它们怎么协同（研究工作流）

  文献检索
  ┌────────────────────────────────────┐
  │ arXiv MCP ──► GPT Researcher       │
  │  （精准）       （广度地图）         │
  └─────────────────┬──────────────────┘
                    ▼
         ┌───────────────────┐
         │  Zotero（真相源） │  ◄── BibTeX 出到 Overleaf
         │  collection +     │
         │  附 PDF           │
         └─────────┬─────────┘
                   ▼
   PDF 解析 ┌─────────────────┐
            │ Marker（快）    │
            │ Nougat（公式）  │
            └────────┬────────┘
                     ▼ 干净 markdown
        ┌─────────────────────────┐
        │ Academic Researcher     │
        │ Claude Code agent       │ ── 摘要、引用图、空白点
        └──────────┬──────────────┘
                   ▼
         复现代码
         ┌───────────────────┐
         │ JupyterLab        │
         │   + Papermill     │ ── seed sweep、ablation
         └────────┬──────────┘
                  ▼
            写作
         ┌───────────────────┐
         │ Overleaf          │  ◄── Zotero 喂引用
         │ + AI Scientist    │      （只用初稿 — 你来写）
         └───────────────────┘

脊椎是 Zotero 作为「你读过什么」的唯一真相源。上游所有东西喂 Zotero，下游所有东西从 Zotero 读。没这个纪律，整条 pipeline 会腐化成 4000 个浏览器 tab 加一本你自己都复现不了的论文。

你会遇到的取舍

AI 摘要 vs 真读 — 这套 pack 最大的风险。GPT Researcher 和 Academic Researcher agent 能在 30 秒内把一篇论文摘要完。这个摘要够你判断要不要读这篇论文，但当作 methodology 的替代品就极其危险。硬规矩：博士论文里引用的 paper，methods 部分必须自己读完。AI 只用于分诊，不用于「凭感觉引用」。
复现的天花板 — Papermill + JupyterLab 让你能干净跑作者放出来的代码，但很多论文放的代码早就跑不动了（依赖死了、weights 丢了、CUDA 版本不对）。给环境考古留时间。一切用 conda env export 锁死。如果某篇论文的 claim 在 rerun 时崩了，这值得在论文里写一条 footnote。
Marker vs Nougat — Marker 更快、表格处理得好；Nougat 更慢但能真正解析 LaTeX 公式。先跑 Marker；公式才是重点时再上 Nougat。
自托管 Overleaf vs SaaS 版 — SaaS 版方便，但你的草稿在别人机器上。自托管在学校集群（或者就一个 Docker 容器）是未发表工作的正确选择。代价是一个下午的搭建。
AI Scientist 当工具不当目标 — 用 AI 端到端生成论文学术上和伦理上都很危险。把它当作「AI 能到什么程度」的参考架构、当作 ablation 表格的初稿生成器 — 永远不当作绕过真正科学贡献的捷径。

常见踩坑

过度信任 AI 对 methodology 的摘要 — 摘要必然压缩；methodology 的细节（loss 公式、正则化、数据切分）正是被压缩掉的东西。审稿人偏偏就问这些被砍掉的细节。读 methods。
Zotero PDF 散在各设备 — 第一天就打开 WebDAV / 自己的同步目标。读博第 3 年发现一半带批注的 PDF 只在一台死了的笔记本上 — 经典博士恐怖故事。
只看 notebook 就说复现 — 论文的 figure_3.ipynb 可能端到端能跑，但跳过了真正的训练。先看 notebook 干了什么，再宣布「复现成功」。
只在 arXiv 找文献 — arXiv 快，但偏向 ML / 物理 / 数学。生物、社科、人文大部分文献住在期刊里、只能通过学校权限读。arXiv MCP 用于 arXiv 覆盖的部分，别当万能源。
BibTeX 条目重复 — Zotero 不阻止你在 arXiv 版和期刊版都点连接器，结果同一篇 paper 两条略不同的 metadata 进库。每章交接前跑一次去重。

安装 · 一行命令

$ tokrepo install pack/phd-researcher-lit-code

丢给 agent，或粘到终端

包内含什么

10 个资产打包就绪

Skill#01

Zotero — Free Research Source Manager and Citation Tool

Zotero is a free, open-source reference management tool that helps you collect, organize, annotate, cite, and share research sources. Available on Windows, macOS, Linux, and iOS, it supports one-click saving from browsers and generates citations in thousands of styles.

by AI Open Source·211 views

$ tokrepo install zotero-free-research-source-manager-citation-tool-74dca4cd

MCP#02

arXiv MCP Server — Search and Analyze Papers

arxiv-mcp-server is an MCP server for searching and analyzing arXiv papers, with uvx/uv tool stdio launch examples for reproducible research workflows.

by MCP Hub·174 views

$ tokrepo install arxiv-mcp-server-search-and-analyze-papers

Skill#03

GPT Researcher — Autonomous Research Report Agent

AI agent that generates detailed research reports from a single query. Searches multiple sources, synthesizes findings, and cites references.

by TokRepo精选·3722 views

$ tokrepo install gpt-researcher-autonomous-research-report-agent-23330210

Skill#04

Claude Code Agent: Academic Researcher

Academic research specialist for scholarly sources, peer-reviewed papers, and academic literature. Use PROACTIVELY for research paper analysis, literature reviews, citation...

by TokRepo精选·146 views

$ tokrepo install claude-code-agent-academic-researcher-ed4529f4

Skill#05

Marker — Convert PDF to Markdown with High Accuracy

Fast, accurate PDF to Markdown + JSON converter. Handles tables, images, equations, code blocks, and multi-column layouts. GPU-accelerated. 33K+ GitHub stars.

by Script Depot·272 views

$ tokrepo install marker-convert-pdf-markdown-high-accuracy-42976daf

Skill#06

Nougat — Neural Optical Understanding for Academic Documents

Nougat is a visual transformer model from Meta that converts academic PDF pages into structured Markdown, accurately preserving mathematical equations, tables, and text formatting.

by AI Open Source·166 views

$ tokrepo install nougat-neural-optical-understanding-academic-documents-ed1264b8

Skill#07

JupyterLab — Next-Generation Interactive Development Environment

The extensible web-based IDE for notebooks, code, and data from Project Jupyter, succeeding the classic Jupyter Notebook interface.

by AI Open Source·204 views

$ tokrepo install jupyterlab-next-generation-interactive-development-4de315f7

Skill#08

Papermill — Parameterize and Execute Jupyter Notebooks

Papermill is a Python tool for parameterizing, executing, and analyzing Jupyter notebooks programmatically, enabling notebook-based pipelines and report generation.

by AI Open Source·266 views

$ tokrepo install papermill-parameterize-execute-jupyter-notebooks-4be4a73a

Skill#09

Overleaf — Self-Hosted Collaborative LaTeX Editor

Overleaf is an open-source web-based LaTeX editor that enables real-time collaborative document editing. Self-host it with Docker to keep your academic papers and technical documents on your own infrastructure.

by Script Depot·168 views

$ tokrepo install overleaf-self-hosted-collaborative-latex-editor-8d4b8be6

Prompt#10

AI Scientist — Automated Research Paper Generation

Fully automated AI system that conducts research, runs experiments, and writes complete scientific papers. Generates novel ideas, implements them, and produces LaTeX manuscripts. 12,000+ stars.

by Prompt Lab·293 views

$ tokrepo install ai-scientist-automated-research-paper-generation-0a2623ca

常见问题

博士刚开始，真有必要第一天就装这 10 个吗？

不用 — 第一周装 Zotero、JupyterLab、Overleaf，这三件会变成肌肉记忆，迁移成本会复利。第二个月、找准子领域之后再加 arXiv MCP 和 academic-researcher agent。Marker、Nougat、Papermill、AI Scientist 等你撞上每个工具对应的具体问题再装 — 别给还不存在的问题预装解决方案。

AI agent 能替我做文献综述吗？

完全没法以「能扛过博士答辩」的程度做。GPT Researcher 和 academic-researcher agent 给陌生领域画第一张地图非常厉害 — 大致是大三本科生文献综述的质量。用它找出经典论文、识别主要流派，然后自己读那些论文。把 AI 生成的综述当作文献综述章节交上去，在大多数大学算抄袭，在所有大学都是学术自杀。

Marker 还是 Nougat — PDF 转文本工具先装哪个？

先装 Marker。它更快、表格图处理得好、90% 的论文都覆盖得过关。等你开始啃公式密集的理论论文，再加 Nougat — Nougat 专门在学术文档上训练，LaTeX 公式保留得好得多。两个都跑、按 paper 挑也行；存储和计算便宜，丢失公式不便宜。

我在 50 篇不同论文之间跑 notebook，怎么保证博士工作可复现？

三条规矩。(1) 每个复现住自己的目录，自己的 environment.yml 或 requirements.txt 锁死精确版本。(2) 用 Papermill 通过参数调 notebook 而不是原地改 — 源 notebook 保持干净，运行记录可审计。(3) 把执行后的 notebook + 输出和输入参数一起保存，两年后还能证明你跑过什么。conda 环境 + git + 一个装 Papermill 输出的 RUNS/ 目录解决 95% 的复现痛点。

用 AI Scientist 或 Claude 帮写博士论文符合学术伦理吗？

完全取决于你学校的政策和你诚实披露的程度。2026 年常见共识：AI 用于列大纲、改语法、压力测试想法、生成你之后大改的初稿是 OK 的 — 跟写作辅导老师做的事一样。AI 不 OK 的是：生成原创分析、伪造引用、生成你不改一字就交的段落。拿不准就在 methods 里披露。博士的意义是你能为每一句话辩护；如果你没法为 AI 写的一段话辩护，就别留。

更多主题包

12 个主题包 · 80+ 精选资产

回首页浏览全部精选合集

返回主题包总览