Scripts · Mar 29, 2026 · 1 min read

Firecrawl — Web Scraping API for LLMs

Turn any website into LLM-ready markdown. API-first web scraping with JavaScript rendering, auto-pagination, structured extraction, and batch crawling.

TokRepo Featured · Community
Quick Use

Use it first, then decide how deep to go

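# Shell: install the Python SDK
pip install firecrawl-py

# Python: scrape one page and print it as markdown
# (call signature as given here; other SDK releases may differ)
from firecrawl import FirecrawlApp

app = FirecrawlApp(api_key="fc-...")  # replace with your own API key
result = app.scrape_url("https://example.com", params={"formats": ["markdown"]})
print(result["markdown"])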

Intro

Firecrawl is an API that converts web pages into clean markdown optimized for LLMs. It handles JavaScript rendering, anti-bot measures, pagination, and sitemaps automatically.

Best for: RAG data ingestion, web research, content monitoring, competitive analysis
Works with: LangChain, LlamaIndex, any LLM pipeline
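
For the RAG ingestion use case, scraped markdown is usually split into smaller chunks before embedding. The sketch below is one simple way to do that, assuming the markdown is already in hand as a string. `split_markdown_by_heading` and its `max_chars` parameter are hypothetical names for illustration and are not part of the Firecrawl SDK.

```python
import re
from typing import Dict, List


def split_markdown_by_heading(markdown: str, max_chars: int = 1000) -> List[Dict[str, str]]:
    """Split markdown into chunks, starting a new chunk at each heading.

    Hypothetical helper for illustration; not part of the Firecrawl SDK.
    Caveat: lines starting with '#' inside code blocks are treated as headings.
    """
    chunks: List[Dict[str, str]] = []
    title = ""              # heading text of the section being collected
    buffer: List[str] = []  # body lines of the section being collected

    def flush() -> None:
        text = "\n".join(buffer).strip()
        # Cut oversized sections into fixed-size character windows.
        for start in range(0, len(text), max_chars):
            chunks.append({"title": title, "text": text[start:start + max_chars]})
        buffer.clear()

    for line in markdown.splitlines():
        match = re.match(r"^(#{1,6})\s+(.*)$", line)
        if match:
            flush()  # close the previous section under its own title
            title = match.group(2).strip()
        else:
            buffer.append(line)
    flush()  # close the final section
    return chunks
```

Each chunk keeps its section title, which can be stored as metadata next to the embedding. A production pipeline would likely split on sentence or token boundaries rather than raw character offsets.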


Features

  • Scrape — Single URL to markdown/HTML/structured data
  • Crawl — Entire site crawling with depth control
  • Map — Get all URLs from a website
  • Extract — Schema-based structured data extraction via LLM
  • Batch — Process thousands of URLs concurrently
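
To make the idea of processing many URLs concurrently concrete, here is a minimal client-side sketch using a thread pool. It is not Firecrawl's own batch feature: `scrape_many` is a hypothetical helper, and `scrape_one` stands for any function you supply that takes a URL and returns its content (for example, a thin wrapper around the scrape call shown under Quick Use).

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
from typing import Callable, Dict, Iterable, Tuple


def scrape_many(
    urls: Iterable[str],
    scrape_one: Callable[[str], str],
    max_workers: int = 8,
) -> Tuple[Dict[str, str], Dict[str, str]]:
    """Run scrape_one over urls in parallel threads.

    Hypothetical helper for illustration; not part of the Firecrawl SDK.
    Returns (results, errors), both keyed by URL.
    """
    results: Dict[str, str] = {}
    errors: Dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Map each future back to its URL so failures can be attributed.
        futures = {pool.submit(scrape_one, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:  # one failed URL should not stop the rest
                errors[url] = str(exc)
    return results, errors
```

Collecting errors per URL instead of raising lets a large run finish and be retried selectively. Keep `max_workers` modest so the fan-out stays within whatever rate limits apply to your account.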

Self-Hosted

git clone https://github.com/mendableai/firecrawl.git
cd firecrawl
docker compose up

Source & Thanks

Created by Mendable. Licensed under AGPL-3.0. mendableai/firecrawl — 25K+ GitHub stars
