Key Features
- Markdown output — Clean, LLM-ready text extraction
- JavaScript rendering — Handles SPAs and dynamic content
- Structured extraction — CSS selectors, schema-based extraction
- Chunking strategies — Topic-based, fixed-size, or semantic chunking
- Media extraction — Images, links, metadata
- Rate limiting — Built-in politeness and throttling
- Async — Fast concurrent crawling
FAQ
Q: What is Crawl4AI? A: Open-source web crawler optimized for AI and LLM use cases. Extracts clean markdown, handles JavaScript-rendered pages, and supports structured data extraction.
Q: How do I install Crawl4AI? A: Check the Quick Use section above for step-by-step installation instructions. Most assets can be set up in under 2 minutes.