Scripts2026年4月7日·1 分钟阅读

Crawlee — Web Scraping and Browser Automation Library

Build reliable web scrapers in Node.js or Python. Crawlee handles proxy rotation, browser fingerprints, auto-scaling, and anti-bot bypassing out of the box.

What is Crawlee?

Crawlee is a Node.js/Python web scraping library that automatically handles proxy rotation, browser fingerprinting, retries, auto-scaling, and data storage.

In one sentence: Crawlee is a web scraping library for Node.js and Python with built-in proxy rotation, anti-detection, and auto-scaling.

Core Features

1. Multiple Crawler Types

HTTP crawlers (fast) and browser crawlers (JS rendering).

2. Anti-Detection

Built-in browser fingerprint randomization and session management.

3. Proxy Rotation

Automatic per-request proxy rotation.

4. Auto-Scaling

Adjusts concurrency based on system resources and target site response.

5. Built-In Storage

Structured datasets, key-value stores, and request queues.

FAQ

Q: How does it compare to Scrapy? A: Crawlee has native browser support, built-in anti-detection, and works in both JS and Python. Scrapy is Python-only and primarily HTTP-based.

🙏

来源与感谢

讨论

登录后参与讨论。
还没有评论,来写第一条吧。

相关资产