ScriptsMay 24, 2026·2 min read

Agent-Reach — Internet Access Toolkit for AI Agents

A CLI tool that gives AI agents the ability to read and search Twitter, Reddit, YouTube, GitHub, Bilibili, and more with zero API fees.

Agent ready

Review-first install path

This asset needs a review step. The copied prompt tells the agent to dry-run, show the writes, then proceed only after confirmation.

Needs Confirmation · 66/100Policy: confirm
Agent surface
Any MCP/CLI agent
Kind
Skill
Install
Single
Trust
Trust: Established
Entrypoint
Agent-Reach Overview
Review-first command
npx -y tokrepo@latest install 9d9051f3-5727-11f1-9bc6-00163e2b0d79 --target codex

Dry-run first, confirm the writes, then run this command.

Introduction

Agent-Reach gives AI agents structured access to public internet content across major platforms. Instead of relying on expensive search APIs, it scrapes and parses content from Twitter, Reddit, YouTube, GitHub, and other platforms into LLM-friendly formats at no API cost.

What Agent-Reach Does

  • Searches and reads content from Twitter, Reddit, YouTube, and GitHub
  • Supports Bilibili and XiaoHongShu for Chinese-language content
  • Converts web content into structured, LLM-ready text
  • Integrates as an MCP server for coding agents
  • Works as a standalone CLI or Python library

Architecture Overview

Agent-Reach uses a modular scraper architecture with platform-specific adapters. Each adapter handles authentication, rate limiting, and content extraction for its target platform. Results are normalized into a common schema with title, content, metadata, and source URL, then formatted for LLM consumption with appropriate context windowing.

Self-Hosting & Configuration

  • Install via pip with optional browser automation dependencies
  • Configure platform-specific settings in a YAML config file
  • Set up proxy rotation for high-volume scraping
  • Deploy as an MCP server for integration with coding agents
  • Run headless with Playwright for JavaScript-heavy platforms

Key Features

  • Zero API fees: direct content extraction without paid API subscriptions
  • Multi-platform support covering major social and developer platforms
  • LLM-optimized output formatting with metadata preservation
  • MCP server mode for seamless agent integration
  • Built-in rate limiting and retry logic to avoid blocks

Comparison with Similar Tools

  • Firecrawl — general web scraping API; Agent-Reach is platform-specialized and free
  • Crawl4AI — focuses on general crawling; Agent-Reach targets social platforms
  • Jina Reader — URL-to-text conversion; Agent-Reach adds search and multi-platform support
  • ScrapeGraphAI — AI-driven scraping; Agent-Reach provides structured platform adapters

FAQ

Q: Is scraping these platforms legal? A: Agent-Reach accesses publicly available content. Users are responsible for complying with each platform's terms of service and local regulations.

Q: Do I need API keys for the platforms? A: No. Agent-Reach extracts public content directly without requiring paid API access.

Q: Can I use this with Claude Code or other agents? A: Yes. It includes an MCP server mode that integrates directly with MCP-compatible coding agents.

Q: How does it handle rate limiting? A: Built-in adaptive rate limiting and retry logic with configurable delays per platform.

Sources

Discussion

Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.

Related Assets