Releases: BrowserCash/teracrawl
Releases · BrowserCash/teracrawl
Release v1.0.0
Description
Teracrawl v1.0.0 introduces a high-performance, AI-first web scraping API that converts any website into clean, LLM-ready Markdown. Achieving #1 coverage (84.2%) on the scrape-evals benchmark, it combines smart two-phase crawling with robust session management to handle complex JavaScript-heavy sites at scale.
Key Features
- LLM-Ready Output: Automatically extracts main content and converts HTML to semantic Markdown, stripping away ads, navigation, and clutter.
- Smart Two-Phase Crawling:
- Fast Mode: Rapidly scrapes static content using resource blocking and reused contexts.
- Dynamic Mode: Intelligently falls back to full rendering for complex SPAs and hydration-heavy sites.
- Search + Scrape Pipeline: New /crawl endpoint queries Google (via browser-serp) and scrapes the top N results in parallel, providing a deep search capability for AI agents.
- High Concurrency: Built on @browsercash/pool to manage multiple browser sessions and tabs simultaneously, ensuring high throughput.
- Top-Tier Reliability: Ranked #1 in success rate and content quality on the scrape-evals benchmark against 14 major providers.
- Safety First: Includes built-in blocking for trackers, analytics, and ads to improve speed and privacy.