Skip to content

Releases: BrowserCash/teracrawl

Release v1.0.0

03 Dec 02:09

Choose a tag to compare

Description

Teracrawl v1.0.0 introduces a high-performance, AI-first web scraping API that converts any website into clean, LLM-ready Markdown. Achieving #1 coverage (84.2%) on the scrape-evals benchmark, it combines smart two-phase crawling with robust session management to handle complex JavaScript-heavy sites at scale.

Key Features

  • LLM-Ready Output: Automatically extracts main content and converts HTML to semantic Markdown, stripping away ads, navigation, and clutter.
  • Smart Two-Phase Crawling:
    • Fast Mode: Rapidly scrapes static content using resource blocking and reused contexts.
    • Dynamic Mode: Intelligently falls back to full rendering for complex SPAs and hydration-heavy sites.
  • Search + Scrape Pipeline: New /crawl endpoint queries Google (via browser-serp) and scrapes the top N results in parallel, providing a deep search capability for AI agents.
  • High Concurrency: Built on @browsercash/pool to manage multiple browser sessions and tabs simultaneously, ensuring high throughput.
  • Top-Tier Reliability: Ranked #1 in success rate and content quality on the scrape-evals benchmark against 14 major providers.
  • Safety First: Includes built-in blocking for trackers, analytics, and ads to improve speed and privacy.