Skip to content

Conversation

@foxt451
Copy link
Collaborator

@foxt451 foxt451 commented Nov 14, 2025

This was just copied from cheerio-scraper, turned into an http-scraper and then specialized into a sitemap scraper: pageFunction is removed, as well as most of the inputs, at least from the input schema. Tested locally - for now will just push dataset items with a url and status code for each page.

Closes apify/apify-sdk-js#486. For now doesn't handle missing sitemaps and expects explicit URLs to them

@foxt451 foxt451 marked this pull request as draft November 27, 2025 13:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Actor to check web page availability

1 participant