I want to scan a whole site, but only get ~200 results #84

mgifford · 2023-05-10T13:16:27Z

Details

I've got a few big sites I'd like to scan, but it keeps stopping about 200 or so pages in.

Is there a way to override that? It's still a good measure, but would be useful to be able to scan the whole site if needed.

Maybe I'm just missing something in the config.

tuminzee · 2023-05-10T15:04:38Z

can you try to explain this with example? I am not able to understand
unlighthouse is made so that we can generate reports on all routes

mgifford · 2023-05-10T15:34:48Z

So I run this:

npx unlighthouse --site https://example.com/eng/index.html

All churns on as you'd expect until I get:

✔ Completed runLighthouseTask for /eng/declaration/wwl-cna/c15/index.html. [Score: 0.94 Samples: 1 100% complete]
✔ Unlighthouse has finished scanning https://example.com/eng/index.html/: 200 routes in 431s.

That's for basically any site I craw.

So I get 200 or so scans but the site is much bigger.

Relates to #84

harlan-zw · 2023-05-10T16:33:58Z

There is config for the maximum number of routes to scan scanner.routeRules that is set to 200 by default.

This was implemented as the stability of the worker and the UI starts degrading around here and it's quite easy for a site scan to end up queueing thousands of routes.

I've pushed up a warning that will be triggered when you hit the limit to give better visibility, it will be available in v0.6.0 which will be released soon.

You can read more about how the large sites are handled on this page.

mgifford · 2024-07-05T14:06:31Z

So @harlan-zw this should work to scan 500 URLs vs the default 200? It would also take 2 samples rather than just one.

Assuming this is in the directory where you execute the script: unlighthouse.config.ts

export default {
scanner: {
// run lighthouse for each URL 2 times
samples: 2,
// increase the maximum number of routes - https://unlighthouse.dev/api/config#scannermaxroutes
maxRoutes: 500,
},
debug: true,
}

harlan-zw added a commit that referenced this issue May 10, 2023

fix(core): warn when scanner.maxRoutes is exceeded

ad83ff6

Relates to #84

harlan-zw added the workaround-available label May 14, 2023

harlan-zw closed this as not planned Won't fix, can't repro, duplicate, stale Mar 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I want to scan a whole site, but only get ~200 results #84

I want to scan a whole site, but only get ~200 results #84

mgifford commented May 10, 2023

tuminzee commented May 10, 2023

mgifford commented May 10, 2023 •

edited

Loading

harlan-zw commented May 10, 2023 •

edited

Loading

mgifford commented Jul 5, 2024 •

edited

Loading

I want to scan a whole site, but only get ~200 results #84

I want to scan a whole site, but only get ~200 results #84

Comments

mgifford commented May 10, 2023

Details

tuminzee commented May 10, 2023

mgifford commented May 10, 2023 • edited Loading

harlan-zw commented May 10, 2023 • edited Loading

mgifford commented Jul 5, 2024 • edited Loading

mgifford commented May 10, 2023 •

edited

Loading

harlan-zw commented May 10, 2023 •

edited

Loading

mgifford commented Jul 5, 2024 •

edited

Loading