You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When i set: crawler.queue({skipDuplicates:true}), (which is supposed to be true by default.)
According to the doc: " The request is considered to be the same if URL, userAgent, device, and extraHeaders are strictly the same.",
is there a way to deduplicate based only on the URL?
I only need to export URLs of a certain domain.
The text was updated successfully, but these errors were encountered:
When i set: crawler.queue({skipDuplicates:true}), (which is supposed to be true by default.)
According to the doc: " The request is considered to be the same if URL, userAgent, device, and extraHeaders are strictly the same.",
is there a way to deduplicate based only on the URL?
I only need to export URLs of a certain domain.
The text was updated successfully, but these errors were encountered: