This repository has been archived by the owner on Jul 5, 2019. It is now read-only.
I guess there are two things to check for in robots.txt.
1: The User-agent line, and whether it names our crawler specifically or uses the wildcard *.
2: Build an array of the parts of the site that must not be followed, and check each link the crawler wants to follow against this array.
-f --followrobotstxt <yes/no>: whether you want your fetcher to play nice (obey robots.txt) or not
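
The two checks could be sketched roughly like this in Python. This is only an illustration of the idea, not the project's code: the names `disallowed_paths`, `may_follow`, `follow_robots_txt` and the `examplebot` agent are all hypothetical. It assumes exact, case-insensitive matching of the user agent and plain prefix matching of paths, and it ignores `Allow` rules and wildcard patterns.

```python
from urllib.parse import urlparse


def disallowed_paths(robots_txt, user_agent):
    """Step 1: pick the group for our user agent, falling back to '*'.

    Returns the Disallow prefixes of that group (hypothetical helper).
    """
    name = user_agent.lower()
    specific, wildcard = [], []
    saw_specific = False
    agents, in_rules = [], False
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a User-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value.lower())
            if value.lower() == name:
                saw_specific = True
        elif field in ("allow", "disallow"):
            in_rules = True
            # Allow lines are ignored here; an empty Disallow blocks nothing.
            if field == "disallow" and value:
                if name in agents:
                    specific.append(value)
                elif "*" in agents:
                    wildcard.append(value)
    # A group naming our agent replaces the '*' group entirely.
    return specific if saw_specific else wildcard


def may_follow(url, blocked, follow_robots_txt=True):
    """Step 2: check one link against the array of blocked prefixes."""
    if not follow_robots_txt:  # mirrors the proposed --followrobotstxt no
        return True
    path = urlparse(url).path or "/"
    return not any(path.startswith(prefix) for prefix in blocked)


ROBOTS = """\
User-agent: *
Disallow: /private/

User-agent: examplebot
Disallow: /tmp/
Disallow: /search
"""

blocked = disallowed_paths(ROBOTS, "ExampleBot")
print(blocked)
print(may_follow("http://example.com/tmp/x", blocked))
print(may_follow("http://example.com/tmp/x", blocked, follow_robots_txt=False))
```

If hand-rolling the parser is not wanted, Python's standard library `urllib.robotparser.RobotFileParser` covers the same ground through its `parse()` and `can_fetch()` methods.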