-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Received HTTP code 403 when trying to fetch a site using Cloudflare #316
Comments
Same error trying to add Coinbase documents with following declaration {
"name": "Coinbase",
"documents": {
"Privacy Policy": {
"fetch": "https://www.coinbase.com/legal/privacy",
"select": [".ComposePageLayout__ContentWrapper-sc-109zw5h-2"],
"remove": [".SidebarNav__NavigationLinksList-sc-1c3jy97-1"]
},
"Trackers Policy": {
"fetch": "https://www.coinbase.com/legal/cookie",
"select": [".ComposePageLayout__ContentWrapper-sc-109zw5h-2"],
"remove": [".SidebarNav__NavigationLinksList-sc-1c3jy97-1"]
},
"Terms of Service": {
"fetch": "https://www.coinbase.com/legal/user_agreement/ireland_europe",
"select": [".ComposePageLayout__ContentWrapper-sc-109zw5h-2"],
"remove": [".SidebarNav__NavigationLinksList-sc-1c3jy97-1"]
}
}
}
|
This is mainly because those sites are using a service like cloudflare to check their traffic Our attempt to scrape is evaluated as a bot and thus is blocked by a 403. I tried the following all these with no success
So I suggest for now that you use "executeClientScripts" In the meantime, I've send a ticket request to Cloudflare through my personnal premium account. Let's see what they say
|
And here is the response of cloudflare
Online application: https://docs.google.com/forms/d/e/1FAIpQLSdqYNuULEypMnp4i5pROSc-uP6x65Xub9svD27mb8JChA_-XA/viewform @trujilloelsa @clementbiron @MattiSG I believe we should apply, what about you ? |
Yes ✔️ |
As we have not had any answer in 40 days, I created a new topic on Cloudflare community https://community.cloudflare.com/t/cloudflare-bot-verification-submitted-but-no-answer/320260 |
I'm not sure this is a Cloudflare protection but running
with the following declaration
|
Same for
|
Same for this declaration OpenTermsArchive/france-declarations@a0e6b46 |
I'm not sure it's about Cloudflare protection, but the following declarations return a 403 error:
|
We do not actively work on #166 at the moment. We will reopen it when we prioritise this work again. In the meantime, feel free to add any additional relevant information specific to Cloudflare to this issue. |
Trying to add Roblox service and documents with the following declaration
I get this node error messages
Error: The document cannot be accessed or its content can not be selected: Received HTTP code 403 when trying to fetch 'https://en.help.roblox.com/hc/en-us/articles/115004630823-Roblox-Privacy-and-Cookie-Policy-'
Error: The document cannot be accessed or its content can not be selected: Received HTTP code 403 when trying to fetch 'https://en.help.roblox.com/hc/en-us/articles/115004647846-Roblox-Terms-of-Use'
Error: The document cannot be accessed or its content can not be selected: Received HTTP code 403 when trying to fetch 'https://en.help.roblox.com/hc/en-us/articles/203313410-Roblox-Community-Rules'
The text was updated successfully, but these errors were encountered: