Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use Splash on my side? #3

Open
amarynets opened this issue Oct 13, 2017 · 4 comments
Open

How to use Splash on my side? #3

amarynets opened this issue Oct 13, 2017 · 4 comments

Comments

@amarynets
Copy link

I want to use my own Splash instance on my server and Crawlera service(I use C10).
I try this example and a response is Website crawl ban its mean that Crawlera doesn't deal with it. But if I just use only Crawlera all works well.
Also, I tried this, just deleted request:set_header("X-Crawlera-UA", "desktop").
The result is the same. Is something change in Crawlera API? Or I should use C50 plan or smt bad in my code?
Thanks!

@duhanyul
Copy link

I am trying to do a similar thing as well and i keep getting response codes 416 i was wondering if you ever resolved the issue?

@apalala
Copy link

apalala commented Jan 18, 2018

@duhanyul 416 may be coming from https://www.distilnetworks.com/

@duhanyul
Copy link

@apalala I made ROBOTSTXT_OBEY = False and i dont get the response code anymore so i guess there is something blocking it

@alexf-a
Copy link

alexf-a commented Dec 20, 2018

Not sure if you're still having this issue, but the C50 plan solved it for me. Some websites have defense mechanisms that the C10 won't get around.

For some reason I now only get this problem when I run javascript from the Lua main method. Not sure if there is a proper way to include Javascript with an execute endpoint and the Lua Crawlera script...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants