-
Notifications
You must be signed in to change notification settings - Fork 52
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
instagram scraping for likes broke #840
Comments
shit. this graphql query url requires a login cookie. it 403s if you don't include one. this may spell the death of backfeeding instagram likes. shit. |
example from the one other project i've found that implemented this: ping/instagram_private_api@78294a8 (thank you @ping!) |
current tentative plan: do all scraping with a login cookie from a throwaway account. IG does already rate limit our scraping, but i believe that's automatic, not us specifically. i expect bridgy is still small enough (5k users, <1k instagram) that IG employees haven't noticed us in particular yet. |
it's alive. god help me, it's alive. now we wait to see how long it takes to get blocked. 😕 |
first reported by @thorkon yesterday. (thank you!) started sometime in the last few weeks. looks like instagram likes now need an extra HTTP fetch to scrape.
eg if you fetch https://www.instagram.com/p/BmbU2qSFXrR/ , the embedded JSON media object has:
that
edges
field used to have the individual likes, but now it's empty. if you click on the 9 likes link on the instagram UI, it GETs this URL:which returns:
i can work with that.
also note that the daily
instagram_live_test.py
run didn't catch this because it wasn't checking for likes. sigh. i'll fix that.The text was updated successfully, but these errors were encountered: