-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New Legifrance (2020) layout #11
base: master
Are you sure you want to change the base?
Conversation
I’ve not been able to find them on Legifrance
NB: inclut les codes abrogés ?
I have also not met any anti-scraping restrictions. Could simply be due to a low rate of requests on my behalf. |
Also rename new_page_url to page_url
Might not be worth the effort
Les builds python 3.6 et 2.7 (sic) et pypy3 passent sauf Les autres builds ont des problèmes de dépendances (PyYAML refuse python 3.4 par exemple, url-normalize refuse python 3.5, ey pypy n’arrive pas à compiler cryptography) |
La branche master de Cimbali/legipy utilise selenium comme transport adapter pour éviter les restrictions anti-scraping, et un cache pour éviter de trop surcharger les serveurs legifrance (sans changer l’interface requests). Additionnellement,
|
It’s probably not perfect because I couldn’t test it exhaustively, but all commands for me now.
Also adds a caching CLI option.