Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

segmentation fault with tlfp-parse-many #122

Open
olivierlacan opened this issue Jan 12, 2019 · 1 comment
Open

segmentation fault with tlfp-parse-many #122

olivierlacan opened this issue Jan 12, 2019 · 1 comment
Labels

Comments

@olivierlacan
Copy link

With pypy3.5-6.0.0 the following command segfaults after a few minutes:

senapy-cli doslegs_urls --min-year=2019 | tlfp-parse-many data/

Last few lines before the segfault:

https://www.senat.fr/dossier-legislatif/pjl16-537.html
 -= DOSLEG URL: https://www.senat.fr/dossier-legislatif/pjl16-537.html =-
  [] download SENAT version
  [] parse SENAT version
  [] download AN version
  [] parse AN version
        title: projet de loi ordonnances relatives à la modification des règles applicables à l'évaluation environnementale des projets et portant réforme des procédures destinées à assurer l'information du public à l'élaboration de décisions ayant une incidence sur l'environnement
  [] parse the texts
** parsing texts
    ^ text:  http://www.assemblee-nationale.fr/15/projets/pl0011.asp
    ^ text:  http://www.assemblee-nationale.fr/15/ta-commission/r0091-a0.asp
    ^ text:  http://www.assemblee-nationale.fr/15/ta/ta0004.asp
    ^ text:  https://www.senat.fr/leg/pjl16-666.html
    ^ text:  https://www.senat.fr/leg/pjl17-009.html
    ^ text:  https://www.senat.fr/leg/tas17-001.html
    ^ text:  https://www.senat.fr/leg/pjl17-199.html
Segmentation fault: 11

Since it's parsing HTML this could very well be a lower-level lxml or Python bug but I thought you'd want to know.

@olivierlacan olivierlacan changed the title segmentation fault segmentation fault with tlfp-parse-many Jan 12, 2019
@mdamien
Copy link
Member

mdamien commented Jan 12, 2019

Sorry but I can't reproduce it on my Ubuntu machine, looks like it's a MacOS-specific segfault, maybe you can also try without PyPy. Also I'm transfering the issue to the main repo since we don't yet where the error is coming from.

@mdamien mdamien transferred this issue from regardscitoyens/lawfactory_utils Jan 12, 2019
@mdamien mdamien added the bug label Jan 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants