Segmentation fault in spaCy 2.0.5 / python 3.5 #1757

Closed
william-dowling opened this issue Dec 21, 2017 · 6 comments
Labels
bug Bugs and behaviour differing from documentation

Comments

william-dowling commented Dec 21, 2017

spaCy 2.0.5 is crashing with a core dump. No core dump from this code was seen with spaCy 1.6, 1.7.5, or 1.8.2. The crash happens whether or not I am running under the debugger.

Here is the complete code that causes the core dump:

#!/usr/bin/env python3

import os
import spacy

language = 'en'

print("Loading Language Model for '%s'..." % language)
nlp = spacy.load(language)
print("Language Model for '%s' loaded." % language)


doc = nlp('Inhalers can be used to treat persistent recurrent asthma')

c = doc[0]
p = None
if c.head != c and c.head != p:
    print('OK')

I see that if I replace "!=" with "is not" then the core dump does not happen.
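
For reference, this is the variant that does not crash for me (a minimal sketch of the workaround, reusing the doc object built above; only the comparisons are changed to identity checks):

# Identity comparisons never reach the Cython rich comparison with None
c = doc[0]
p = None
if c.head is not c and c.head is not p:
    print('OK')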

Info about spaCy

  • Python version: 3.5.2
  • spaCy version: 2.0.5
  • Models: en, en_core_web_md
  • Platform: Darwin-15.6.0-x86_64-i386-64bit
@ines added the bug label on Jan 3, 2018
@honnibal (Member)

Thanks! Passing None into Cython can sometimes cause problems if not caught.
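
Until that's caught inside the library, a call-site guard along these lines should keep None out of the Cython comparison (just a sketch of the workaround pattern, reusing doc from the snippet above; not the actual fix in spaCy):

c = doc[0]
p = None
# If p is None it can never equal a Token, so skip the != call entirely
if c.head != c and (p is None or c.head != p):
    print('OK')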

@fucking-signup

@honnibal I don't think this has solved the issue, sadly.
I am using the same platform (Darwin-17.3.0-x86_64-i386-64bit, i.e. macOS), and the newly added test against the segmentation fault is itself causing a segmentation fault.

Also could be related:
I am getting a segmentation fault when training new models at random. It could happen at any time or not happen at all. The cause is always update method in language.py. And it doesn't matter if the code itself is from examples of how to train a model or from cli/train.py. However, it seems like frequency of segmentation fault increases when there are more training examples (500+).

@apierleoni

Same problem here:
spaCy 2.0.5, macOS, Python 3.6.4.
I'm trying to learn new NER entities from 5000 examples, using update on the en_core_web_lg model.
It fails randomly with a segfault, usually after the second iteration (sometimes making it to the 8th).
Might be related to this (not so) old issue: #1335

nikeqiang commented Feb 1, 2018

I'm experiencing a similar problem training the NER on anything but a very small set of examples. Training on anything over 1000 examples throws the following error. Is this a memory error?

Process finished with exit code 139 (interrupted by signal 11: SIGSEGV)

Info about spaCy
Python version: 3.6.3
spaCy version: 2.0.5
Models: en, en_core_web_sm
Platform: MacOS

I note that I got the same error when trying to train using each of (a) the Prodigy ner.batch-train recipe and (b) the regular spacy train_ner.py script.

Example error messages when running Prodigy:

line 1: 41665 Segmentation fault: 11 python -m prodigy "$@"
line 1: 49673 Segmentation fault: 11 python -m prodigy "$@"

adidier17 commented Feb 9, 2018

I'm also experiencing the same issue when training the English NER model. When training on about 100 examples there were no problems, but with 500+ I also get the error: "Segmentation fault: 11"

Environment

  • Operating System: OS Sierra 10.12.6
  • Python Version Used: 3.6.4
  • spaCy Version Used: 2.0.7
  • Models: en version 2.0.0

The error occurs on nlp.update after 2 or 3 iterations.

import random  # needed for random.shuffle below

other_pipes = [pipe for pipe in nlp.pipe_names if pipe != 'ner']
with nlp.disable_pipes(*other_pipes):  # train only the NER component
    optimizer = nlp.begin_training()
    for itn in range(n_iter):
        random.shuffle(train)
        losses = {}
        for text, annotations in train:
            nlp.update(
                [text],          # batch of texts
                [annotations],   # batch of annotations
                drop=dropout,    # make it harder to memorize data
                sgd=optimizer,   # update weights
                losses=losses)
        print(losses)

lock bot commented May 7, 2018

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

lock bot locked as resolved and limited conversation to collaborators May 7, 2018