
Optimising sigmoid function everywhere #990

Merged — 5 commits merged into piskvorky:develop on Nov 8, 2016
Conversation

markroxor (Contributor)

No description provided.

    return sum(lprob)


def score_cbow_pair(model, word, word2_indices, l1):
    l2a = model.syn1[word.point]  # 2d matrix, codelen x layer1_size
    sgn = (-1.0)**word.code  # ch function: 0 -> 1, 1 -> -1
-   lprob = -log(1.0 + exp(-sgn * dot(l1, l2a.T)))
+   lprob = log(expit(sgn * dot(l1, l2a.T)))
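The numerical difference between the two forms can be sketched with a standalone NumPy/SciPy snippet (the scalar value `x = -1000.0` is hypothetical, not from the PR; the real code applies this to a dot product):

```python
import numpy as np
from scipy.special import expit  # stable sigmoid: 1 / (1 + exp(-x))

x = -1000.0  # hypothetical extreme score

# Naive form: exp(-x) = exp(1000) overflows to inf and raises a
# RuntimeWarning before the outer log brings the value back down.
with np.errstate(over='ignore'):
    naive = -np.log(1.0 + np.exp(-x))

# expit saturates cleanly to 0.0 with no overflow warning; the log of
# an exact zero still yields -inf, but the overflow itself is gone.
with np.errstate(divide='ignore'):
    stable = np.log(expit(x))
```

Both expressions end up at `-inf` for such an extreme input; the change removes the overflow warning rather than the saturation itself, which is the point piskvorky raises below.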
piskvorky (Owner) commented on Nov 1, 2016:
Not directly related to this PR, but since we're on the topic, there's a function in numpy for safely adding up logs of exps: numpy.logaddexp(0, x).
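The identity log(sigmoid(x)) = -logaddexp(0, -x) shows why this is safer than taking the log of expit; a minimal sketch (hypothetical extreme input):

```python
import numpy as np
from scipy.special import expit

x = -1000.0  # hypothetical extreme input

# log(expit(x)) underflows: expit(-1000.0) is exactly 0.0, so the log
# collapses to -inf and the magnitude of the score is lost.
with np.errstate(divide='ignore'):
    via_expit = np.log(expit(x))

# numpy.logaddexp computes log(exp(a) + exp(b)) without overflow, so
# -logaddexp(0, -x) == log(sigmoid(x)) keeps the full value, -1000.0.
via_logaddexp = -np.logaddexp(0.0, -x)
```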

piskvorky (Owner) commented Nov 1, 2016

These are all welcome @markroxor , but I wonder why such overflows/underflows happen in the first place.

Making exp(1000) or exp(-1000) "work" is probably just masking the real underlying problem, the root cause. Why should the exponents in word2vec be so huge?

markroxor (Contributor, Author):
I've never encountered a case where such a warning occurs, but it was reported in issue #838 (it depends on the corpus, I suppose). AFAIK we are just handling the huge exponents in a better way, so it should not cause any problem.

gojomo (Collaborator) commented Nov 1, 2016

Using a scipy function seems like a good idea. However, note that these pure-Python paths are not exercised by our automated testing (because the Cythonized routines are available at that point), so such changes need careful testing by other, more manual means. @markroxor, have you run the Word2Vec/Doc2Vec test suites locally with the optimized code disabled?

piskvorky (Owner) commented Nov 2, 2016

> AFAIK we are just handling the huge exponents in a better way so it should not cause any problem.

Well, that's the part I doubt. The fact that we now return 0.0 or inf or -inf correctly, without an overflow/underflow warning, doesn't really qualify as "no problem" IMO. It's possible it just pushes the problem further upstream.

Your changes are nice and welcome @markroxor (and LGTM now); this is more a note to self and @tmylk to investigate and fix this more thoroughly.

markroxor (Contributor, Author):
@gojomo "with the optimized-code disabled". Do you mean that I should test word2vec/doc2vec without incorporating these changes?

gojomo (Collaborator) commented Nov 2, 2016

@markroxor - the change you've made is in code that's only run if the Cython-optimized versions (word2vec_inner.pyx, doc2vec_inner.pyx) are not available. In a normal installation, or in the Travis-CI tests, those versions are available. So the changed code won't be run unless you do something to disable the optimized code - like changing the load-attempt at the top of doc2vec.py or word2vec.py.

At times in the past, to test the pure-Python paths, I've commented out that load, then run a full test suite locally. It might be nice if the classes allowed a user switch without editing code. But without the optimization, tests that normally take seconds take many minutes, so as a practical matter the pure-Python paths may never be part of the automated test suite.
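The load-attempt gojomo describes follows a common optional-Cython pattern; a simplified sketch (not gensim's exact code, but assuming its `gensim.models.word2vec_inner` module and `FAST_VERSION` flag convention) — commenting out the `try` import is what forces the pure-Python paths:

```python
# Sketch of the optional-import pattern at the top of word2vec.py:
# try the compiled Cython routines, fall back to pure Python.
try:
    from gensim.models.word2vec_inner import train_batch_sg  # noqa: F401
    FAST_VERSION = 1  # compiled routines available and will be used
except ImportError:
    # Compiled extension missing: the slow pure-Python paths run,
    # including the score_cbow_pair code changed in this PR.
    FAST_VERSION = -1
```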

markroxor (Contributor, Author):
@gojomo I executed the tests locally and they all passed. Indeed, the execution was very slow.

@tmylk tmylk merged commit dae481e into piskvorky:develop Nov 8, 2016
@markroxor markroxor deleted the 895 branch December 22, 2017 05:35