fix beam search script #175

szha · 2018-06-27T05:20:28Z

Description

The following command was broken:
python beam_search_generator.py --bos I think --lm standard_lstm_lm_200

This PR fixes the issue by adding state_info to the language models while preserving existing behavior.

Checklist

Essentials

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented

Changes

add state_info to models
use state_info to decide state shape in beam search
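
As a rough illustration of the second change, the sketch below reads the batch axis from a state's `__layout__` entry and expands the state along that axis. It uses NumPy as a stand-in for MXNet NDArrays; the function name, the simplified signature, and the LSTM-style `'LNC'` state are assumptions for illustration, not the code in this PR.

```python
import numpy as np


def expand_to_beam_size(data, beam_size, state_info=None):
    """Repeat every batch entry beam_size times along the batch axis.

    Hypothetical, simplified stand-in for the private helper touched by
    this PR. Without state_info the batch axis is assumed to be 0.
    """
    if not state_info:
        batch_axis = 0
    else:
        batch_axis = state_info['__layout__'].find('N')
    # Each batch entry is repeated consecutively: b0, b0, ..., b1, b1, ...
    return np.repeat(data, beam_size, axis=batch_axis)


# Assumed example: an LSTM-style state laid out as
# (num_layers, batch_size, hidden_size).
state = np.zeros((2, 3, 4))
info = {'shape': (2, 3, 4), '__layout__': 'LNC'}

expanded = expand_to_beam_size(state, beam_size=5, state_info=info)
print(expanded.shape)  # (2, 15, 4)
```

Without the layout information, the same call would expand axis 0 (the layer axis in this example) instead of the batch axis.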

szha · 2018-06-27T05:20:54Z

@sxjscience @hhexiy

mli · 2018-06-27T16:09:12Z

Job PR-175/6 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-175/6/index.html

szhengac · 2018-06-27T17:53:54Z

gluonnlp/model/beam_search.py

@@ -92,30 +92,56 @@ def _expand_to_beam_size(data, beam_size, batch_size):
        Each NDArray should have shape (batch_size * beam_size, ...)
    """
    if isinstance(data, list):
-        return [_expand_to_beam_size(ele, beam_size, batch_size) for ele in data]
+        assert not state_info or isinstance(state_info, list), \


I think we can move this check outside the if/else by using:
assert not state_info or isinstance(state_info, type(data))
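
A minimal sketch of what a single up-front check could look like, using NumPy arrays in place of NDArrays. The helper name `map_states` and its callback are hypothetical. One adjustment is an assumption of this sketch: at a leaf, `data` is an array while `state_info` is a dict, so `dict` is accepted in addition to `type(data)`.

```python
import numpy as np


def map_states(fn, data, state_info=None):
    """Apply fn(array, info) to every leaf of a nested list/tuple/dict.

    Hypothetical helper; the type check runs once, before any branch.
    """
    # dict is allowed as well because a leaf array is described by a dict.
    assert not state_info or isinstance(state_info, (type(data), dict)), \
        'data and state_info do not match, got: {} vs {}.'.format(
            type(state_info), type(data))
    if isinstance(data, (list, tuple)):
        infos = state_info if state_info else [None] * len(data)
        return type(data)(map_states(fn, d, s) for d, s in zip(data, infos))
    if isinstance(data, dict):
        infos = state_info if state_info else {}
        return {k: map_states(fn, v, infos.get(k)) for k, v in data.items()}
    if isinstance(data, np.ndarray):
        return fn(data, state_info)
    raise NotImplementedError


def batch_axis_of(array, info):
    """Return the position of 'N' in the layout, or 0 without state_info."""
    return info['__layout__'].find('N') if info else 0


states = [np.zeros((2, 3, 4)), np.zeros((2, 3, 4))]
infos = [{'__layout__': 'LNC'}, {'__layout__': 'LNC'}]
axes = map_states(batch_axis_of, states, infos)
print(axes)  # [1, 1]
```

Passing a tuple of infos for a list of states trips the assertion, which is the mismatch the check is meant to catch.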

szhengac · 2018-06-27T17:57:00Z

gluonnlp/model/beam_search.py

@@ -124,14 +150,37 @@ def _choose_states(F, states, indices):
        Each NDArray/Symbol should have shape (N, ...).
    """
    if isinstance(states, list):
-        return [_choose_states(F, ele, indices) for ele in states]
+        assert not state_info or isinstance(state_info, list), \


Same as previous comment

szhengac · 2018-06-27T17:58:53Z

gluonnlp/model/beam_search.py

    else:
        raise NotImplementedError


-def _choose_states(F, states, indices):
+def _choose_states(F, states, state_info, indices):
    """

    Parameters
    ----------
    F : ndarray or symbol
    states : Object contains NDArrays/Symbols
        Each NDArray/Symbol should have shape (N, ...).


Please fix the docstring: N may no longer be the first dimension.

szhengac · 2018-06-27T18:00:12Z

gluonnlp/model/beam_search.py

+        states = F.take(states, indices)
+        if batch_axis != 0:
+            states = states.swapaxes(0, batch_axis)
+        return states


So the returned states always have the batch in the first dimension, even if that is not the case for the input? Would that cause some inconsistency?

Because of the two swaps, the batch dimension should be where it used to be.

Yes, you are right.

We can directly use take(axis=batch_axis) once apache/mxnet#11326 is merged.
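
The equivalence discussed in this thread can be checked with a small NumPy sketch (NumPy stands in for MXNet here, and both function names are made up for illustration): swapping the batch axis to the front, gathering, and swapping back gives the same result as gathering directly along the batch axis.

```python
import numpy as np


def choose_states_two_swaps(states, indices, batch_axis):
    """Gather along the batch axis by swapping it to the front and back."""
    if batch_axis != 0:
        states = states.swapaxes(0, batch_axis)
    states = np.take(states, indices, axis=0)
    if batch_axis != 0:
        # The second swap puts the batch dimension back where it was.
        states = states.swapaxes(0, batch_axis)
    return states


def choose_states_direct(states, indices, batch_axis):
    """Gather directly along the batch axis."""
    return np.take(states, indices, axis=batch_axis)


indices = np.array([2, 0, 0, 1])
results = {}
for layout in ('NC', 'TNC', 'LNC', 'NTC'):
    batch_axis = layout.find('N')
    # Batch size 3 on the 'N' axis, distinct sizes on the other axes.
    shape = tuple(3 if ch == 'N' else 4 + i for i, ch in enumerate(layout))
    states = np.arange(np.prod(shape)).reshape(shape)
    swapped = choose_states_two_swaps(states, indices, batch_axis)
    direct = choose_states_direct(states, indices, batch_axis)
    results[layout] = (swapped, direct, batch_axis)
    assert np.array_equal(swapped, direct)
```

Looping over several layouts also exercises the case where the batch axis is not the first dimension.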

szhengac · 2018-06-27T18:04:23Z

Also, a test case with different layouts is needed.

sxjscience · 2018-06-28T11:56:16Z

gluonnlp/model/beam_search.py

-                   .reshape((batch_size * beam_size,) + data.shape[1:])
+        if not state_info:
+            state_info = {'__layout__': 'NC'}
+        batch_axis = state_info['__layout__'].find('N')


I think the following will be better:

    if not state_info:
        batch_axis = 0
    else:
        batch_axis = state_info['__layout__'].find('N')

sxjscience · 2018-06-29T03:43:20Z

tests/unittest/test_beam_search.py

-            return mx.nd.stack(*updated_states, axis=0)
+            if not state_info:
+                state_info = {'__layout__': 'NC'}
+            batch_axis = state_info['__layout__'].find('N')


Same here

    if not state_info:
        batch_axis = 0
    else:
        ...

Since it's in the test, the change is not necessary.

mli · 2018-06-30T02:36:43Z

Job PR-175/13 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-175/13/index.html

mli · 2018-06-30T18:08:35Z

Job PR-175/18 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-175/18/index.html

* fix beam search script
* add tests
* address comment
* fix
* update test

szha requested a review from sxjscience June 27, 2018 05:20

szha force-pushed the fix_beam branch 4 times, most recently from 76f2fdb to a5034a8 Compare June 27, 2018 15:53

szhengac reviewed Jun 27, 2018

szha force-pushed the fix_beam branch 2 times, most recently from e627b17 to 246f1aa Compare June 28, 2018 00:51

sxjscience reviewed Jun 28, 2018

sxjscience reviewed Jun 29, 2018

szha force-pushed the fix_beam branch from 6beb82e to 50c733c Compare June 30, 2018 02:22

szha force-pushed the fix_beam branch 2 times, most recently from c6fcd77 to d79403b Compare June 30, 2018 17:49

szha added 4 commits June 30, 2018 10:50

fix beam search script

ae0edd0

add tests

d33df29

address comment

76b228d

fix

87f5bc5

szha force-pushed the fix_beam branch from d79403b to ec04c5a Compare June 30, 2018 17:53

update test

ec04c5a

szhengac approved these changes Jun 30, 2018

szha merged commit b926873 into dmlc:master Jun 30, 2018

szha deleted the fix_beam branch June 30, 2018 18:37

leezu pushed a commit to leezu/gluon-nlp that referenced this pull request Jul 11, 2018

fix beam search script (dmlc#175)

de32fd5

* fix beam search script
* add tests
* address comment
* fix
* update test

paperplanet pushed a commit to paperplanet/gluon-nlp that referenced this pull request Jun 9, 2019

fix beam search script (dmlc#175)

97a82ad

* fix beam search script
* add tests
* address comment
* fix
* update test
