Ak/fix train feeding #5

AlexeyKruglov · 2017-11-19T17:49:30Z

Fix index out of range error.
About 8x (for me) training speed-up by feeding whole sample sequence through CuDNN, not char-by-char.
Inference: GPU memory requirement not growing with generated seq length anymore -- by dropping Variable history in inference.

... by feeding the whole sequence to CuDNN RNN, as opposed to character-by-character.

codeman38 · 2018-01-10T18:54:07Z

For what it's worth, using the pre-built pytorch 1.3 on macOS (where CUDA is not available), on an early-2015 MacBook Pro, this patch doesn't seem to improve performance on CPU-- instead, it makes training more than twice as slow. [Edited to add: Tested with CPU on Linux, and got similar results there as well.]

After 100 iterations using the default hyperparameters:
master: ~1.09s/iteration
fix-train-feeding: ~3.71s/iteration

Not sure whether this is an issue with this specific CPU, an inefficiency in PyTorch itself, or something that could be further improved in the char-rnn code, but it's definitely worth pointing out either way.

AlexeyKruglov added 3 commits November 19, 2017 17:46

Speed up training

80a79a3

... by feeding the whole sequence to CuDNN RNN, as opposed to character-by-character.

Fix bug with index out of range

3e0a865

Save GPU memory by not storing history during inference

601f1ed

codeman38 mentioned this pull request Jan 10, 2018

Fix off-by-one error in train.py #7

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ak/fix train feeding #5

Ak/fix train feeding #5

AlexeyKruglov commented Nov 19, 2017

codeman38 commented Jan 10, 2018 •

edited

Loading

Ak/fix train feeding #5

Are you sure you want to change the base?

Ak/fix train feeding #5

Conversation

AlexeyKruglov commented Nov 19, 2017

codeman38 commented Jan 10, 2018 • edited Loading

codeman38 commented Jan 10, 2018 •

edited

Loading