
Improve XR-Transformer memory efficiency #128

Merged

Conversation


@jiong-zhang jiong-zhang commented Mar 31, 2022

Issue #, if available:

Description of changes:

  • Add XMCTextDataset, a dataset for the XMC problem that tokenizes input text at batch-generation time. This avoids pre-tokenizing the whole training corpus, which can lead to out-of-memory (OOM) errors when the number of instances is large (a conceptual sketch follows this list).
  • Deprecated the --save-emb-dir argument in pecos.xmc.xtransformer.train. Embeddings should instead be obtained through pecos.xmc.xtransformer.encode.
  • Deprecated the --steps-scale argument in pecos.xmc.xtransformer.train. To set a different number of training steps for each layer, set a custom max_steps in TransformerMatcher.TrainParams.
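
For illustration, here is a minimal, hypothetical sketch of the batch-time tokenization idea using a plain PyTorch Dataset and a Hugging Face tokenizer. It is not PECOS's actual XMCTextDataset implementation; the class name LazyTokenizedTextDataset, the collate_fn, the bert-base-uncased checkpoint, and max_length=128 are all illustrative assumptions.

```python
# Hypothetical sketch (not PECOS's XMCTextDataset): tokenization is deferred
# to batch collation, so only the raw strings of the whole corpus are kept in
# memory, and token-ID tensors exist only for the current mini-batch.
import torch
from torch.utils.data import Dataset, DataLoader
from transformers import AutoTokenizer


class LazyTokenizedTextDataset(Dataset):
    """Stores raw text; tokenization happens at batch-generation time."""

    def __init__(self, texts, labels):
        self.texts = texts
        self.labels = labels

    def __len__(self):
        return len(self.texts)

    def __getitem__(self, idx):
        # Return raw text; no per-instance token tensors are pre-computed.
        return self.texts[idx], self.labels[idx]


tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # illustrative choice


def collate_fn(batch):
    texts, labels = zip(*batch)
    # Tokenize only the current mini-batch, keeping peak memory proportional
    # to the batch size rather than the corpus size.
    enc = tokenizer(
        list(texts), padding=True, truncation=True, max_length=128,
        return_tensors="pt",
    )
    enc["labels"] = torch.tensor(labels)
    return enc


loader = DataLoader(
    LazyTokenizedTextDataset(["text a", "text b"], [0, 1]),
    batch_size=2,
    collate_fn=collate_fn,
)
```

The design point this sketch tries to show is the one the PR describes: the full corpus is held only as raw strings, and tokenized tensors are produced per batch instead of being materialized up front.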

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

@jiong-zhang force-pushed the xr-transformer-realtime-tokenizer branch from 1ab9a78 to d6a9d4f on April 1, 2022 02:04
@jiong-zhang force-pushed the xr-transformer-realtime-tokenizer branch from b2d9761 to f64878e on April 1, 2022 02:31
@jiong-zhang changed the title from "Add XR-Transformer realtime tokenizer" to "Improve XR-Transformer memory efficiency" on Apr 1, 2022
@jiong-zhang merged commit 685a098 into amzn:mainline on Apr 1, 2022
@jiong-zhang deleted the xr-transformer-realtime-tokenizer branch on April 14, 2022 18:02