Patch description
`--n-extra-positions` can be used to expand the total number of positions available when encoding documents + input in pre-trained transformers. If we set this > 0, then all of the end positions are supposed to be filled with the doc tokens, leaving the beginning positions for the input text.
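As a rough illustration of that layout (a minimal sketch with hypothetical names, not ParlAI's internals; the assumption that the input text gets up to `n_positions` tokens is mine), the combined sequence has the input text at the front and document tokens filling the remaining end positions:

```python
# Minimal sketch of the intended position layout when --n-extra-positions > 0.
# All names here are illustrative, not ParlAI's actual API.
from typing import List


def layout_expanded_input(
    input_tokens: List[int],
    doc_tokens: List[int],
    n_positions: int,
    n_extra_positions: int,
) -> List[int]:
    """Place the input text in the beginning positions, doc tokens in the end ones."""
    total = n_positions + n_extra_positions   # full position budget
    front = input_tokens[:n_positions]        # beginning positions: input text
    back = doc_tokens[: total - len(front)]   # end positions: document tokens
    return front + back
```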
In the current RAG implementation, if we specify `--n-extra-positions M` with `M` smaller than `--n-positions N`, we encounter an issue where the documents are not properly truncated. The problem lies in the `concat_docs_and_input` function, where `self.expanded_input_truncate` is used to determine the maximum length of the documents. That attribute is overloaded, which is why the bug was a bit subtle. It was not previously caught because, in all of my experiments, `--n-extra-positions > --n-positions`.
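In other words, the document budget has to be derived from the positions the model actually has, not from the overloaded truncate attribute alone. The sketch below (hypothetical names, not the patch itself) shows the kind of cap that is needed:

```python
# Hypothetical sketch of the required cap, not the actual patch code.
def doc_truncate_length(
    expanded_input_truncate: int,
    input_length: int,
    n_positions: int,
    n_extra_positions: int,
) -> int:
    """Bound the document length by the positions left once the input is placed."""
    # Positions remaining for documents after the input text.
    available = (n_positions + n_extra_positions) - input_length
    # Using expanded_input_truncate by itself can exceed this budget;
    # capping by `available` keeps the documents within the model's positions.
    return min(expanded_input_truncate, max(available, 0))
```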
Testing steps
Added a test to confirm that `concat_docs_and_input` respects the `n_positions` available in the model.
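The property being asserted looks roughly like the following (an illustrative sketch; the helper name and signature are made up, not the committed test):

```python
# Illustrative sketch of the property the new test checks; this is not the
# committed test code.
import torch


def assert_within_position_budget(
    expanded_input: torch.LongTensor, n_positions: int, n_extra_positions: int
) -> None:
    """The output of concat_docs_and_input must fit the model's positions."""
    budget = n_positions + n_extra_positions
    seq_len = expanded_input.size(-1)
    assert seq_len <= budget, f"expanded input has {seq_len} tokens, budget is {budget}"
```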