[RAG] Handle Different DPR Model File with Pre-trained Model #3688

klshuster · 2021-06-04T15:35:46Z

Patch description
This patch fixes a bug in which one would like to swap the --dpr-model-file for a pre-trained RAG or FiD model. The underlying issue is that the dpr model weights are loaded during model initialization, PRIOR TO loading the pre-trained model weights. This is fine when the pre-trained model is an underlying seq2seq model (BART, T5, BB), but it is not ok when the pre-trained model is a RAG or FiD model.

The solution is to overwrite the retriever weights in the state_dict with the already loaded dpr model weights.

Testing steps
Included CI testing:

$ pytest -k TestLoadDPRModel
===== test session starts =====
platform linux -- Python 3.7.9, pytest-6.2.1, py-1.10.0, pluggy-1.0.0.dev0
rootdir: /private/home/kshuster/ParlAI, configfile: pytest.ini
plugins: hydra-core-1.0.0, requests-mock-1.8.0, regressions-2.1.1, datadir-1.3.1
collected 96 items / 95 deselected / 1 selected

test_rag.py .                                                                                                                                                                      [100%]

=====slowest 10 durations =====
41.01s call     tests/nightly/gpu/test_rag.py::TestLoadDPRModel::test_load_dpr

(2 durations < 0.005s hidden.  Use -vv to show these durations.)
=====1 passed, 95 deselected, 4 warnings in 43.30s =====

mojtaba-komeili · 2021-06-04T16:01:28Z

tests/nightly/gpu/test_rag.py

+            dpr_model='bert_from_parlai_rag',
+            pretrained_path=RAG_SEQUENCE_ZOO_MODEL,
+        )
+        assert not torch.allclose(


nit: why not using the unittest functions like assertIsNone and assertTrue etc.?

i've spoken with @stephenroller about this and he's told me that it's all the same to pytest

mojtaba-komeili · 2021-06-04T16:02:23Z

parlai/agents/rag/rag.py

+                logging.warning(
+                    f"Overriding DPR Model with {modelzoo_path(opt['datapath'], opt['dpr_model_file'])}"
+                )
+        except FileNotFoundError:


should there be warning here?

hmm no, since it's supposed to be silent to the user

mojtaba-komeili · 2021-06-04T16:05:08Z

parlai/agents/rag/rag.py

+        try:
+            init_model, _ = self._get_init_model(opt, None)
+            init_model_opt = Opt.load(f'{init_model}.opt')
+            override_dpr = modelzoo_path(


couldn't it just compare opt['dpr_model_file'] != init_model_opt['dpr_model_file'], since the rest seems to be the same?

unfortunately I don't think so -> modelzoo_path only modifies the path if it starts with zoo:, otherwise it's a no-op. Someone could, theoretically, pass in the full path to their dpr model file, even if it is the zoo path, so we need to make sure that the reference isn't the same

mojtaba-komeili

Thanks for fixing it so quickly.

moyapchen · 2021-06-04T16:07:42Z

parlai/agents/rag/rag.py

+        by the state loading.
+
+        NOTE: If `--model-file M` was trained with `--dpr-model-file D`, and
+        `--dpr-model-file D` is specified *after training* (i.e., in eval/interactive),


Thought here - should there be an "--override-dpr-model-file-for-eval"?

(This was the initial case that triggered everything so...)

i think this case i mention here was not what triggered things, but with the new implementation it's handled smoothly

moyapchen · 2021-06-04T16:14:44Z

tests/nightly/gpu/test_rag.py

+    See RagAgent._should_override_dpr_model_weights for important note
+    regarding specifying the *same* dpr model file as was used to train
+    the model.
+    """


Nit: For clarity's sake, might be nice to be explicit about the 3 cases that show up and expected behavior

A -> Init DPR Model for M B -> DPR Model within M after training M C -> New DPR Model you're using to override

added a comment!

klshuster · 2021-06-04T16:58:38Z

i actually think there is a simpler way to do this, implementing now

klshuster · 2021-06-04T17:42:50Z

Re-requesting review as this can be handled better

moyapchen

LGTM. Thanks!

klshuster added 2 commits June 4, 2021 11:24

handle dpr model file correctly

c637397

update comments

0ec7ea3

klshuster requested review from mojtaba-komeili and moyapchen June 4, 2021 15:35

facebook-github-bot added the CLA Signed label Jun 4, 2021

mojtaba-komeili reviewed Jun 4, 2021

View reviewed changes

mojtaba-komeili approved these changes Jun 4, 2021

View reviewed changes

moyapchen reviewed Jun 4, 2021

View reviewed changes

different way of doing this

5fe3c50

klshuster requested review from moyapchen and mojtaba-komeili June 4, 2021 17:42

fix import

bb40dd8

moyapchen approved these changes Jun 7, 2021

View reviewed changes

klshuster merged commit 64d3859 into master Jun 8, 2021

klshuster deleted the fix_dpr_mf branch June 8, 2021 18:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RAG] Handle Different DPR Model File with Pre-trained Model #3688

[RAG] Handle Different DPR Model File with Pre-trained Model #3688

klshuster commented Jun 4, 2021

mojtaba-komeili Jun 4, 2021

klshuster Jun 4, 2021

mojtaba-komeili Jun 4, 2021

klshuster Jun 4, 2021

mojtaba-komeili Jun 4, 2021

klshuster Jun 4, 2021

mojtaba-komeili left a comment

moyapchen Jun 4, 2021

klshuster Jun 4, 2021

moyapchen Jun 4, 2021

klshuster Jun 4, 2021

klshuster commented Jun 4, 2021

klshuster commented Jun 4, 2021

moyapchen left a comment

[RAG] Handle Different DPR Model File with Pre-trained Model #3688

[RAG] Handle Different DPR Model File with Pre-trained Model #3688

Conversation

klshuster commented Jun 4, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mojtaba-komeili left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

klshuster commented Jun 4, 2021

klshuster commented Jun 4, 2021

moyapchen left a comment

Choose a reason for hiding this comment