Fix accelerator issue #16

SachiraKuruppu · 2023-09-13T13:36:04Z

Provide the complete model (rag_model) to the accelerator instead of giving the retriever and the generator separately.
Change the forward path to use the retriever or the generator based on the task string.

This declutters the training code.

SachiraKuruppu · 2023-09-13T13:50:25Z

This needs a bit more work:

Add arguments to AutoModelForRagE2E to load the model for inference with peft layers.
Fixdalm/eval/eval_with_hnsw.py use of rag_model.forward(...) function.

I'll have a look tomorrow, or feel free to fix these.

shamanez

looking good

SachiraKuruppu added 6 commits September 13, 2023 12:10

Extract training script dataset process function to utils

a3d3321

Rename train utils module to train_utils

88e9f00

Move models to separate directory.

3856e06

Fix linter issues

587e2d6

Refactor retriever only training script

a194252

Send the whole model to accelerator

d927d48

shamanez approved these changes Sep 13, 2023

View reviewed changes

shamanez merged commit 86d598a into main Sep 13, 2023
1 check failed

Provide feedback