
rag e2e first commit #2

Merged: 3 commits merged into main from feat/e2e-rag on Sep 1, 2023
Conversation

shamanez (Member):

No description provided.

import torch
from transformers import AutoModel


class AutoModelForSentenceEmbedding(torch.nn.Module):
shamanez (Member Author):

I would add the causal language model to this class. Obviously we can change the class name.

Reason: if we later want to use Accelerate with DeepSpeed, it currently won't work with two separate models.
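A minimal sketch of what such a combined module could look like; the class name `RagE2EModel` and the constructor arguments are hypothetical, not from this PR:

```python
import torch


class RagE2EModel(torch.nn.Module):
    """Wraps the retriever encoder and the causal LM in a single nn.Module,
    so Accelerate/DeepSpeed only has to prepare one model."""

    def __init__(self, retriever, generator, tokenizer, normalize=True):
        super().__init__()
        self.retriever = retriever  # e.g. an AutoModel sentence encoder
        self.generator = generator  # e.g. an AutoModelForCausalLM
        self.tokenizer = tokenizer
        self.normalize = normalize
```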

        self.normalize = normalize
        self.tokenizer = tokenizer

    def forward(self, **kwargs):
shamanez (Member Author):

If you add a causal language model to the above init function, you can easily add another parameter to the forward pass and get the output from either the retriever or the generator.

Something like model_type.
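A hedged sketch of that routing, continuing the hypothetical `RagE2EModel` above; `mean_pooling` stands in for whatever pooling helper the real class uses and is an assumption here:

```python
import torch

# forward method for the hypothetical RagE2EModel sketched above
def forward(self, model_type, **kwargs):
    # Route the call to the retriever or the generator based on `model_type`.
    if model_type == "retriever":
        outputs = self.retriever(**kwargs)
        # `mean_pooling` is assumed: pool token states into one sentence vector
        embeddings = self.mean_pooling(outputs, kwargs["attention_mask"])
        if self.normalize:
            embeddings = torch.nn.functional.normalize(embeddings, p=2, dim=1)
        return embeddings
    if model_type == "generator":
        return self.generator(**kwargs)
    raise ValueError(f"unknown model_type: {model_type!r}")
```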

Contributor:

Interesting idea.


if is_diffusers_available():
    from .models import (
        DDPOPipelineOutput,
shamanez (Member Author):

I can change this naming later, so don't worry.

    logprobs_logits, doc_logprobs, query_token_length
)

loss = get_nll(marginalized_log_probs, input_tensors[:, 1:])
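For context, a hedged sketch of what RAG-style marginalization typically computes here. The shapes and helper name are assumptions, not the PR's exact code; the real call also takes `query_token_length`, presumably to avoid marginalizing over the query prefix, which is omitted here:

```python
import torch


def marginalize_log_probs(logprobs_logits, doc_logprobs):
    # logprobs_logits: (n_docs, seq_len, vocab_size) generator token log-probs,
    #                  one sequence per retrieved document
    # doc_logprobs:    (n_docs,) log p(doc | query) from the retriever
    # RAG marginalization:
    #   log p(token | query) = logsumexp_d [ log p(doc_d | query)
    #                                        + log p(token | query, doc_d) ]
    return torch.logsumexp(logprobs_logits + doc_logprobs.view(-1, 1, 1), dim=0)
```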
shamanez (Member Author):

Let's take the mean loss here, reducing the per-example (batch_size × 1) loss to a single scalar.
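A hedged sketch of the suggested reduction; the gather-based NLL is an assumption about `get_nll`'s internals, not the PR's exact implementation:

```python
import torch


def get_nll(log_probs, target_ids):
    # log_probs:  (batch_size, seq_len, vocab_size) marginalized log-probs
    # target_ids: (batch_size, seq_len) gold next tokens (inputs shifted by one)
    token_nll = -log_probs.gather(-1, target_ids.unsqueeze(-1)).squeeze(-1)
    # Sum over the sequence for a (batch_size,) per-example loss, then take
    # the batch mean so training sees a single scalar, as suggested above.
    return token_nll.sum(dim=-1).mean()
```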


# Prepare everything with our `accelerator`.
# see https://github.com/huggingface/accelerate/issues/253#issuecomment-1253231210
r_model, c_model = accelerator.prepare(r_model, c_model)
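For reference, a sketch of the surrounding usage, assuming `r_model`, `c_model`, `optimizer`, and `train_dataloader` are already constructed; the names are placeholders:

```python
from accelerate import Accelerator

accelerator = Accelerator()
# Passing both models through a single `prepare` call (per the linked issue)
# works for DDP; DeepSpeed currently handles only one model per `prepare`,
# which is why the earlier comment suggests merging them into one nn.Module.
r_model, c_model, optimizer, train_dataloader = accelerator.prepare(
    r_model, c_model, optimizer, train_dataloader
)
```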
shamanez (Member Author):

perfect!

Two review threads on rag_e2e/e2e_peft_lora_constrastive_learning.py (outdated, resolved).
shamanez (Member Author) left a comment:

Good job. Everything seems great.

shamanez (Member Author) left a comment:

correct

@shamanez merged commit 9be379f into main on Sep 1, 2023.
@shamanez deleted the feat/e2e-rag branch on Sep 1, 2023 at 02:23.