added initial evals #8
Conversation
😍

    def forward(self, task, model, input_ids, attention_mask):

        if task == "retrieval":
TaskType
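Presumably this points at replacing the raw string check with an enum. A minimal sketch of that idea; the enum members and the non-retrieval task name are assumptions, not taken from the PR:

    from enum import Enum

    class TaskType(str, Enum):
        RETRIEVAL = "retrieval"
        GENERATION = "generation"   # assumed second task, for illustration only

    def forward(self, task: TaskType, model, input_ids, attention_mask):
        if task == TaskType.RETRIEVAL:
            ...

Backing the enum with str keeps existing call sites that pass "retrieval" working, since TaskType.RETRIEVAL compares equal to the plain string and TaskType("retrieval") parses it.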
    # the number of bi-directional links created for every new element during
    # construction. Reasonable range for M is 2-100. Higher M work better on
    # datasets with high intrinsic dimensionality and/or high recall,
    # while low M work better for datasets with low intrinsic dimensionality and/or
how do you measure "intrinsic dimensionality"? I'm curious both qualitatively and quantitatively
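For reference, a rough sketch (not part of the PR) of two common ways to put a number on intrinsic dimensionality for a set of embeddings: a PCA explained-variance count and the Two-NN estimator. The embeddings variable and helper names are hypothetical.

    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.neighbors import NearestNeighbors

    def pca_intrinsic_dim(embeddings, variance=0.90):
        # number of principal components needed to explain `variance` of the data
        ratios = PCA().fit(embeddings).explained_variance_ratio_
        return int(np.searchsorted(np.cumsum(ratios), variance)) + 1

    def twonn_intrinsic_dim(embeddings):
        # Two-NN estimator (Facco et al., 2017): MLE fit on the ratio of each
        # point's 2nd to 1st nearest-neighbour distance
        dist, _ = NearestNeighbors(n_neighbors=3).fit(embeddings).kneighbors(embeddings)
        mask = dist[:, 1] > 0                  # drop exact duplicates; dist[:, 0] is the point itself
        mu = dist[mask, 2] / dist[mask, 1]
        return len(mu) / np.sum(np.log(mu))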
    # low recalls

    # Initializing index - the maximum number of elements should be known beforehand
    search_index.init_index(max_elements=num_elements, ef_construction=200, M=100)
Let's figure out how to make these values dynamic for a given dataset. Even something as simple as building a heuristic around tf-idf
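One possible shape for such a heuristic, sketched as an assumption rather than a concrete proposal: scale M with embedding dimensionality and ef_construction with corpus size, clamped to hnswlib's recommended ranges. The constants and the `embeddings` name are illustrative only.

    def pick_hnsw_params(num_elements, dim):
        # illustrative heuristic only; the constants are guesses, not tuned values
        M = int(min(100, max(16, dim // 8)))
        ef_construction = int(min(800, max(200, 2 * M, num_elements // 100)))
        return M, ef_construction

    # `embeddings` stands in for whatever matrix is being indexed
    M, ef_construction = pick_hnsw_params(num_elements, dim=embeddings.shape[1])
    search_index.init_index(max_elements=num_elements, ef_construction=ef_construction, M=M)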
@@ -0,0 +1,120 @@
    import numpy as np
    import hnswlib
any particular reason we use this over annoy, faiss, or even lancedb?
Is this index out-of-core or is it stored in-memory?
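For context (not an answer from the PR itself): hnswlib holds the whole graph in RAM; the index can be serialized to disk and re-loaded, but queries always run against the in-memory structure. A small sketch with illustrative sizes and paths:

    import numpy as np
    import hnswlib

    dim, n = 768, 10_000
    index = hnswlib.Index(space="cosine", dim=dim)
    index.init_index(max_elements=n, ef_construction=200, M=16)
    index.add_items(np.random.rand(n, dim).astype(np.float32))

    index.save_index("passages.bin")                       # snapshot on disk

    restored = hnswlib.Index(space="cosine", dim=dim)
    restored.load_index("passages.bin", max_elements=n)    # loaded fully back into RAM
    labels, distances = restored.knn_query(np.random.rand(1, dim).astype(np.float32), k=5)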
        k, search_index, query_embeddings, ids_to_cat_dict, threshold=0.7
    ):
        # Controlling the recall by setting ef:
        search_index.set_ef(100)  # ef should always be > k
let's make k and ef global variables then, so we can ensure that ef >> k
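A minimal sketch of that suggestion, with assumed constant names: define k and ef once at module level and assert the invariant up front.

    TOP_K = 10         # assumed value
    EF_SEARCH = 100    # ef should comfortably exceed k for good recall

    assert EF_SEARCH > TOP_K, "hnswlib needs ef > k; keep a healthy margin"

    # ... then inside the search helper:
    search_index.set_ef(EF_SEARCH)
    labels, distances = search_index.knn_query(query_embeddings, k=TOP_K)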
        ).detach().float().cpu()

        start_index = step * args.test_batch_size
        end_index = start_index + args.test_batch_size if (start_index + args.test_batch_size) < num_passages else num_passages
Suggested change:
-       end_index = start_index + args.test_batch_size if (start_index + args.test_batch_size) < num_passages else num_passages
+       end_index = min(start_index + args.test_batch_size, num_passages)
        )
        search_results = get_nearest_neighbours(args.top_k, passage_search_index, query_embeddings, passage_to_id_dict, threshold=0.0)

        retrieved_cats = [item[0] for item in search_results]
a little easier to understand:
Suggested change:
-       retrieved_cats = [item[0] for item in search_results]
+       retrieved_cats = [cat for cat, _ in search_results]
    query:text , query:embeddings  # you need to take both of these things into account
    passage:text , query:embeddings
what are these lines doing? What is this syntax?
    for seq in sequences:
        print(f"Result: {seq['generated_text']}")

    # use regex and see whether the answer is the
How do we eval the generation? gpt-4? Check for hallucinations? Bleu/rouge against original answer?
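Sketch of one of the options mentioned (ROUGE against the original answer), using the rouge-score package; the generated_answers / reference_answers lists are assumed names, not variables from the PR:

    from rouge_score import rouge_scorer

    scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)

    rouge_l = []
    for generated, reference in zip(generated_answers, reference_answers):
        scores = scorer.score(reference, generated)   # signature is score(target, prediction)
        rouge_l.append(scores["rougeL"].fmeasure)

    print(f"mean ROUGE-L F1: {sum(rouge_l) / len(rouge_l):.3f}")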
**not completed**