Prompt Experimentation and UI updates #36

oindrillac · 2024-02-15T03:00:29Z

No description provided.

Co-authored-by: Aakanksha Duggal <aduggal@redhat.com>

review-notebook-app · 2024-02-15T03:00:34Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

aakankshaduggal

/lgtm
Thanks @oindrillac

hemajv

Added a few minor comments, but overall it looks great 🎉

We can summarize the findings and results from prompt_experiments.ipynb in our final report/slide deck and share it with the team 😄

hemajv · 2024-02-19T22:20:13Z

app/app.py

-            "**LangChain evaluation on grammar, descriptiveness and helpfulness:**",
-            help="Use Langchain to evaluate on cutsom criteria (this list can be updated based on what we are looking to see from the generated docs"
+            "**LLM based evaluation on logic, correctness and helpfulness:**",
+            help="Use Langchain Criteria based Eval to evaluate on cutsom criteria (this list can be updated based on what we are looking to see from the generated docs). Note this is language mo0del based evaluation and not always a true indication of the quality of the output that is generatged."


Small nit fix: "Note this is language mo0del..." should be "Note this is language model...", and "...output that is generatged" should be "...output that is generated".

hemajv · 2024-02-19T22:22:53Z

notebooks/evaluation/results_1.csv

@@ -0,0 +1,3141 @@
+,model,prompt,code_file,part,response,langchain_helpfulness,langchain_correctness,langchain_logical,instruction,total_langchain_score


repo structuring change: We should perhaps move all the CSV files and the eval_df.pkl file into a separate folder: /evaluation/data
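
A rough sketch of how the move could be scripted, in case it helps. This assumes the target folder is a `data` subfolder under `notebooks/evaluation`; the function name `move_eval_artifacts` is made up for illustration and is not something in the repo.

```python
from __future__ import annotations

import shutil
from pathlib import Path


def move_eval_artifacts(eval_dir: Path, data_dirname: str = "data") -> list[Path]:
    """Move top-level CSV files and eval_df.pkl from eval_dir into eval_dir/<data_dirname>.

    Hypothetical helper: the folder layout is an assumption, not the repo's
    confirmed structure.
    """
    data_dir = eval_dir / data_dirname
    data_dir.mkdir(parents=True, exist_ok=True)

    # sorted() materializes the listing before any file is moved.
    candidates = sorted(eval_dir.glob("*.csv")) + [eval_dir / "eval_df.pkl"]

    moved = []
    for src in candidates:
        if src.is_file():  # skip anything that is missing or already moved
            dest = data_dir / src.name
            shutil.move(str(src), str(dest))
            moved.append(dest)
    return moved


# Example call (not executed here):
# move_eval_artifacts(Path("notebooks/evaluation"))
```

Inside a git checkout, `git mv` may be preferable so that file history is preserved, and any paths in the notebooks that read these files would need updating to match.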

hemajv · 2024-02-19T22:43:41Z

notebooks/evaluation/prompt_experiments.ipynb

I couldn't review this notebook on ReviewNB, and the file diff was too large to leave comments line by line, so I am summarizing my comments here:

Include a short description at the beginning of the notebook to explain what the notebook is about.

Since the get_response() and append_row_to_dataframe() functions are already defined in helper_functions.ipynb, maybe you can invoke the functions from there rather than defining them again in this notebook?

Small nit fix: in the Conclusion section, I think point 3 needs to be rephrased from "When he granite model it fails, it fails becuase half...." to "When the granite model fails, it fails because half of the time....".

hemajv · 2024-02-28T21:36:57Z

As discussed, we will be addressing the comments in a separate PR. Merging this for now.

/lgtm

oindrillac and others added 2 commits February 15, 2024 02:48

updates to app

c8fe3bb

added prompt experiments

3da2b4b

Co-authored-by: Aakanksha Duggal <aduggal@redhat.com>

oindrillac requested review from aakankshaduggal and hemajv February 15, 2024 04:35

aakankshaduggal approved these changes Feb 15, 2024

hemajv requested changes Feb 19, 2024

hemajv merged commit 2e70c97 into redhat-et:main Feb 28, 2024
0 of 2 checks passed

hemajv mentioned this pull request Feb 28, 2024

Clean-up and structuring of prompt_experiments.ipynb #37

Open
