Fix Typos in Evaluation Script and System Prompt. Identify Errors in a Dataset #335

zuxin666 · 2024-04-10T23:01:11Z

This pull request addresses several minor issues found in the evaluation script and system prompts.
I also found some errors in one of the dataset as outlined below.

Changes:

Fixed a typo in the system prompt for chat models to ensure clarity in instructions provided to the model.
Evaluation Script Updates (openfunctions_evaluation.py):
- Modified function parameters in build_handler to directly pass temperature, top_p, and max_tokens which enhances the readability and maintainability of the code.
- Corrected the conditional checks and variable assignments to accurately reflect the intended logic for handling different test categories.

Dataset Corrections:

Two errors were identified in the dataset file gorilla_openfunctions_v1_test_parallel_multiple_function.json:

Line 58: We should change "required": ["location", "star"]}} to "required": ["location", "stars"]}}, to match the expected function parameters.
Line 171: We should change "required": ["player_name", "class"] to "required": ["player_name", "class_type"] to align with the parameter names used in functions.

I do not know how to update these files so just outline them here.

HuanzhiMao · 2024-04-10T23:38:14Z

Hi @zuxin666 ,
Thanks for your attention and pointing this out! We are addressing the dataset correction issue right now on our end. Stay tuned for a PR very soon that fixes it.

ShishirPatil

Thanks for the PR, @zuxin666! Thanks for flagging it - addressing the bugs in the dataset, and the PR will be out soon!

…a Dataset (ShishirPatil#335) This pull request addresses several minor issues found in the evaluation script and system prompts. I also found some errors in one of the dataset as outlined below. ### Changes: 1. Fixed a typo in the system prompt for chat models to ensure clarity in instructions provided to the model. 2. Evaluation Script Updates (`openfunctions_evaluation.py`): - Modified function parameters in `build_handler` to directly pass `temperature`, `top_p`, and `max_tokens` which enhances the readability and maintainability of the code. - Corrected the conditional checks and variable assignments to accurately reflect the intended logic for handling different test categories. ### Dataset Corrections: Two errors were identified in the dataset file `gorilla_openfunctions_v1_test_parallel_multiple_function.json`: 1. **Line 58**: We should change `"required": ["location", "star"]}}` to `"required": ["location", "stars"]}},` to match the expected function parameters. 2. **Line 171**: We should change `"required": ["player_name", "class"]` to `"required": ["player_name", "class_type"]` to align with the parameter names used in functions. I do not know how to update these files so just outline them here.

Fix typos for evaluation and prompt

b9917b7

ShishirPatil approved these changes Apr 11, 2024

View reviewed changes

ShishirPatil merged commit 1668032 into ShishirPatil:main Apr 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Typos in Evaluation Script and System Prompt. Identify Errors in a Dataset #335

Fix Typos in Evaluation Script and System Prompt. Identify Errors in a Dataset #335

zuxin666 commented Apr 10, 2024

HuanzhiMao commented Apr 10, 2024 •

edited

Loading

ShishirPatil left a comment

Fix Typos in Evaluation Script and System Prompt. Identify Errors in a Dataset #335

Fix Typos in Evaluation Script and System Prompt. Identify Errors in a Dataset #335

Conversation

zuxin666 commented Apr 10, 2024

Changes:

Dataset Corrections:

HuanzhiMao commented Apr 10, 2024 • edited Loading

ShishirPatil left a comment

Choose a reason for hiding this comment

HuanzhiMao commented Apr 10, 2024 •

edited

Loading