Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix Typos in Evaluation Script and System Prompt. Identify Errors in a Dataset #335

Merged
merged 1 commit into from
Apr 11, 2024

Conversation

zuxin666
Copy link
Contributor

This pull request addresses several minor issues found in the evaluation script and system prompts.
I also found some errors in one of the dataset as outlined below.

Changes:

  1. Fixed a typo in the system prompt for chat models to ensure clarity in instructions provided to the model.
  2. Evaluation Script Updates (openfunctions_evaluation.py):
    • Modified function parameters in build_handler to directly pass temperature, top_p, and max_tokens which enhances the readability and maintainability of the code.
    • Corrected the conditional checks and variable assignments to accurately reflect the intended logic for handling different test categories.

Dataset Corrections:

Two errors were identified in the dataset file gorilla_openfunctions_v1_test_parallel_multiple_function.json:

  1. Line 58: We should change "required": ["location", "star"]}} to "required": ["location", "stars"]}}, to match the expected function parameters.
  2. Line 171: We should change "required": ["player_name", "class"] to "required": ["player_name", "class_type"] to align with the parameter names used in functions.

I do not know how to update these files so just outline them here.

@HuanzhiMao
Copy link
Collaborator

HuanzhiMao commented Apr 10, 2024

Hi @zuxin666 ,
Thanks for your attention and pointing this out! We are addressing the dataset correction issue right now on our end. Stay tuned for a PR very soon that fixes it.

Copy link
Owner

@ShishirPatil ShishirPatil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR, @zuxin666! Thanks for flagging it - addressing the bugs in the dataset, and the PR will be out soon!

@ShishirPatil ShishirPatil merged commit 1668032 into ShishirPatil:main Apr 11, 2024
devanshamin pushed a commit to devanshamin/gorilla that referenced this pull request Jul 9, 2024
…a Dataset (ShishirPatil#335)

This pull request addresses several minor issues found in the evaluation
script and system prompts.
I also found some errors in one of the dataset as outlined below.

### Changes:
1. Fixed a typo in the system prompt for chat models to ensure clarity
in instructions provided to the model.
2. Evaluation Script Updates (`openfunctions_evaluation.py`):
- Modified function parameters in `build_handler` to directly pass
`temperature`, `top_p`, and `max_tokens` which enhances the readability
and maintainability of the code.
- Corrected the conditional checks and variable assignments to
accurately reflect the intended logic for handling different test
categories.

### Dataset Corrections:
Two errors were identified in the dataset file
`gorilla_openfunctions_v1_test_parallel_multiple_function.json`:
1. **Line 58**: We should change `"required": ["location", "star"]}}` to
`"required": ["location", "stars"]}},` to match the expected function
parameters.
2. **Line 171**: We should change `"required": ["player_name", "class"]`
to `"required": ["player_name", "class_type"]` to align with the
parameter names used in functions.

I do not know how to update these files so just outline them here.
aw632 pushed a commit to vinaybagade/gorilla that referenced this pull request Aug 22, 2024
…a Dataset (ShishirPatil#335)

This pull request addresses several minor issues found in the evaluation
script and system prompts.
I also found some errors in one of the dataset as outlined below.

### Changes:
1. Fixed a typo in the system prompt for chat models to ensure clarity
in instructions provided to the model.
2. Evaluation Script Updates (`openfunctions_evaluation.py`):
- Modified function parameters in `build_handler` to directly pass
`temperature`, `top_p`, and `max_tokens` which enhances the readability
and maintainability of the code.
- Corrected the conditional checks and variable assignments to
accurately reflect the intended logic for handling different test
categories.

### Dataset Corrections:
Two errors were identified in the dataset file
`gorilla_openfunctions_v1_test_parallel_multiple_function.json`:
1. **Line 58**: We should change `"required": ["location", "star"]}}` to
`"required": ["location", "stars"]}},` to match the expected function
parameters.
2. **Line 171**: We should change `"required": ["player_name", "class"]`
to `"required": ["player_name", "class_type"]` to align with the
parameter names used in functions.

I do not know how to update these files so just outline them here.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants