[BFCL] Fix Dataset and Possible Answer Issue #557
Hi, thanks for your efforts in supporting such a wonderful benchmark. Recently I found some possible errors in ground truth as follows:
Thanks for pointing this out! Regarding
I see. I agree~
Hey, I found a possible inconsistency in the ground truth. In the simple cases:
{"id": "simple_13", "ground_truth": {"calculate_area_under_curve": {"function": ["x^2", "x**2"], "interval": [[1.0, 3.0]], "method": ["", "trapezoidal"]}}}
{"id": "simple_14", "ground_truth": {"calculate_derivative": {"function": ["3x^2 + 2x - 1", "3*x**2+2*x-1"], "x_value": ["", 0.0]}}}
{"id": "simple_15", "ground_truth": {"integrate": {"function": ["x^3", "x**3"], "start_x": [-2], "end_x": [3], "method": ["simpson"]}}}
{"id": "simple_16", "ground_truth": {"calculus.derivative": {"function": ["2*x^2", "2x^2", "2**x^2"], "value": [1], "function_variable": ["x", ""]}}}
While in parallel_multiple_function_4:
{"id": "parallel_multiple_function_4", "ground_truth": {"integral": {"function": ["x^2", "lambda x : x**2"], "a": [1.0], "b": [5.0]}, "derivative": {"function": ["x^2", "lambda x : x**2"], "x": [3.0]}}}
I think maybe the lambda function form should be included in the simple cases as well?
Fair point. Updated.
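For anyone following along, here is a rough sketch of why the missing lambda form matters. It assumes each parameter in a possible-answer entry maps to a list of accepted values, and that `""` in that list means the parameter may be omitted. `call_matches` is a hypothetical helper written for illustration, not the benchmark's actual checker, which may normalize strings or types differently.

```python
import json


def call_matches(possible_answer: dict, func_name: str, args: dict) -> bool:
    """Return True if a model's call is accepted by a possible-answer entry.

    Hypothetical helper: exact-match comparison only, no normalization.
    """
    if func_name not in possible_answer:
        return False
    allowed = possible_answer[func_name]
    # Reject parameters the ground truth does not list at all.
    if any(name not in allowed for name in args):
        return False
    for name, accepted in allowed.items():
        if name not in args:
            # Assumption: "" in the accepted list marks the parameter as optional.
            if "" not in accepted:
                return False
        elif args[name] not in accepted:
            return False
    return True


# Entry copied from the comment above (simple_13, before the fix).
entry = json.loads(
    '{"id": "simple_13", "ground_truth": {"calculate_area_under_curve": '
    '{"function": ["x^2", "x**2"], "interval": [[1.0, 3.0]], '
    '"method": ["", "trapezoidal"]}}}'
)
gt = entry["ground_truth"]

plain = {"function": "x**2", "interval": [1.0, 3.0]}
lam = {"function": "lambda x : x**2", "interval": [1.0, 3.0]}

print(call_matches(gt, "calculate_area_under_curve", plain))  # True
print(call_matches(gt, "calculate_area_under_curve", lam))    # False
```

Under these assumptions, a model that answers with the lambda form is marked wrong on simple_13 even though the same form is accepted in parallel_multiple_function_4.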
...ley-function-call-leaderboard/data/possible_answer/gorilla_openfunctions_v1_test_simple.json
LGTM
This PR fixes ShishirPatil#550, fixes ShishirPatil#541, and all the issues pointed out by the comments below. We want to thank @lucenzhong and @XuHwang for pointing these out.

Here's a breakdown of the changes:
- simple: 7 entries affected
  - Indices: `13, 14, 15, 16, 200, 285, 375`
- multiple function: 3 entries affected
  - Indices: `29, 33, 99`
- parallel function: 5 entries affected
  - Indices: `26, 71, 72, 73, 89`
- parallel multiple function: 6 entries affected
  - Indices: `4, 19, 80, 83, 132, 195`
- executable parallel function: 1 entry affected
  - Indices: `11`
- javascript: 3 entries affected
  - Indices: `18, 29, 35`

This will affect the leaderboard score. We will update it soon, in a different PR.