Adding code run failure handling process & Update to align with the newest OpenAI package formats #34
Description
Adding code run failure handling process:
This is done by querying the API again with the exception message and the failing reward function code.
A new max_retries parameter in config.yaml limits the number of attempts to fix buggy reward code and prevents an infinite loop.
With this change, the code now generates at least one valid reward function and, in most cases, all num_samples reward functions are executable.
Update to align with the newest OpenAI package formats:
According to openai/openai-python#742, OpenAI has released a new major version of its SDK and recommends upgrading promptly.
I reimplemented the queries and the response handling to align with the new requirements.
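For illustration, a sketch of what a query looks like in the 1.x SDK style, assuming a client created with `from openai import OpenAI; client = OpenAI()`. The helper name query_reward_samples and its parameters are hypothetical and not taken from this PR. In the 1.x SDK the call is a method on a client object and the response is read through attributes rather than dictionary keys.

```python
from typing import Any, Dict, List


def query_reward_samples(
    client: Any,
    model: str,
    messages: List[Dict[str, str]],
    num_samples: int,
    temperature: float = 1.0,
) -> List[str]:
    """Request `num_samples` completions and return their text contents.

    `client` is expected to behave like an `openai.OpenAI` instance
    (1.x SDK), which replaces the module-level
    `openai.ChatCompletion.create(...)` call used before 1.0.
    """
    response = client.chat.completions.create(
        model=model,
        messages=messages,
        n=num_samples,
        temperature=temperature,
    )
    # 1.x responses are typed objects: use attribute access instead of
    # response["choices"][i]["message"]["content"].
    return [choice.message.content for choice in response.choices]
```

Passing the client in as an argument keeps the helper easy to exercise with a fake client in tests, without a network call or an API key.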
Type of change
Related Issues
#4
#4 (comment)
Specific changes
Changed the query method in eureka/eureka.py
Modularized the reward code processing by encapsulating it in a function for reuse
Changed eureka/cfg/config.yaml by adding the max_retries parameter and updating the model to gpt-4-1103-preview
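As a rough idea of what pulling the reward code processing into one reusable function can look like, here is a hypothetical helper that extracts the first fenced code block from a model response. The name extract_reward_code and the assumption that the response wraps code in a triple-backtick fence are illustrative only and may differ from what eureka/eureka.py does.

```python
import re
from typing import Optional

# Build the fence marker programmatically so this example can itself be
# shown inside a fenced block.
FENCE = "`" * 3

# Match an optional "python" tag after the opening fence, then capture
# everything up to the next closing fence.
_CODE_BLOCK = re.compile(FENCE + r"(?:python)?\s*\n(.*?)" + FENCE, re.DOTALL)


def extract_reward_code(response_text: str) -> Optional[str]:
    """Return the first fenced code block in `response_text`, or None."""
    match = _CODE_BLOCK.search(response_text)
    if match is None:
        return None
    return match.group(1).strip()
```

With a single helper like this, the same extraction step can serve both the initial response and every repaired response returned during retries.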
Screenshots
Example of a failed run and the automatic code fix: