@eaftan eaftan commented Sep 27, 2021

No description provided.

pull bot referenced this pull request in mcx/openai-python Oct 10, 2021
* Add CLI option to download files (#34)

* Option to check if file has been uploaded in the past before uploading (#33)

The check is done based on filename, file purpose and file size
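A minimal sketch of how such a check might look, assuming the previously uploaded files are available as simple records. The `RemoteFile` class, its field names, and `find_existing_upload` are hypothetical illustrations, not the library's actual API:

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class RemoteFile:
    # Hypothetical record describing a file that was uploaded earlier.
    id: str
    filename: str
    purpose: str
    size: int  # size in bytes


def find_existing_upload(
    filename: str, purpose: str, size: int, remote_files: Iterable[RemoteFile]
) -> Optional[RemoteFile]:
    """Return the first earlier upload matching filename, purpose and size."""
    for remote in remote_files:
        if (remote.filename, remote.purpose, remote.size) == (filename, purpose, size):
            return remote
    return None
```

Matching on these three fields is cheap but not exact: two different files with the same name, purpose and size would be treated as the same upload.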

* Add fine-tuning hparams directly into the fine-tunes CLI (#35)

* update fine_tunes cli use_packing argument (#38)

* A file verification and remediation tool.

It applies the following validations:
- prints the number of examples, and warns if it's lower than 100
- ensures prompt and completion columns are present
- optionally removes any additional columns
- ensures all completions are non-empty
- infers which type of fine-tuning the data is most likely intended for (classification, conditional generation or open-ended generation)
- optionally removes duplicate rows
- infers the existence of a common suffix, and if there is none, suggests one for classification and conditional generation
- optionally prepends a space to each completion, to improve tokenization
- optionally splits into training and validation set for the classification use case
- optionally ensures there's an ending string for all completions
- optionally lowercases completions or prompts if more than 1/3 of their alphanumeric characters are upper case
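Several of the validations above can be sketched as small functions over a list of examples. This is an illustrative sketch only, not the tool's actual code: all function names are hypothetical, and each example is assumed to be a dict of strings.

```python
from typing import Dict, List, Optional

Example = Dict[str, str]
REQUIRED = ("prompt", "completion")


def count_warning(examples: List[Example], minimum: int = 100) -> Optional[str]:
    """Return a warning message if there are fewer than `minimum` examples."""
    n = len(examples)
    if n < minimum:
        return f"Only {n} examples found; at least {minimum} are recommended."
    return None


def keep_required_columns(examples: List[Example]) -> List[Example]:
    """Ensure prompt and completion are present, then drop any other columns."""
    cleaned = []
    for i, ex in enumerate(examples):
        missing = [col for col in REQUIRED if col not in ex]
        if missing:
            raise ValueError(f"Example {i} is missing columns: {missing}")
        cleaned.append({col: ex[col] for col in REQUIRED})
    return cleaned


def drop_empty_completions(examples: List[Example]) -> List[Example]:
    """Remove examples whose completion is empty or whitespace only."""
    return [ex for ex in examples if ex["completion"].strip()]


def drop_duplicates(examples: List[Example]) -> List[Example]:
    """Remove repeated (prompt, completion) pairs, keeping the first occurrence."""
    seen = set()
    unique = []
    for ex in examples:
        key = (ex["prompt"], ex["completion"])
        if key not in seen:
            seen.add(key)
            unique.append(ex)
    return unique


def prepend_space(examples: List[Example]) -> List[Example]:
    """Prefix each completion with a single space unless it already has one."""
    return [
        {**ex, "completion": ex["completion"] if ex["completion"].startswith(" ")
         else " " + ex["completion"]}
        for ex in examples
    ]


def mostly_upper(texts: List[str]) -> bool:
    """True if more than 1/3 of the alphanumeric characters are upper case."""
    alnum = [ch for text in texts for ch in text if ch.isalnum()]
    if not alnum:
        return False
    return sum(ch.isupper() for ch in alnum) / len(alnum) > 1 / 3
```

Keeping each validation as a separate pure function makes it straightforward to offer each one to the user as an optional step.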

It interactively asks the user to accept or reject each recommendation. Once the user is satisfied, it saves the modified output as a JSONL file, ready to be used for fine-tuning with the printed command.
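The accept-or-reject flow could be sketched roughly as follows. The names `apply_recommendations` and `to_jsonl`, and the `[Y/n]` prompt format, are assumptions made for illustration; the actual tool may prompt and save differently.

```python
import json
from typing import Callable, Dict, List, Tuple

Example = Dict[str, str]
Fix = Callable[[List[Example]], List[Example]]


def apply_recommendations(
    examples: List[Example],
    recommendations: List[Tuple[str, Fix]],
    ask: Callable[[str], str] = input,
) -> List[Example]:
    """Offer each (description, fix) pair to the user and apply the accepted ones."""
    for description, fix in recommendations:
        answer = ask(f"{description} [Y/n]: ").strip().lower()
        if answer in ("", "y", "yes"):
            examples = fix(examples)
    return examples


def to_jsonl(examples: List[Example]) -> str:
    """Serialize examples as JSON Lines: one JSON object per line."""
    return "".join(json.dumps(ex) + "\n" for ex in examples)
```

Passing the prompt function as a parameter (defaulting to `input`) keeps the flow testable without a terminal.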

* Completion: remove  from kwargs before passing to EngineAPI (#37)

* Version bump before pushing to external

Co-authored-by: Todor Markov <todor.m.markov@gmail.com>
Co-authored-by: Boris Power <81998504+BorisPower@users.noreply.github.com>
Co-authored-by: Dave Cummings <dave@openai.com>
@hallacy
Collaborator

hallacy commented Jan 19, 2022

Hey @eaftan, thanks for the PR! Can you tell me a bit more about this PR and what it could be used for?

@hallacy
Collaborator

hallacy commented Jan 26, 2022

I'm going to close this out for now. Feel free to reopen it later if you'd like to discuss this PR.

@hallacy hallacy closed this Jan 26, 2022
cgayapr pushed a commit to cgayapr/openai-python that referenced this pull request Dec 14, 2024
safa0 pushed a commit to safa0/openai-agents-python that referenced this pull request Apr 27, 2025
fix: clarify multi-agent orchestration description

2 participants