Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Validation #1033

Merged
Merged

Conversation

XiaohanZhangCMU
Copy link
Member

No description provided.

dakinggg and others added 28 commits March 5, 2024 11:06
Red button because CI running jobs it doesn't need. Tests passed on main.
Co-authored-by: Daniel King <43149077+dakinggg@users.noreply.github.com>
… finetuning (mosaicml#985)

The main purpose of this PR is to support training on non-terminal responses in multi-round chats. This is achieved by tokenizing at the level of conversation "turns" and exposing some options for what turns are used as training targets (i.e. generate loss). This also adds support for treating prompt tokens as loss-generating.

The script for converting a finetuning dataset to streaming has also been updated (with some bug fixes).
Co-authored-by: Max Marion <mmarion538@gmail.com>
* Fix typo in monolithic chkpt callback docs

* reorder to match function signature
@XiaohanZhangCMU XiaohanZhangCMU requested a review from a team as a code owner March 14, 2024 20:19
@XiaohanZhangCMU XiaohanZhangCMU merged commit c404dc7 into mosaicml:byod/data_validation Mar 14, 2024
0 of 2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.