-
Notifications
You must be signed in to change notification settings - Fork 531
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
log details to metadata for run analytics #992
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
More of an ask for foundry team: Is there anything else that would be useful for analytics?
Should we make it configurable to turn this on or off? Or is presence of MosaicMLLogger enough (users could always turn it off via MOSAICML_PLATFORM env var)
…d conver to lowercase
…l/llm-foundry into angel/log-data-for-run-analytics
…-data-for-run-analytics
This reverts commit 43be314.
…l/llm-foundry into angel/log-data-for-run-analytics
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you include evidence of this working in the PR description? Some manual tests and screenshots would be good.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🚀 LGTM! Just one super super tiny formatting comment :) Thanks Angel! Will be great to have more logging and data 😄
…l/llm-foundry into angel/log-data-for-run-analytics
This will log information via the
MosaicMLLogger
to place the following keys in a run's metadata for analytics purposes. The data to log includesmodel_name
:string
script
: 'Training', 'Eval'train_task_type
:PRETRAIN
,CONTINUED_PRETRAIN
,INSTRUCTION_FINETUNE
train_loader_name
:string
train_dataset_hf_name
:string
eval_loader_name
:string
eval_dataset_hf_name
:string
tokenizer_name
:string
n_heads
:number
d_model
:int
callbacks
:string[]
train_loader_workers
:int
eval_loader_workers
:int
gauntlet_configured
:boolean
icl_configured
:boolean
Screenshots
Using the
Quickstart
exampleUsing the
gpt2-small
example