Skip to content

Issues: mosaicml/llm-foundry

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Failing to build in Docker bug Something isn't working
#1678 opened Nov 28, 2024 by swood
MI300X Compatibility bug Something isn't working
#1568 opened Oct 3, 2024 by nikhil-tensorwave
Triviaqa metrics wrong! bug Something isn't working
#1557 opened Sep 28, 2024 by lqniunjunlper
Fine-tuning error in conda environment without docker image bug Something isn't working
#1538 opened Sep 21, 2024 by LalchandPandia
When Finetuning Llama3, Error occurs bug Something isn't working
#1508 opened Sep 2, 2024 by AndrewHYC
Allow multiprocessing when preparing ICL dataset enhancement New feature or request
#1276 opened Jun 13, 2024 by sanjari-orb
LLaMA PRO training resume problem question Further information is requested
#1231 opened May 23, 2024 by germanjke
Conversion Sharded -> Monolithic checkpoint question Further information is requested
#1220 opened May 17, 2024 by pretidav
Train with attention mask enhancement New feature or request
#1183 opened May 8, 2024 by germanjke
Fine-tune dbrx-instruct on a single VM with 8 H100s question Further information is requested
#1105 opened Apr 10, 2024 by hosseinsarshar
Composer crashes when attempting to load sharded checkpoint bug Something isn't working
#998 opened Feb 27, 2024 by growlix
Any plan for supporting DPO? enhancement New feature or request
#846 opened Jan 8, 2024 by lorabit110
Converting a composer seq2seq t5 model throws an exception bug Something isn't working
#754 opened Nov 21, 2023 by timsteuer
Benchmarking GLUE tasks for in-context learning question Further information is requested
#707 opened Oct 31, 2023 by ashim95
mosaicml-turbo: Where to find the repo? question Further information is requested
#565 opened Aug 29, 2023 by agarvic
Finetuning Models bug Something isn't working
#562 opened Aug 27, 2023 by ak2028
[Bug] Different batch_size return different evaluating result bug Something isn't working
#541 opened Aug 21, 2023 by SingL3
ProTip! Exclude everything labeled bug with -label:bug.