Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pass additional info for fix untrained tokens when using distributed + offloading #2388

Merged
merged 7 commits into from
Mar 11, 2025

Conversation

winglian
Copy link
Collaborator

@winglian winglian commented Mar 6, 2025

Description

Fix untrained tokens doesn't quite work properly when using distributed offloading as the embeddings need to be gathered, but there isn't enough information in the model that the function can determine this, so we need to pass this to the function.

https://github.com/axolotl-ai-cloud/axolotl-contribs-lgpl/pulls

@winglian winglian force-pushed the fix-untrained-w-zero3 branch from 6305227 to 310c273 Compare March 7, 2025 14:00
@winglian winglian merged commit 59899b9 into main Mar 11, 2025
16 checks passed
@winglian winglian deleted the fix-untrained-w-zero3 branch March 11, 2025 16:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant