pass additional info for fix untrained tokens when using distributed + offloading #2388

winglian · 2025-03-06T16:53:51Z

Description

Fix untrained tokens doesn't quite work properly when using distributed offloading as the embeddings need to be gathered, but there isn't enough information in the model that the function can determine this, so we need to pass this to the function.

https://github.com/axolotl-ai-cloud/axolotl-contribs-lgpl/pulls

…+ offloading

winglian mentioned this pull request Mar 6, 2025

handle distributed embeddings axolotl-ai-cloud/axolotl-contribs-lgpl#4

Merged

pass additional info for fix untrained tokens when using distributed …

310c273

…+ offloading

winglian force-pushed the fix-untrained-w-zero3 branch from 6305227 to 310c273 Compare March 7, 2025 14:00

winglian added 6 commits March 7, 2025 11:15

use latest version of vendored lib

c31fe9b

use v0.0.5 of contribs lgpl

5f7fe93

fix for no bad tokens and add tests

d70e0fa

use release

814604b

add multigpu test too

7873c05

make sure the multigpu zero3 test actually uses zero3

3900ce8

winglian merged commit 59899b9 into main Mar 11, 2025
16 checks passed

winglian deleted the fix-untrained-w-zero3 branch March 11, 2025 16:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pass additional info for fix untrained tokens when using distributed + offloading #2388

pass additional info for fix untrained tokens when using distributed + offloading #2388

winglian commented Mar 6, 2025 •

edited

Loading

pass additional info for fix untrained tokens when using distributed + offloading #2388

pass additional info for fix untrained tokens when using distributed + offloading #2388

Conversation

winglian commented Mar 6, 2025 • edited Loading

Description

winglian commented Mar 6, 2025 •

edited

Loading