You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Conversion fails when using layers_per_step together with input_format=fast_llm
example job: 7ada4a96-4b5d-43de-a156-ebea5f359a33
Global counter mismatch for parameter "layers.8.norm_1.weight" and shard "weights": 0 != 2048
[...]
Global counter mismatch for parameter "layers.17.output_weights" and shard "weights": 0 != 268435456
🔄 Steps to Reproduce
Convert a model exported in fast_llm format, using the layers_per_step argument