Skip to content

Conversation

@ArthurZucker
Copy link
Collaborator

This did not work with generation (lm_head needs extra care!) This reverts commit 6dfd561.
cc @SunMarc cc @matej-svejda

This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Member

@SunMarc SunMarc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

SGTM ! Let's make it work later so that we can easily train with deepspeed + tp or fsdp + tp

@ArthurZucker ArthurZucker merged commit 20ce210 into main Aug 5, 2025
25 checks passed
@ArthurZucker ArthurZucker deleted the revert-commit branch August 5, 2025 13:12
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
zaristei pushed a commit to zaristei/transformers that referenced this pull request Sep 9, 2025
…ce#39912)

* Revert "remove dtensors, not explicit (huggingface#39840)"
This did not work with generation (lm_head needs extra care!)
This reverts commit 6dfd561.

* update

* style?
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants