Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

reapply multipack patches after model load for trust_remote_code compat #1874

Closed
wants to merge 2 commits into from

Conversation

tmm1
Copy link
Collaborator

@tmm1 tmm1 commented Aug 27, 2024

cc #1873 #1872 #1340 (comment)

note that this was a regression in huggingface/transformers#30370 after which transformers will reload modules causing our patched function to get replaced

@tmm1 tmm1 force-pushed the fix-multipack-remote-code branch from e15bd9a to deb77d4 Compare August 27, 2024 18:12
@tmm1
Copy link
Collaborator Author

tmm1 commented Aug 27, 2024

i think a better solution might be fix transformers so it doesn't wipe away monkeypatches between model loads

otherwise we'll be chasing down and trying to restore a bunch of patches all over. for example, i ran into this again when trying to patch deepseekv2 for liger support

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant