Update base to transformers v4.30.2 #81

bfineran · 2023-06-15T19:52:59Z

rebases NM changes from base version v4.23.1 to v4.30.2

* Update trainer and model flows to accommodate sparseml Disable FP16 on QAT start (#12) * Override LRScheduler when using LRModifiers * Disable FP16 on QAT start * keep wrapped scaler object for training after disabling Using QATMatMul in DistilBERT model class (#41) Removed double quantization of output of context layer. (#45) Fix DataParallel validation forward signatures (#47) * Fix: DataParallel validation forward signatures * Update: generalize forward_fn selection Best model after epoch (#46) fix sclaer check for non fp16 mode in trainer (#38) Mobilebert QAT (#55) * Remove duplicate quantization of vocabulary. enable a QATWrapper for non-parameterized matmuls in BERT self attention (#9) * Utils and auxillary changes update Zoo stub loading for SparseZoo 1.1 refactor (#54) add flag to signal NM integration is active (#32) Add recipe_name to file names * Fix errors introduced in manual cherry-pick upgrade Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>

* add GHA workflow files to build nightly and release packages * fix name --------- Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

dbogunowicz · 2023-06-16T12:47:35Z

Tested with the current main of deepsparse

make test green
testing the commands from https://github.com/neuralmagic/deepsparse/blob/main/src/deepsparse/transformers/README.md

KSGulin · 2023-06-16T13:14:40Z

Tested with:

Sparse transfer learn commands from Add accelerate package dep for transformers sparseml#1633
make testinteg TARGETS=transformers

* Add recipe_name to default file names * Upgrade to transformers release V4.30.2 (#62) * Update trainer and model flows to accommodate sparseml Disable FP16 on QAT start (#12) * Override LRScheduler when using LRModifiers * Disable FP16 on QAT start * keep wrapped scaler object for training after disabling Using QATMatMul in DistilBERT model class (#41) Removed double quantization of output of context layer. (#45) Fix DataParallel validation forward signatures (#47) * Fix: DataParallel validation forward signatures * Update: generalize forward_fn selection Best model after epoch (#46) fix sclaer check for non fp16 mode in trainer (#38) Mobilebert QAT (#55) * Remove duplicate quantization of vocabulary. enable a QATWrapper for non-parameterized matmuls in BERT self attention (#9) * Utils and auxillary changes update Zoo stub loading for SparseZoo 1.1 refactor (#54) add flag to signal NM integration is active (#32) Add recipe_name to file names * Fix errors introduced in manual cherry-pick upgrade Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com> * update build versions for NM fork pypi push (#74) * fix nightly package name (#75) * add make build command (#76) * add GHA workflow files to build nightly and release packages (#77) * add GHA workflow files to build nightly and release packages * fix name --------- Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local> * bump up version to 1.6.0 (#79) Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local> --------- Co-authored-by: Konstantin <konstantin@neuralmagic.com> Co-authored-by: Konstantin Gulin <66528950+KSGulin@users.noreply.github.com> Co-authored-by: dhuangnm <74931910+dhuangnm@users.noreply.github.com> Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

KSGulin and others added 7 commits June 13, 2023 16:50

Add recipe_name to default file names

ccfa243

update build versions for NM fork pypi push (#74)

f767a5f

fix nightly package name (#75)

e7dca16

add make build command (#76)

61c3aae

add GHA workflow files to build nightly and release packages (#77)

790a385

* add GHA workflow files to build nightly and release packages * fix name --------- Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

bump up version to 1.6.0 (#79)

4054a5b

Co-authored-by: dhuang <dhuang@MacBook-Pro-2.local>

bfineran requested review from rahul-tuli, KSGulin and dbogunowicz June 15, 2023 19:52

bfineran self-assigned this Jun 15, 2023

KSGulin changed the base branch from main to upstream-v4.30.2-release-copy June 16, 2023 11:49

KSGulin mentioned this pull request Jun 16, 2023

Add accelerate package dep for transformers neuralmagic/sparseml#1633

Merged

KSGulin approved these changes Jun 16, 2023

View reviewed changes

dbogunowicz approved these changes Jun 16, 2023

View reviewed changes

KSGulin merged commit 0798c9e into upstream-v4.30.2-release-copy Jun 19, 2023

dbogunowicz deleted the rebase-upstream-4.30.2 branch December 5, 2023 10:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update base to transformers v4.30.2 #81

Update base to transformers v4.30.2 #81

bfineran commented Jun 15, 2023

dbogunowicz commented Jun 16, 2023 •

edited

Loading

KSGulin commented Jun 16, 2023 •

edited

Loading

Update base to transformers v4.30.2 #81

Update base to transformers v4.30.2 #81

Conversation

bfineran commented Jun 15, 2023

dbogunowicz commented Jun 16, 2023 • edited Loading

KSGulin commented Jun 16, 2023 • edited Loading

dbogunowicz commented Jun 16, 2023 •

edited

Loading

KSGulin commented Jun 16, 2023 •

edited

Loading