bert + gpt2 inference + training wrt torch 1.3.1 and transformers 2.2.1 #673
Merged
fantes changed the title from "bert training wrt torch 1.3.1" to "bert + gpt2 training wrt torch 1.3.1" on Dec 11, 2019
fantes force-pushed the bert_training branch 2 times, most recently from d981238 to 293abb2, on December 12, 2019 13:45
fantes changed the title from "bert + gpt2 training wrt torch 1.3.1" to "bert + gpt2 training wrt torch 1.3.1 and transformers 2.2.1" on Dec 12, 2019
fantes force-pushed the bert_training branch 2 times, most recently from 3e08630 to 4acd30f, on December 12, 2019 16:13
fantes changed the title from "bert + gpt2 training wrt torch 1.3.1 and transformers 2.2.1" to "bert + gpt2 inference + training wrt torch 1.3.1 and transformers 2.2.1" on Dec 12, 2019
author Louis J <ljean@etud.insa-toulouse.fr> 1563984477 +0200
committer Guillaume Infantes <guillaume.infantes@jolibrain.com> 1576060297 +0100
parent 7eb6443
author Louis J <ljean@etud.insa-toulouse.fr> 1563984477 +0200
committer Guillaume Infantes <guillaume.infantes@jolibrain.com> 1576059845 +0100

LOUISJ'S COMMITS:
Move dataset management and model building in separate classes
Add train and test
The fix on txtinputconnector is temporary, vocab generation should be fixed a more robust way
BERT finetuning with custom number of classes
Add self supervised Masked LM learning
Save solver checkpoint along with model
Ensure label is of correct dimension
Fix masked_lm, add more explicit error message
Add script to trace huggingface models
Add classification on hidden states to be able to use masked lm model for classif
Better API, more features, less memory usage and fix bugs
Add unit tests for training
Move training parameters to solver and net
Add comments
Download tar from deepdetect.com
torch 1.3.1 alone working with caffe patch
correction: add pcaffe/logging.h
force -j8 when building libtorch (default is -j nproc)
points to model traced for torch 131

GUILLAUME COMMITS:
changes for torch 131
Move dataset management and model building in separate classes
Add train and test
The fix on txtinputconnector is temporary, vocab generation should be fixed a more robust way
BERT finetuning with custom number of classes
Add self supervised Masked LM learning
Save solver checkpoint along with model
Ensure label is of correct dimension
Better API, more features, less memory usage and fix bugs
Move training parameters to solver and net
Add comments
Add inference support for GPT2
Make lower case optional
Add gpt2 training
Add gpt2 demo
rebase
all glitches in merge
update to last transformers from huggingface
gpt2 inference ok
sanitize width vs sequence
remove comment in cmakelist
fantes force-pushed the bert_training branch from 4acd30f to fb2e2d4 on December 12, 2019 16:40
BERT EXAMPLES
<!> When tracing models, use PyTorch 1.3.1 and the latest transformers (formerly pytorch-transformers).
Added parameters
Example: Finetune a classification model
Example: Finetune language model
GPT 2 EXAMPLES
Example: trace gpt2 and run it with the demo
Run dede, then
Example: call gpt2 inference with curl
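The original curl command is not reproduced on this page. As a rough stand-in, the sketch below builds the same kind of HTTP call in Python: a POST to the `/predict` resource of a running dede server, with a JSON body naming the service and carrying the prompt in `data`. The service name `gpt2_demo`, the host and port, and the empty `parameters` sections are assumptions for illustration only; the actual gpt2 options are those described in #644.

```python
import json
import urllib.request


def build_predict_request(service, prompt, host="localhost", port=8080):
    """Build (but do not send) a POST /predict request for a dede server.

    The service name, host and port are placeholders; the empty parameter
    sections stand in for whatever options the gpt2 service actually needs.
    """
    body = {
        "service": service,
        "parameters": {"input": {}, "mllib": {}, "output": {}},
        "data": [prompt],
    }
    return urllib.request.Request(
        url=f"http://{host}:{port}/predict",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    req = build_predict_request("gpt2_demo", "Why did the chicken cross the road?")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To send it against a running dede instance (assumed to be listening):
    # with urllib.request.urlopen(req) as resp:
    #     print(json.load(resp))
```

Building the request separately from sending it makes it easy to inspect the JSON body first, or to paste it into a `curl -X POST ... -d '<body>'` call instead.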
Example: finetuning gpt2
<!> Test takes a lot of time and memory -- requires f1 sparse
Tested OK on unit tests and examples from louijie.
Includes:
doc for bert training: #637 [edited for new versions of torch and transformers]
doc for gpt2 training: #644