-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add to_static for electra #7575
Conversation
Thanks for your contribution! |
Codecov ReportAttention:
Additional details and impacted files@@ Coverage Diff @@
## develop #7575 +/- ##
===========================================
- Coverage 58.23% 57.85% -0.38%
===========================================
Files 579 582 +3
Lines 85819 86480 +661
===========================================
+ Hits 49973 50032 +59
- Misses 35846 36448 +602 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里只有 electra 一个模型支持了动转静吗?还是说这个是所有模型的训练入口?
这里只有 electra 的,打算一个模型提交一个 PR ,这样可以吗? |
可以的 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR types
Others
PR changes
Others
Description
为 electra 模型接入动转静
训练环境
训练结果
动态图
运行命令
python -u ./run_pretrain.py \ --model_type electra \ --model_name_or_path electra-small \ --input_dir $DATA_DIR \ --output_dir ./pretrain_model/ \ --train_batch_size 8 \ --learning_rate 5e-4 \ --max_seq_length 128 \ --weight_decay 1e-2 \ --adam_epsilon 1e-6 \ --warmup_steps 10000 \ --num_train_epochs 4 \ --logging_steps 100 \ --save_steps 10000 \ --max_steps -1 \ --device gpu
运行结果
静态图 SOT模式
运行命令
运行结果
静态图 AST模式
运行命令
运行结果