Fine-Tuned BERT-base on SQuAD v1. #47

Closed
Maaarcocr opened this issue Nov 20, 2018 · 2 comments

Comments

@Maaarcocr

I have fine-tuned the TF model on SQuAD v1 and I've made the weights available at: https://s3.eu-west-2.amazonaws.com/nlpfiles/squad_bert_base.tgz

I get 88.5 F1 using these weights on the SQuAD dev set (and, if I recall correctly, roughly 82 EM).

I think it would be beneficial to host these weights here, so that people can experiment with SQuAD and BERT without having to fine-tune the model themselves, which requires a reasonably powerful setup. Let me know what you think!
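
For anyone who wants to try the shared checkpoint, here is a minimal fetch-and-unpack sketch. The URL comes from the comment above; the archive's internal layout (bert_config.json, vocab.txt, model.ckpt.*) is an assumption based on the standard Google BERT checkpoint format, not something confirmed in this thread.

```python
# Minimal sketch: download and extract the fine-tuned TF checkpoint.
# The expected contents (bert_config.json, vocab.txt, model.ckpt.*)
# are an assumption based on the usual Google BERT checkpoint layout.
import tarfile
import urllib.request

URL = "https://s3.eu-west-2.amazonaws.com/nlpfiles/squad_bert_base.tgz"

urllib.request.urlretrieve(URL, "squad_bert_base.tgz")
with tarfile.open("squad_bert_base.tgz", "r:gz") as archive:
    archive.extractall("squad_bert_base")
```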

@thomwolf
Member

Thanks for the details.
This PyTorch repo is starting to be used by a larger community, so we would have to be more precise than rough numbers if we want to include such pre-trained weights.
If you want to add your weights to the repo, you should convert them to the PyTorch model format and report evaluation results on SQuAD obtained with the PyTorch model, so that everybody knows exactly what they are using. Otherwise, I think it's better that people do their own training and know the capabilities of the fine-tuned model they are using.
Feel free to come back and re-open the issue if this is something you would like to do.
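
A hedged sketch of the TF-to-PyTorch conversion thomwolf describes, written against the current transformers API rather than the conversion script the 2018-era repo shipped; the checkpoint paths are assumptions matching the extraction sketch above, and TensorFlow must be installed for the TF loading path to work.

```python
# Sketch: load the original TF checkpoint into the PyTorch model class,
# then save PyTorch weights that can be shared and evaluated.
# Requires TensorFlow to be installed; paths are assumptions.
from transformers import BertConfig, BertForQuestionAnswering

config = BertConfig.from_json_file("squad_bert_base/bert_config.json")
model = BertForQuestionAnswering.from_pretrained(
    "squad_bert_base/model.ckpt.index",  # original TF checkpoint index file
    from_tf=True,
    config=config,
)
model.save_pretrained("squad_bert_base_pytorch")  # PyTorch weights for sharing
```

With the weights converted, the PyTorch model can be evaluated on the SQuAD dev set to produce the "clean" numbers thomwolf asks for.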

@wasiahmad

wasiahmad commented Apr 18, 2019

@thomwolf On SQuAD v1.1, BERT (single) scored 85.083 EM and 91.835 F1 as reported in the paper, but when I fine-tuned BERT using run_squad.py I got {"exact_match": 81.0975, "f1": 88.7005}. Why is there a difference? What am I missing?
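
For context: the 85.083 EM / 91.835 F1 entry on the SQuAD v1.1 leaderboard is BERT-large (single model) on the hidden test set, whereas the paper reports about 80.8 EM / 88.5 F1 for BERT-base on the dev set, which is close to the run_squad.py numbers quoted above. Independently of which reference is used, a predictions file can be re-scored consistently; the sketch below mirrors the core EM/F1 logic of the official evaluate-v1.1.py script (a re-implementation for illustration, not the script itself).

```python
# Re-implementation of the SQuAD v1.1 EM/F1 core, mirroring the logic of
# the official evaluate-v1.1.py script, for re-scoring predictions.
import collections
import re
import string


def normalize_answer(s: str) -> str:
    """Lowercase, strip punctuation and articles, collapse whitespace."""
    s = s.lower()
    s = "".join(ch for ch in s if ch not in set(string.punctuation))
    s = re.sub(r"\b(a|an|the)\b", " ", s)
    return " ".join(s.split())


def exact_match_score(prediction: str, ground_truth: str) -> float:
    return float(normalize_answer(prediction) == normalize_answer(ground_truth))


def f1_score(prediction: str, ground_truth: str) -> float:
    pred_tokens = normalize_answer(prediction).split()
    gt_tokens = normalize_answer(ground_truth).split()
    common = collections.Counter(pred_tokens) & collections.Counter(gt_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gt_tokens)
    return 2 * precision * recall / (precision + recall)
```

A remaining gap against the paper often comes down to hyperparameters: the paper fine-tunes with an effective batch size of 32, which on a single 12 GB GPU typically requires gradient accumulation in run_squad.py.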

stevezheng23 added a commit to stevezheng23/transformers that referenced this issue Mar 24, 2020
add mat-coqa runner with multitask + adversarial training support (co…
younesbelkada pushed a commit to younesbelkada/transformers that referenced this issue Nov 30, 2022
xloem pushed a commit to xloem/transformers that referenced this issue Apr 9, 2023
* Update trainer and model flows to accommodate sparseml

Disable FP16 on QAT start (huggingface#12)

* Override LRScheduler when using LRModifiers

* Disable FP16 on QAT start

* keep wrapped scaler object for training after disabling

Using QATMatMul in DistilBERT model class (huggingface#41)

Removed double quantization of output of context layer. (huggingface#45)

Fix DataParallel validation forward signatures (huggingface#47)

* Fix: DataParallel validation forward signatures

* Update: generalize forward_fn selection

Best model after epoch (huggingface#46)

fix scaler check for non fp16 mode in trainer (huggingface#38)

Mobilebert QAT (huggingface#55)

* Remove duplicate quantization of vocabulary.

enable a QATWrapper for non-parameterized matmuls in BERT self attention (huggingface#9)

* Utils and auxiliary changes

update Zoo stub loading for SparseZoo 1.1 refactor (huggingface#54)

add flag to signal NM integration is active (huggingface#32)

Add recipe_name to file names

* Fix errors introduced in manual cherry-pick upgrade

Co-authored-by: Benjamin Fineran <bfineran@users.noreply.github.com>
jameshennessytempus pushed a commit to jameshennessytempus/transformers that referenced this issue Jun 1, 2023
ocavue pushed a commit to ocavue/transformers that referenced this issue Sep 13, 2023
ZYC-ModelCloud pushed a commit to ZYC-ModelCloud/transformers that referenced this issue Nov 14, 2024
* Update model list

* Update README.md

---------

Co-authored-by: Qubitium-modelcloud <qubitium@modelcloud.ai>
ZYC-ModelCloud pushed a commit to ZYC-ModelCloud/transformers that referenced this issue Nov 14, 2024
…ngface#47) (huggingface#49)

* fix cannot pickle 'module' object for 8 bit

* remove unused import

* remove print

* check with tuple

* revert to len check

* add test for 8bit

* set same QuantizeConfig

* check if it's 4 bit

* fix grammar

* remove params

* it's not a list

* set gptqmodel_cuda back

* check is tuple

* format

* set desc_act=True

* set desc_act=True

* format

* format

* Refactor fix

* desc_act=True

---------

Co-authored-by: Qubitium <Qubitium@modelcloud.ai>