[master] Add new BERT calibration dataset #1171

nvitramble · 2022-06-30T05:42:22Z

The purpose of this change is to:

- Provide an alternative calibration dataset for BERT that defines the calibration examples in terms of qas (question answers) ids in dev-v1.1.json
- Clarify the meaning of the integers in the previous calibration dataset

The new calibration examples are randomly selected from dev-v1.1.json. Defining the examples in terms of qas ids can be more convenient for some PTQ workflows. Other benchmarks (e.g. resnet50) already provide multiple calibration datasets (see here).

github-actions · 2022-06-30T05:42:40Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

nv-ananjappa · 2022-06-30T16:52:40Z

LGTM. @psyhtest Please review.

pgmpablo157321 · 2022-07-01T03:05:38Z

@nvitramble Looks ok! If you would like this change to be included in Inference_v2.1, please also make a PR to the branch r2.1

psyhtest · 2022-07-05T13:40:51Z

> Other benchmarks (e.g. resnet50) already provide multiple calibration datasets

ResNet50 is the only benchmark with two calibration datasets. And it's only a historical accident - NVIDIA and Intel proposed their calibration datasets at the same time, so the grudging consensus was to allow submitters to use either.

> The new calibration examples are randomly selected from dev-v1.1.json.
> Defining the examples in terms of qas ids can be more convenient for some PTQ workflows.

This may well be the case. But why not simply convert the old calibration examples into the new format?

nvitramble · 2022-07-05T20:19:54Z

> This may well be the case. But why not simply convert the old calibration examples into the new format?

As discussed in this morning's WG, there is not a 1:1 mapping from (question, context) pairs in dev-v1.1.json to features since contexts get split when len(context + question) > max_seq_len.
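To illustrate why the mapping is not 1:1, here is a minimal sketch of a sliding-window featurizer. It is not the reference implementation: the function name `split_into_features`, the toy token lists, the small `max_seq_len` and `doc_stride` values, and the three reserved positions for special tokens are all assumptions made for the example.

```python
def split_into_features(question_tokens, context_tokens, max_seq_len, doc_stride):
    """Return (start, length) context spans, one per generated feature.

    Hypothetical helper: assumes 3 positions are reserved for special
    tokens such as [CLS] and two [SEP] markers.
    """
    max_ctx = max_seq_len - len(question_tokens) - 3
    if max_ctx <= 0:
        raise ValueError("question leaves no room for context")
    spans = []
    start = 0
    while start < len(context_tokens):
        length = min(max_ctx, len(context_tokens) - start)
        spans.append((start, length))
        if start + length == len(context_tokens):
            break  # this window reached the end of the context
        start += min(length, doc_stride)
    return spans


# Three toy (question, context) pairs; only the middle context is long.
question = ["tok"] * 10
examples = [
    (question, ["ctx"] * 20),
    (question, ["ctx"] * 100),
    (question, ["ctx"] * 20),
]

# feature_owner[i] is the index of the example that produced feature i.
feature_owner = []
for example_index, (q, c) in enumerate(examples):
    spans = split_into_features(q, c, max_seq_len=64, doc_stride=32)
    feature_owner.extend([example_index] * len(spans))

print(feature_owner)
```

With these toy values the long context is split into several features, so a feature index no longer equals an example index, and a list of feature integers cannot be mapped back to (question, context) pairs without rerunning the same featurizer.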

And to follow up from the WG, is the current calibration dataset used by any submitters? My understanding is that it is unused - for int8, submitters would use the QAT'ed models (which include quantization scales) here and here.

nvitramble · 2022-07-05T20:27:16Z

To explain the motivation more, the current calibration dataset assumes a deterministic mapping from dev-v1.1.json question+context pairs to features. Different implementations of the featurizing function may not maintain the same ordering (e.g. if using huggingface). It is more robust to define the calibration examples in terms of unique ids, since that makes no assumptions about the featurizer (e.g. for ImageNet the calibration set is defined in terms of image file names, for LibriSpeech the calibration set is defined in terms of wav file names, etc.).
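As a rough sketch of the id-based approach, the snippet below looks up calibration examples by qas id in a SQuAD v1.1-style structure (`data` -> `paragraphs` -> `qas`). The helper names, the toy data, and the ids are made up for illustration; a real workflow would load dev-v1.1.json and the list of calibration ids from files instead.

```python
def iter_qas(squad):
    """Yield (qas_id, question, context) for every question in the dataset."""
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            for qa in paragraph["qas"]:
                yield qa["id"], qa["question"], paragraph["context"]


def select_by_qas_id(squad, calibration_ids):
    """Return (qas_id, question, context) tuples in the order of calibration_ids."""
    wanted = set(calibration_ids)
    by_id = {qid: (q, c) for qid, q, c in iter_qas(squad) if qid in wanted}
    missing = wanted.difference(by_id)
    if missing:
        raise KeyError(f"ids not found in dataset: {sorted(missing)}")
    return [(qid, *by_id[qid]) for qid in calibration_ids]


# Toy stand-in for dev-v1.1.json (ids and text are hypothetical).
toy_squad = {
    "data": [
        {
            "title": "Example",
            "paragraphs": [
                {
                    "context": "First context.",
                    "qas": [
                        {"id": "id-a", "question": "Question A?", "answers": []},
                        {"id": "id-b", "question": "Question B?", "answers": []},
                    ],
                },
                {
                    "context": "Second context.",
                    "qas": [
                        {"id": "id-c", "question": "Question C?", "answers": []},
                    ],
                },
            ],
        }
    ]
}

selected = select_by_qas_id(toy_squad, ["id-c", "id-a"])
for qid, question, context in selected:
    print(qid, question, context)
```

Because the lookup depends only on the ids, the selected examples are the same regardless of how a given featurizer orders or splits its output.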

rnaidu02 · 2022-07-12T16:30:52Z

Either one of the calibration sets is allowed.

nvpohanh · 2022-07-19T00:57:37Z

@nvitramble Should we cherry-pick this to r2.1 branch?

nvpohanh · 2022-07-19T01:37:28Z

Nvm, it's already done in #1172

nvitramble force-pushed the dev-itramble-bert-calib branch from 1c336c1 to e53d936 Compare July 1, 2022 00:12

nvitramble mentioned this pull request Jul 1, 2022

[r2.1] Add new BERT calibration dataset #1172

Merged

nvitramble changed the title ~~Add new BERT calibration dataset~~ [master] Add new BERT calibration dataset Jul 1, 2022

Add new BERT calibration dataset

28c0d40

nvitramble force-pushed the dev-itramble-bert-calib branch from e53d936 to 28c0d40 Compare July 6, 2022 02:19

rnaidu02 approved these changes Jul 12, 2022

rnaidu02 merged commit f576d52 into mlcommons:master Jul 12, 2022

github-actions bot locked and limited conversation to collaborators Jul 12, 2022
