[tune](deps): Bump transformers from 4.3.2 to 4.4.1 in /python/requirements #3

Conversation


dependabot[bot] commented on behalf of GitHub on Mar 18, 2021

Bumps transformers from 4.3.2 to 4.4.1.

Release notes

Sourced from transformers' releases.

v4.4.0: S2T, M2M100, I-BERT, mBART-50, DeBERTa-v2, XLSR-Wav2Vec2

SpeechToText

Two new models are released as part of the S2T implementation: Speech2TextModel and Speech2TextForConditionalGeneration, in PyTorch.

Speech2Text is a speech model that accepts a float tensor of log-mel filter-bank features extracted from the speech signal. It’s a transformer-based seq2seq model, so the transcripts/translations are generated autoregressively.

The Speech2Text model was proposed in fairseq S2T: Fast Speech-to-Text Modeling with fairseq by Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Pino.

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=speech_to_text
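
For readers new to the API, here is a minimal transcription sketch (an illustration added here, not part of the release notes). The `facebook/s2t-small-librispeech-asr` checkpoint name is an assumption based on the filter above, and the silent dummy waveform is a placeholder for real 16 kHz mono audio:

```python
import numpy as np
from transformers import Speech2TextProcessor, Speech2TextForConditionalGeneration

# Checkpoint name is an assumption; any checkpoint from the
# speech_to_text filter above should work the same way.
checkpoint = "facebook/s2t-small-librispeech-asr"
processor = Speech2TextProcessor.from_pretrained(checkpoint)
model = Speech2TextForConditionalGeneration.from_pretrained(checkpoint)

# Placeholder input: one second of silent 16 kHz mono audio.
waveform = np.zeros(16000, dtype=np.float32)

# The processor extracts the log-mel filter-bank features the model expects.
inputs = processor(waveform, sampling_rate=16000, return_tensors="pt")

# Transcripts are generated autoregressively by the seq2seq decoder.
generated_ids = model.generate(inputs["input_features"], attention_mask=inputs["attention_mask"])
print(processor.batch_decode(generated_ids, skip_special_tokens=True))
```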

M2M100

Two new models are released as part of the M2M100 implementation: M2M100Model and M2M100ForConditionalGeneration, in PyTorch.

M2M100 is a multilingual encoder-decoder (seq-to-seq) model primarily intended for translation tasks.

The M2M100 model was proposed in Beyond English-Centric Multilingual Machine Translation by Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin.

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=m2m_100
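
A minimal translation sketch (illustration only; the `facebook/m2m100_418M` checkpoint name is an assumption based on the filter above):

```python
from transformers import M2M100ForConditionalGeneration, M2M100Tokenizer

# Checkpoint name is an assumption; see the m2m_100 filter above.
checkpoint = "facebook/m2m100_418M"
tokenizer = M2M100Tokenizer.from_pretrained(checkpoint)
model = M2M100ForConditionalGeneration.from_pretrained(checkpoint)

# Tell the tokenizer what the source language is.
tokenizer.src_lang = "en"
encoded = tokenizer("Life is like a box of chocolates.", return_tensors="pt")

# Force the decoder to start with the target-language token (French here).
generated = model.generate(**encoded, forced_bos_token_id=tokenizer.get_lang_id("fr"))
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```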

I-BERT

Six new models are released as part of the I-BERT implementation: IBertModel, IBertForMaskedLM, IBertForSequenceClassification, IBertForMultipleChoice, IBertForTokenClassification and IBertForQuestionAnswering, in PyTorch.

I-BERT is a quantized version of RoBERTa running inference up to four times faster.

The I-BERT framework in PyTorch lets you identify the best parameters for quantization. Once the model is exported to a framework that supports int8 execution (such as TensorRT), a speedup of up to 4x is visible with no loss in performance, thanks to the parameter search.

The I-BERT model was proposed in I-BERT: Integer-only BERT Quantization by Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W. Mahoney and Kurt Keutzer.

Compatible checkpoints can be found on the Hub: https://huggingface.co/models?filter=ibert
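
A minimal loading sketch (illustration only; the `kssteven/ibert-roberta-base` checkpoint name is an assumption based on the filter above):

```python
from transformers import AutoTokenizer, IBertModel

# Checkpoint name is an assumption; see the ibert filter above.
checkpoint = "kssteven/ibert-roberta-base"
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = IBertModel.from_pretrained(checkpoint)

# I-BERT is used like RoBERTa; the integer-only path is controlled
# by the `quant_mode` flag on its config.
inputs = tokenizer("I-BERT is a quantized version of RoBERTa.", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)
```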

mBART-50

mBART-50 is created from the original mbart-large-cc25 checkpoint by extending its embedding layers with randomly initialized vectors for an extra set of 25 language tokens; it is then pretrained on 50 languages.

The MBart model was presented in Multilingual Translation with Extensible Multilingual Pretraining and Finetuning by Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan.
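
A minimal many-to-many translation sketch (illustration only; the `facebook/mbart-large-50-many-to-many-mmt` checkpoint name is an assumption, a fine-tuned variant rather than the base pretrained model):

```python
from transformers import MBartForConditionalGeneration, MBart50TokenizerFast

# Checkpoint name is an assumption: a fine-tuned many-to-many variant.
checkpoint = "facebook/mbart-large-50-many-to-many-mmt"
tokenizer = MBart50TokenizerFast.from_pretrained(checkpoint)
model = MBartForConditionalGeneration.from_pretrained(checkpoint)

# mBART-50 identifies languages with special tokens such as en_XX / fr_XX.
tokenizer.src_lang = "en_XX"
encoded = tokenizer("The weather is nice today.", return_tensors="pt")

# Force the decoder to start with the target-language token.
generated = model.generate(**encoded, forced_bos_token_id=tokenizer.lang_code_to_id["fr_XX"])
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```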

... (truncated)

Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

dependabot[bot] added the `dependencies` label (Pull requests that update a dependency file) on Mar 18, 2021

dependabot[bot] commented on behalf of GitHub on Mar 20, 2021

Superseded by #5.

dependabot[bot] closed this on Mar 20, 2021
dependabot[bot] deleted the dependabot/pip/python/requirements/transformers-4.4.1 branch on Mar 20, 2021 at 07:03
vakker pushed a commit that referenced this pull request Apr 26, 2022
…ray-project#23821)

This PR refactors `LazyBlockList` in service of out-of-band serialization (see [mono-PR](ray-project#22616)) and is a precursor to an execution plan refactor (PR #2) and adding the actual out-of-band serialization APIs (PR #3). The following is included in this refactor:
1. `ReadTask`s are now a first-class concept, replacing calls;
2. read stage progress tracking is consolidated into `LazyBlockList._get_blocks_with_metadata()`, and more of the read task complexity (e.g. the read remote function) was pushed into `LazyBlockList` to make `ray.data.read_datasource()` simpler;
3. we are smarter about how we progressively launch tasks and fetch and cache metadata: the metadata for read tasks is now fetched in `.iter_blocks_with_metadata()` instead of relying on the (less accurate) pre-read task metadata, and some small bugs in the lazy ramp-up around progressive metadata fetching are fixed.

(1) is the most important item for supporting out-of-band serialization and fundamentally changes the `LazyBlockList` data model. This is required since we need to be able to reference the underlying read tasks when rewriting read stages during optimization and when serializing the lineage of the Dataset. See the [mono-PR](ray-project#22616) for more context.
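
To make the "first-class read task" idea concrete, here is a self-contained sketch (hypothetical classes, not Ray's actual `ReadTask`/`BlockMetadata` implementation): a read task is a plain, serializable object pairing a read function with a metadata estimate, so stages can be rewritten or re-executed before anything is launched.

```python
from dataclasses import dataclass
from typing import Any, Callable, List

@dataclass
class BlockMetadata:
    # Hypothetical stand-in for per-block metadata.
    num_rows: int
    size_bytes: int

@dataclass
class ReadTask:
    # Pairing the read function with its metadata estimate makes the task
    # a first-class value: it can be referenced when rewriting read stages
    # or serialized as part of a dataset's lineage, then executed later.
    read_fn: Callable[[], List[Any]]
    metadata: BlockMetadata

def make_range_read_tasks(n: int, parallelism: int) -> List[ReadTask]:
    chunk = n // parallelism
    tasks = []
    for i in range(parallelism):
        start = i * chunk
        stop = n if i == parallelism - 1 else (i + 1) * chunk
        tasks.append(ReadTask(
            read_fn=lambda s=start, e=stop: [list(range(s, e))],
            metadata=BlockMetadata(num_rows=stop - start, size_bytes=8 * (stop - start)),
        ))
    return tasks

tasks = make_range_read_tasks(100, parallelism=4)
# Tasks can be launched lazily and progressively; here we run them eagerly.
blocks = [block for task in tasks for block in task.read_fn()]
print(len(blocks), tasks[0].metadata)
```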

Other changes:
1. Changed the stats actor to a global named actor singleton to obviate the need to serialize the actor handle with the Dataset stats; without this, we were encountering serialization failures (a sketch of the named-actor pattern follows below).
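
A minimal sketch of the named-actor-singleton pattern referenced in item 1 (hypothetical class and actor names; not the actual Ray Data stats actor): any worker can look the actor up by name instead of carrying a serialized handle around.

```python
import ray

ray.init()

@ray.remote
class StatsActor:
    # Hypothetical stand-in for the Dataset stats actor described above.
    def __init__(self):
        self.stats = {}

    def record(self, dataset_id, value):
        self.stats[dataset_id] = value

    def get(self, dataset_id):
        return self.stats.get(dataset_id)

def get_or_create_stats_actor():
    # `get_if_exists=True` returns the existing named actor or creates it,
    # so callers never need to serialize and pass an actor handle.
    return StatsActor.options(
        name="datasets_stats_actor",  # hypothetical name
        get_if_exists=True,
        lifetime="detached",
    ).remote()

actor = get_or_create_stats_actor()
ray.get(actor.record.remote("ds-1", {"num_blocks": 8}))
print(ray.get(actor.get.remote("ds-1")))
```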