
Convert T5x models to PyTorch #15464

Closed
peregilk opened this issue Feb 1, 2022 · 27 comments

@peregilk
Contributor

peregilk commented Feb 1, 2022

🚀 Feature request

Google's new Flax implementation of T5, called T5X, creates models/checkpoints in a custom format.

The config is stored in .gin files, and the current T5 conversion scripts, such as the ByT5 conversion script, do not work with them.

Would it be possible to create a script for converting the T5X checkpoints/models?

@patrickvonplaten
@anton-l

@github-actions

github-actions bot commented Mar 4, 2022

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@patrickvonplaten
Contributor

I think @stefan-it has a working script :-)

@github-actions

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

@github-actions github-actions bot closed this as completed Apr 7, 2022
@stefan-it stefan-it reopened this Apr 7, 2022
@stefan-it stefan-it self-assigned this Apr 7, 2022
@dirkgr
Contributor

dirkgr commented May 11, 2022

@stefan-it, can you share that script?

@stefan-it
Collaborator

Hi @dirkgr, the script was merged into the current main branch of Transformers in #16853 and is available here:

https://github.com/huggingface/transformers/blob/main/src/transformers/models/t5/convert_t5x_checkpoint_to_flax.py :)

@StephennFernandes

@stefan-it , hey, could you please tell me how exactly the conversion script works?

I tried running the conversion script, and it seems the config file in T5X is in .gin format while the script expects a config file in .json format.

Because of this I'm stuck converting my T5X model to HF.

Could you please show me how it's done and provide some details?

@stefan-it
Collaborator

Hi @StephennFernandes , could you please try the steps mentioned in the corresponding PR:

#16853 (comment)

The config file needs to be in JSON format, yes :)
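For anyone hitting the same .gin vs .json mismatch, here is a minimal sketch of writing such a JSON config by hand. The hyperparameter values below are assumptions based on the public t5-v1_1-base config; adjust them to match your own model.

```python
import json

# Hyperparameter values are assumptions matching the public t5-v1_1-base
# config; change them to whatever your .gin file describes.
config = {
    "architectures": ["T5ForConditionalGeneration"],
    "model_type": "t5",
    "d_model": 768,
    "d_ff": 2048,
    "d_kv": 64,
    "num_layers": 12,
    "num_decoder_layers": 12,
    "num_heads": 12,
    "vocab_size": 32128,
    "feed_forward_proj": "gated-gelu",
    "dropout_rate": 0.1,
    "layer_norm_epsilon": 1e-6,
    "tie_word_embeddings": False,
}

# Write the config.json the conversion script expects.
with open("config.json", "w") as f:
    json.dump(config, f, indent=2)
```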

@stefan-it
Collaborator

If you get any errors, please post them here, so we can try to find a solution 🤗

@StephennFernandes

@stefan-it , thanks for replying. I followed the steps as instructed in #16853 and tried converting my pretrained t5_1_1_base model to Hugging Face.

But I get the following error:

/home/stephen/anaconda3/lib/python3.9/site-packages/jax/_src/tree_util.py:188: FutureWarning: jax.tree_util.tree_multimap() is deprecated. Please use jax.tree_util.tree_map() instead as a drop-in replacement.
  warnings.warn('jax.tree_util.tree_multimap() is deprecated. Please use jax.tree_util.tree_map() '
Traceback (most recent call last):
  File "/home/stephen/Desktop/t5_test_run/t5x/t5x_convert_to_hf.py", line 234, in <module>
    convert_t5x_checkpoint_to_flax(args.t5x_checkpoint_path, args.config_name, args.flax_dump_folder_path)
  File "/home/stephen/Desktop/t5_test_run/t5x/t5x_convert_to_hf.py", line 27, in convert_t5x_checkpoint_to_flax
    t5x_model = checkpoints.load_t5x_checkpoint(t5x_checkpoint_path)
  File "/home/stephen/Desktop/t5_test_run/t5x/t5x/checkpoints.py", line 1674, in load_t5x_checkpoint
    state_dict = _run_future_tree(future_state_dict)
  File "/home/stephen/Desktop/t5_test_run/t5x/t5x/checkpoints.py", line 162, in _run_future_tree
    leaves = loop.run_until_complete(asyncio.gather(*future_leaves))
  File "/home/stephen/anaconda3/lib/python3.9/asyncio/base_events.py", line 642, in run_until_complete
    return future.result()
  File "/home/stephen/Desktop/t5_test_run/t5x/t5x/checkpoint_importer.py", line 82, in _get_and_cast
    arr = await self._get_fn()  # pytype: disable=bad-return-type
  File "/home/stephen/Desktop/t5_test_run/t5x/t5x/checkpoints.py", line 1502, in _read_ts
    t = await ts.open(tmp_ts_spec_dict, open=True)
ValueError: Error opening "zarr" driver: Error reading local file "./T5_1_1_base_hindi/checkpoint_100000/state.param_states.decoder.decoder_norm.scale.v/.zarray": Invalid key: "./T5_1_1_base_hindi/checkpoint_100000/state.param_states.decoder.decoder_norm.scale.v/.zarray"

@stefan-it
Collaborator

Hi @StephennFernandes could you try to install:

pip3 install --upgrade tensorstore==0.1.13

The tensorstore package was the reason for that zarr driver error message in my conversion experiments.

@StephennFernandes

@stefan-it , hey, I tried that but it didn't work for me; I still get the same error. I came across this issue in the t5x repo: #452

I am currently using Ubuntu 20.04 with Linux kernel 5.13.0.

@stefan-it
Collaborator

stefan-it commented Jun 20, 2022

Hi @StephennFernandes ,

I think I have a working solution now. I installed everything in a fresh virtual environment, but I got bazel errors (hopefully Google will stop using bazel someday...) when trying to build tensorstore==0.1.13.

What I did then:

pip3 install --upgrade tensorstore

to install the latest version of tensorstore. The conversion script call that does not work looks like this:

python3 convert_t5x_checkpoint_to_flax.py --t5x_checkpoint_path ./t5_1_1_small --config_name ./config_1_1.json --flax_dump_folder_path ./t5x_1_1_exported

But tensorstore is not able to handle the relative path. The trick here is to use the absolute path to the T5X checkpoint. So instead of using ./t5_1_1_small, fetch the absolute path via:

realpath ./t5_1_1_small

this returns something like:

/home/stefan/transformers/src/transformers/models/t5/t5_1_1_small

then use this path for the t5x_checkpoint_path argument.
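The same path resolution can be done from Python before invoking the script; a small sketch, where the checkpoint directory name is a placeholder:

```python
import os

# tensorstore fails on relative checkpoint paths with the zarr "Invalid key"
# error shown above; resolving to an absolute path first (the equivalent of
# `realpath`) avoids it.
t5x_checkpoint_path = os.path.abspath("./t5_1_1_small")  # placeholder directory
print(t5x_checkpoint_path)  # e.g. /home/stefan/.../t5_1_1_small
```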

I hope this works! It worked under my local setup.

(Oh, and in case you get some strange torch.fx import errors, just run pip3 install --upgrade torch --extra-index-url https://download.pytorch.org/whl/cpu to fix them)

@StephennFernandes

StephennFernandes commented Jun 20, 2022

@stefan-it , it worked 🎉 Thanks a ton for all the help 🙏

Actually, I still have a couple of other questions:

  • The current conversion only works on Flax models. Suppose I'd have to fine-tune the model in Hugging Face using PyTorch: is there a way to convert HF Flax models to PyTorch internally, or would I have to first convert the T5X model to PyTorch and then convert it to HF?

  • Also, I am a bit confused about the tokenizer. Did this conversion script also convert the tokenizer? (I don't think the SentencePiece .model file existed in the model dir.) If not, how should I go about converting the tokenizer to Hugging Face?

@peregilk
Contributor Author

@StephennFernandes Here is a link to a convenience script that I am using for creating the PyTorch and TF models.

https://github.com/peregilk/north-t5/blob/main/create_pytorch_tf_and_vocab.py

Do not expect it to run directly, though. It was really not meant for the public. However, it should give you the basic idea of how to load the models and then save them in the correct format.

@StephennFernandes

@peregilk , thanks for sharing. Actually, the link isn't available; I believe it's private. Could you please check and confirm?

@peregilk
Contributor Author

peregilk commented Jun 20, 2022

@StephennFernandes Sorry about that. Now it is public.

As a side note, especially to @patrickvonplaten: wouldn't it be nice to put a wrapper around the great script that @stefan-it has made? A script that also loads the models in Hugging Face and saves them in PyTorch and TF format, as well as creating the necessary tokenizers. Maybe it could even copy over the training logs that are saved in the t5x checkpoint directory. I have done this manually on these models: https://huggingface.co/north/t5_large_NCC. As you can see, the TensorBoard logs from t5x integrate nicely with the Training Metrics in HF.

@patrickvonplaten
Contributor

I think this would indeed be a great idea! Maybe we can open a T5X folder under https://github.com/huggingface/transformers/tree/main/examples/research_projects with lots of functionality for conversion?

@StephennFernandes

StephennFernandes commented Sep 9, 2022

@stefan-it @patrickvonplaten
Hey, were you able to convert the scalable_t5 models?

I have pretrained an mT5-base model (t5x/examples/scalable_t5/mt5/base.gin) using T5X.

But I am unable to convert it to Hugging Face. I tried several Hugging Face config.json files from t5-efficient-base, but none of them worked.

The following is the error I get when converting:

convert_t5x_checkpoint_to_flax(args.t5x_checkpoint_path, args.config_name, args.flax_dump_folder_path)
  File "/home/stephen/Desktop/mt5_finetuning_preliminary_tests/t5x_to_hf.py", line 12, in convert_t5x_checkpoint_to_flax
    split_mlp_wi = "wi_0" in t5x_model["target"]["encoder"]["layers_0"]["mlp"]
KeyError: 'layers_0'
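One possible cause (an assumption, not verified against the scalable_t5 code): the scalable configs use a scanned/stacked layer layout, so the checkpoint tree may contain a single stacked `layers` entry instead of `layers_0` … `layers_N`, which would break the script's lookup. A small helper to inspect the parameter tree of a loaded checkpoint and see which keys are actually present:

```python
def tree_keys(tree, prefix=""):
    """Yield slash-separated paths of all leaves in a nested dict."""
    for key, value in tree.items():
        path = f"{prefix}/{key}" if prefix else key
        if isinstance(value, dict):
            yield from tree_keys(value, path)
        else:
            yield path

# Toy stand-in for t5x_model["target"]; a real tree comes from
# checkpoints.load_t5x_checkpoint(t5x_checkpoint_path) as in the script.
toy_target = {"encoder": {"layers": {"mlp": {"wi_0": {"kernel": 0}}}}}
print(sorted(tree_keys(toy_target)))
```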

@stefan-it
Collaborator

stefan-it commented Sep 12, 2022

Hi @StephennFernandes ,

Really interesting! I haven't tried it with the scalable T5X models yet (the efficient T5 models that can be found on the Model Hub were converted from TensorFlow checkpoints, because they were trained with the official T5 implementation and not with T5X).

Please give me some time to investigate that :)

@joytianya

Does this script support the conversion of XL or XXL models?

@peregilk
Contributor Author

peregilk commented Dec 1, 2022

@joytianya I have been using this script a lot for converting both XL and XXL models. Works fine.

@joytianya

joytianya commented Dec 1, 2022

@peregilk thank you for your answer.

I tried it and it generated the following files in /content/flan_t5x_xl_exported. Then I used the code below (T5ForConditionalGeneration) to load the directory and got an error (Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found). How do I solve it?

model = T5ForConditionalGeneration.from_pretrained("/content/flan_t5x_xl_exported", from_flax=True)
# Error no file named pytorch_model.bin, tf_model.h5, model.ckpt.index or flax_model.msgpack found in 
# directory /content/flan_t5x_xl_exported.

/content/flan_t5x_xl_exported:
model-00001-of-00002.msgpack
model-00002-of-00002.msgpack
model.msgpack.index.json
config.json

@joytianya

@stefan-it
@peregilk
Does the script support converting T5X models into PyTorch?
If not, is there any other solution?

@peregilk
Contributor Author

peregilk commented Dec 1, 2022

@joytianya Try opening the files here: https://huggingface.co/north/t5_xl_NCC. All of these were converted using the script written by @stefan-it. Note that the large PyTorch files are split into multiple smaller files.

@joytianya

@peregilk
Thank you for your reply. I want to convert my fine-tuned model into PyTorch.
In addition, when I use the script to convert T5X to Flax, the XL and XXL models are split into multiple files. Is it possible to avoid the split, or to merge them into a single file?

@peregilk
Contributor Author

peregilk commented Dec 2, 2022

@joytianya I do not think this splitting is really related to the conversion script that @stefan-it wrote. Transformers does this automatically with large files.
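The split threshold can be controlled via the `max_shard_size` parameter of `save_pretrained`, so a single-file export is possible if you have the memory. A sketch using a tiny hypothetical config (in practice you would `from_pretrained` your converted model directory instead of instantiating fresh weights):

```python
from transformers import T5Config, T5ForConditionalGeneration

# Tiny hypothetical config so this runs quickly; a real run would load the
# converted XL/XXL checkpoint directory instead.
config = T5Config(d_model=32, d_ff=64, d_kv=8, num_layers=2,
                  num_decoder_layers=2, num_heads=2, vocab_size=128)
model = T5ForConditionalGeneration(config)

# Raising max_shard_size above the model's size yields a single weights file
# (no *index.json and no numbered shards).
model.save_pretrained("./single_file_export", max_shard_size="100GB")
```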

@joytianya

ok, thank you
