Skip to content
This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Allow flan-t5 models in ParlAI with fp16 improvment #4875

Merged
merged 2 commits into from
Nov 11, 2022
Merged

Conversation

dexterju27
Copy link
Contributor

Patch description
The current setting does allow flan-t5 model to run under parlai hugging face agent seamlessly. Adding Flan-t5 options in the agent.

Testing steps

parlai train_model -t convai2  -dt train --batchsize 1  --fp16 True  --gradient-clip 1.0 --label-truncate 256 --text-truncate 512 --log-every-n-secs 30 --lr-scheduler reduceonplateau --max-train-time 169344.0 --model hugging_face/t5 --save-after-valid True --skip-generation True --optimizer adam -lr 1e-05 -veps 0.15 -vmm min -vmt ppl -vp 15 -tblog True --t5-model-arch google/flan-t5-large  --t5-model-parallel True --dict-tokenizer gpt2

@dexterju27 dexterju27 changed the title Allow flan-t5 models in ParlAI Allow flan-t5 models in ParlAI with fp16 improvment Nov 11, 2022
Copy link
Contributor

@klshuster klshuster left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks!

@dexterju27 dexterju27 merged commit c4d7b08 into main Nov 11, 2022
@dexterju27 dexterju27 deleted the allow-flan-t5 branch November 11, 2022 18:41
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants