This repository has been archived by the owner on Nov 3, 2023. It is now read-only.

Fix RAG generation for T5 FiD #3657

Merged
merged 1 commit into master from fix_rag on May 19, 2021

Conversation

@moyapchen (Contributor) commented May 19, 2021

I noticed a big discrepancy between exact_match and token_em after I switched to using T5 FiD in public ParlAI.
Per discussion with Kurt, it's because generation for T5 is supposed to call TorchGeneratorAgent, rather than using the T5 generation via Hugging Face. This was in the original internal implementation, but got lost at some point while open sourcing.

Verified by printing out forced decoding outputs and the output from eval_model: whenever `token_em == 1`, the output of eval_model matched as well. Models trained with this change also perform considerably better.
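To illustrate the shape of the fix (this is a hypothetical sketch, not the actual ParlAI code): the agent wraps a Hugging Face T5 model, and its generation override was delegating to the wrapped model's own `generate`, bypassing the generic decoding path in the parent `TorchGeneratorAgent`. The fix routes generation back through the parent class. The class and method names below are stand-ins:

```python
# Hypothetical sketch of the bug and fix; names are stand-ins for
# ParlAI's TorchGeneratorAgent and the T5 FiD agent, not real APIs.

class GeneratorAgent:
    """Stand-in for TorchGeneratorAgent's shared generation path."""

    def _generate(self, batch):
        # Generic beam-search decoding shared by all generator agents;
        # this is the path FiD decoding is expected to go through.
        return f"generic:{batch}"


class T5FidAgent(GeneratorAgent):
    """Stand-in for the T5 FiD agent wrapping a Hugging Face model."""

    def __init__(self, hf_model):
        self.hf_model = hf_model

    def _generate(self, batch):
        # Before the fix (buggy): return self.hf_model.generate(batch),
        # i.e. Hugging Face's own generation, which skipped the shared
        # TorchGeneratorAgent path and produced mismatched outputs.
        # After the fix: defer to the parent class's generation.
        return super()._generate(batch)
```

Usage: `T5FidAgent(hf_model)._generate(batch)` now produces the same output as the generic path, which is what the forced-decoding comparison above verifies.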

@klshuster (Contributor) left a comment


thanks for fixing this!

@moyapchen moyapchen merged commit 91e883b into master May 19, 2021
@moyapchen moyapchen deleted the fix_rag branch May 19, 2021 16:19
3 participants