Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MMI criterion #15

Open
martinritchie opened this issue Nov 19, 2019 · 4 comments
Open

MMI criterion #15

martinritchie opened this issue Nov 19, 2019 · 4 comments

Comments

@martinritchie
Copy link

I am reading through the code base and paper and am trying to understand where in the code the MMI criterion is implemented and used. My guess is that during evaluation the source and target sentence are fed into the model in reverse order so that the model has to predict the source given the target. But I can't seem to see where this is implemented or where the MMI is used. Have I missed something obvious?

Thank you.

@dreasysnail
Copy link
Contributor

dreasysnail commented Nov 20, 2019

Hi, thanks for the comments. The training procedure of MMI maintains the same as the MLE training. The difference lies in the decoding part, which we unfortunately cannot directly release at this moment due to company policy. However, we value your suggestion and we would release a reverse model which is trained by using target to predict the source. You can follow our paper to incorporate the reverse model to rerank the randomly generated responses. If you meet with any problem with implementing an MMI decoder, please feel free to rise an issue and we would love to provide further help.

@dreasysnail
Copy link
Contributor

Please see https://github.com/microsoft/DialoGPT#models for the reverse model. Thanks!

@martinritchie
Copy link
Author

Wow, that was quick. Thankyou.

@LHolten
Copy link

LHolten commented Nov 25, 2019

This is my implementation of the MMI decoder #3 (comment)

It seems to work, but all suggestions are welcome

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants