How to rerank fine-tuned DialoGPT outputs with DialogRPT using HuggingFace Transformers? #69

tsutsen · 2021-05-04T18:06:47Z

I am not satisfied with the responses that DialoGPT produces -- for the most part, they seem pretty random and AI-ish to me. I fine-tuned the model with my dataset using Transformers' Trainer but that did not help much – the responses are often just quotes from the dataset out of context. I want these quotes to be relevant at least, so I decided to try DialogRPT human-vs-rand and human-vs-machine.

The problem is I do not understand how to rerank DialoGPT responses with DialogRPT using Transformers. Should I use DialogRPT during fine-tuning to compute loss? Or maybe it is possible to connect it as a LogitsProcessor? If yes, then how? As I understand, Transformers' generate() method outputs scores for every token but DialogRPT outputs a single number. How can I modify the scores of a response then?

I am new to machine learning and this stuff is quite overwhelming for me; any help is very appreciated!

The text was updated successfully, but these errors were encountered:

golsun · 2021-05-04T18:11:59Z

hi @tsutsen
thanks for your interest in our work! I'm going to post replies in this issue

dreasysnail assigned golsun and dreasysnail and unassigned dreasysnail and golsun Jun 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to rerank fine-tuned DialoGPT outputs with DialogRPT using HuggingFace Transformers? #69

How to rerank fine-tuned DialoGPT outputs with DialogRPT using HuggingFace Transformers? #69

tsutsen commented May 4, 2021

golsun commented May 4, 2021

How to rerank fine-tuned DialoGPT outputs with DialogRPT using HuggingFace Transformers? #69

How to rerank fine-tuned DialoGPT outputs with DialogRPT using HuggingFace Transformers? #69

Comments

tsutsen commented May 4, 2021

golsun commented May 4, 2021