-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Hallucination models with TF-IDF #4422
Comments
can you please paste the full stack trace, as well as the command? |
Thank you Kurt. "mytask" is a knowledge-based task that I've created that is very similar to the wizard of wikipedia, but it trained on a different dialogue dataset and has a knowledge-base different from wikipedia. The stack trace is: Traceback (most recent call last): |
Thanks, I've put a fix up in #4436 |
Thank you so much Kurt! |
going to close for now, please reopen if you run into further issues here |
Hi,
I am trying to fine-tune the models published in "Retrieval Augmentation Reduces Hallucination in Conversation" (e.g., zoo:hallucination/bart_rag_sequence/model) using TFIDF instead of DPR for retrieval. I am using the train_model cmd with the args: --init-model zoo:hallucination/bart_rag_sequence/model --tfidf-model-path my_tfidf_model. However, I get an exception at:
parlai/agents/rag/rag.py, line 507, in load_state_dict
self.model.load_state_dict(state_dict)
When I comment this line, it trains but then the model just generates garbage which means that something went really wrong. Any thoughts on how to solve this?
The paper mentions that one of the models was trained with TF-IDF to compare it with DPR, however the TF-IDF is not one of the publicly available. @klshuster could you advise on this?
Thanks
The text was updated successfully, but these errors were encountered: