-
Notifications
You must be signed in to change notification settings - Fork 170
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Meditron-7b doesn't behave as expected #13
Comments
Hi there, thanks for reaching out! In short, the In addition, the 70B model yields much better results. In our paper, you can see the performance comparisons we reported for in-context learning. Hope this helps answer your question. |
Hi, thanks for your prompt answer. I experimented with providing one-shot example. It sometimes works fine but sometimes not.
It returns
It seems okay, but for the next question
It keeps repeating itself. Any suggestions to improve the performance? I appreciate your help. |
Additionally, the model often spits back what I input. Do you have any idea how to avoid this kind of issue? Thanks. |
I am also encountering this issue. Sometimes the model also returns the same question and refuses to answer the question in the one-shot format above. |
I've been experimenting with Meditron-7b for answering medical queries, but its performance seems not as expected compared to other LLM models.
I loaded the model and tokenizer and then used the standard HF pipeline:
Then I used langchain wrapper:
For a simple greeting with
llm(prompt="Hi, how are you?")
, the model repetitively echoed the prompt:When asked about lung cancer risk factors with
llm(prompt="What are the risk factors for lung cancer?")
,, it provided a list of related questions instead of direct answers:Further, using a formatted prompt based on a GitHub repository example, the response included the prompt format instructions verbatim, without addressing the medical query.
And this returned
Is this behavior typical for Meditron-7b, or might it be an issue with my prompting technique? Additionally, would Meditron-70b potentially yield better results?
The text was updated successfully, but these errors were encountered: