Feature Request: Reintroduce chat / instruct templates #8681

Closed
4 tasks done
ericcurtin opened this issue Jul 25, 2024 · 6 comments
Labels
enhancement New feature or request stale

Comments

@ericcurtin
Collaborator

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

The chat / instruct template feature was removed in:

#7675
#7757

We found this feature extremely useful.

Motivation

We were actively using it until it was removed.

Possible Implementation

#7675

@ericcurtin ericcurtin added the enhancement New feature or request label Jul 25, 2024
@ggerganov
Member

Use the -cnv flag instead - it supports all models with defined chat templates

@ericcurtin
Collaborator Author

ericcurtin commented Jul 25, 2024

Building the master branch now.

I used to run something like this:

$ llama-main -m /models/granite --log-disable --instruct
> Tell me about Georgi
 George Orwell (185-194) was an English writer and political activist. He is best known for his dystopian novel "1984" which was published in 194. The book is considered a masterpiece of modernist fiction. Orwell's most famous work, "1984," was published in 194. The book is considered a masterpiece of modernist fiction. Orwell's most famous work, "1984," was published in 194. He is also known for his essays and his political activism, which included the writing of "Animal Farm."
>

Nice and simple: it gave me a small Ollama/ChatGPT-style terminal interface. Now I try this (on master):

$ llama-cli -m /models/granite --log-disable -cnv -p "You respond to instructions"
<|im_start|>system
You respond to instructions<|im_end|>

> Who are ABBA?
ABBA were a Swedish rock band formed in 1972. The group consisted of Agnetha Fältskog, Björn Ulvaeus, Benny Andersson, and Anders Björk. They are best known for their hit songs "Waterloo", "Dancing Queen", and "Yes, Please".<|im_end|>
>

I really don't want noise like:

<|im_start|>system
You respond to instructions<|im_end|>

or

<|im_end|>

Is there any way to get rid of this noise, which --instruct didn't show?
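
For context, the markers in the output above match the ChatML layout, where each message is wrapped as `<|im_start|>role`, the content, then `<|im_end|>`. Below is a minimal sketch of that wrapping, assuming ChatML is the template this model defines. `render_chatml` is a hypothetical helper written for illustration; it is not llama.cpp code.

```python
def render_chatml(messages, add_generation_prompt=True):
    """Wrap each message in ChatML markers (hypothetical helper, not llama.cpp code)."""
    parts = []
    for msg in messages:
        # One block per message: start marker + role, content, end marker.
        parts.append(f"<|im_start|>{msg['role']}\n{msg['content']}<|im_end|>\n")
    if add_generation_prompt:
        # Leave an open assistant turn for the model to complete.
        parts.append("<|im_start|>assistant\n")
    return "".join(parts)


prompt = render_chatml([
    {"role": "system", "content": "You respond to instructions"},
    {"role": "user", "content": "Who are ABBA?"},
])
print(prompt)
```

The system block this produces is the same text that -cnv echoed at startup, which suggests the echoed lines are the rendered template rather than model output.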

@ggerganov
Member

Add --log-disable. The <|im_end|> token should not appear - if it does, it likely means you are using a model that hasn't marked this token as special, so you might want to reconvert your model.

@ericcurtin
Collaborator Author

--log-disable was already added. I appreciate the advice, but this model comes from HuggingFace; I can't expect users to reconvert it, nor do I know how to. With --instruct this was not required; it seemed to work perfectly for all the models I was using.
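
If reconverting is not an option, one possible stopgap is to filter the markers out of captured output after the fact. The sketch below assumes ChatML-style markers like the ones shown earlier; `strip_chat_markers` is a hypothetical helper, and it only post-processes text. It does not change how llama-cli treats the tokens.

```python
import re

# Matches <|im_start|> with an optional role name and newline, or <|im_end|>.
_MARKER_RE = re.compile(r"<\|im_start\|>(?:system|user|assistant)?\n?|<\|im_end\|>")


def strip_chat_markers(text: str) -> str:
    """Remove ChatML-style markers from text (hypothetical helper)."""
    return _MARKER_RE.sub("", text)


raw = "ABBA were a Swedish rock band formed in 1972.<|im_end|>"
clean = strip_chat_markers(raw)
print(clean)
```

This hides the symptom only; the underlying fix remains marking the token as special in the model file.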

@ericcurtin
Collaborator Author

ericcurtin commented Jul 25, 2024

Without a clean TUI for all models, like we had with "--instruct", I'm not sure what the use case for "-cnv" is. If the use case were testing, interactive modes aren't really useful for that, since they can't be scripted.

@github-actions github-actions bot added the stale label Aug 25, 2024
Contributor

github-actions bot commented Sep 9, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.

@github-actions github-actions bot closed this as completed Sep 9, 2024

2 participants