Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bench: server add stop word for PHI-2 #6916

Merged
merged 1 commit into from
Apr 26, 2024
Merged

Conversation

phymbert
Copy link
Collaborator

Context

Since we properly support BOS/EOT with llama_token_is_eog and removed hardcoded stop words in utils.hpp, the phi-2 base model never ends generation. So one can think the benchmark figures decrease.

This temporary fix before switching to an instruct model in the server benchmark.

References

@phymbert phymbert requested a review from ggerganov April 26, 2024 07:06
@phymbert phymbert added performance Speed related topics build Compilation issues server/webui labels Apr 26, 2024
@phymbert phymbert merged commit 5790c8d into master Apr 26, 2024
24 of 26 checks passed
@phymbert phymbert deleted the hp/server/bench/fix-stop-word branch April 26, 2024 07:26
@ggerganov ggerganov mentioned this pull request Apr 26, 2024
nopperl pushed a commit to nopperl/llama.cpp that referenced this pull request May 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
build Compilation issues performance Speed related topics server/webui
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants