Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing spaces #46

Open
Mary-Sam opened this issue Jun 6, 2024 · 2 comments
Open

Missing spaces #46

Mary-Sam opened this issue Jun 6, 2024 · 2 comments

Comments

@Mary-Sam
Copy link

Mary-Sam commented Jun 6, 2024

I have converted Mixtral to TensoRT and I am trying to use your repository to integrate with OpenAI.
I'm using the template history_template_llama3.liquid. When I run your example code for interacting with the model (openai_completion.py and openai_completion_stream.py)

> prompt="This is a story of a hero who went"
> result of openai_completion.py:
againstthetideandfoughtagainsttheevilforces.JustlikeKanedadidinAkira,lifeissuffering,wrongdoingisboilingdeepwithinthesoul,manipulatedbythepowerofunknownforce.Humiliationispartofit,

If I contact triton directly via the http protocol, then I receive the following response to the same request:

"text_output":"to the moon and back.\n\nThe story begins with a young boy named Neil Armstrong who loved to explore and learn about the world around him. He was fascinated by the stars and the moon and dreamed of one day going to space"

How do I add all the spaces as in http protocol?

@dongs0104
Copy link

dongs0104 commented Jun 10, 2024

this code is working for me.

triton-inference-server/tensorrtllm_backend#332 (comment)

#30 (comment)

duplicate #30

@Mary-Sam
Copy link
Author

Mary-Sam commented Jun 11, 2024

@dongs0104
It really works! Thank you very much! And maybe do you know why special tokens are displayed in the generated text? And also the text is never generated until the end of the sentence

photo_2024-06-11 18 10 33

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants