Update on the development branch #424
kaiyux
announced in
Announcements
Replies: 1 comment 6 replies
-
Thank you for these awesome updates. Does this mean that the enc_dec models now support inflight batching? |
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
The TensorRT-LLM team is pleased to announce that we are pushing an update to the development branch(also includes the Triton backend) this November 17th, 2023.
This update includes:
logProbs
andcumLogProbs
stream
keyword argument is notNone
#202Thanks,
The TensorRT-LLM Engineering Team
Beta Was this translation helpful? Give feedback.
All reactions