Bug: Last 2 Chunks In Streaming Mode Come Together In Firefox #9502
Labels
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
What happened?
When using
/completion
withstream: true
, the last 2 JSON chunks come together in Firefox, but Chrome seems to handle it fine, so it might be a Firefox bug.Looking further into this, it seems like HTTP
Transfer-Encoding: chunked
requires each chunk to be terminated with\r\n
, but here\n\n
is used instead:llama.cpp/examples/server/utils.hpp
Lines 296 to 299 in 6262d13
This doesn't seem to be just a Windows requirement, but listed as part of the HTTP specification:
HTTP Chunked Transfer Coding
More information, including an example
chunked
response:Transfer-Encoding Directives
Name and Version
llama-server.exe
version: 3761 (6262d13)
built with MSVC 19.29.30154.0 for x64
What operating system are you seeing the problem on?
Windows
Relevant log output
No response
The text was updated successfully, but these errors were encountered: