-
Notifications
You must be signed in to change notification settings - Fork 10.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug: cannot create std::vector larger than max_size() #9391
Comments
It's likely something related to the sampling, but without the actual command or stacktrace it's hard to say what's wrong |
This fails the same way for a variety of input models and cli options, but I can certainly provide one of them in detail. |
Here is the launch script:
then |
Here is all that I can get so far out of the core dump from gdb:
|
And, yeah, here is the fail at the sampler:
|
It's a bug. In the meanwhile, you can replace |
I can confirm this fixes the crash, but it appears samplers no longer function on llama-server. Every time I regenerate a response, it's exactly the same. |
@Gryphe please create a new issue and provide instructions to reproduce this (ideally using curl as the client). |
@ggerganov Maybe that is caused by the reset function of the dist sampler? I see there is a |
It looks like it's because of passing the To fix this issue we should update |
What happened?
My usual build recipe and run scripts do not work after b3680. Something changed in b3681, but I don't know what.
I see this same failure across models and cli flags, so it seems to be deeper than a single feature choice, so I have excluded the launch script.
This is the actual error:
Here is what the binary reports at runtime:
Here is how I configure the build:
and some other system info:
Name and Version
$ /opt/llama/bin/llama-cli --version
version: 3681 (df270ef)
built with gcc-12 (GCC) 12.3.0 for x86_64-pc-linux-gnu
What operating system are you seeing the problem on?
Linux
Relevant log output
No response
The text was updated successfully, but these errors were encountered: