-
Notifications
You must be signed in to change notification settings - Fork 9.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Windows VS2022 Build - Returning nonsense #2
Comments
Remove the |
Ensured to pull latest, and with the removed extra '\n' token the output is identical. If I try with a different prompt:
The output matches the same post prompt output using the example prompt!
|
What happens if you use the F16 model instead?
|
F16 model produces very much more expected results. So likely an issue in the
|
I am getting similar results but I am building it with ubuntu (wsl2), with ggml-model-f16.bin results looks good, with ggml-model-q4_0.bin I get symbols too |
Ok, that clears it - the quantization code is currently tested and optimized only on ARM NEON. If you are interested, you can keep track of the progress here: |
Gotcha makes sense, sorry for the hassle! Thanks for swift follow ups. |
Fixed bos/eos token (which is both 11 according to config.json of Fal…
* vvhg-code-infill (ggerganov#1) * infill in separate example (ggerganov#2) * reverted changes to main and added infill example * cleanup * naming improvement * make : add missing blank line * fix missing semicolon * brought infill up to current main code * cleanup --------- Co-authored-by: Cebtenzzre <cebtenzzre@gmail.com>
add cmake commands
support axpy q4_0 for loop
* fstring ggerganov#1 * fstring ggerganov#2
* dictionary ggerganov#1 * dictionary ggerganov#2
Unsure if windows builds are expected to even function! 😄
I had to insert
ggml_time_init();
intomain()
of each astimer_freq
was being left at 0 and causing a divide by zero.Compiled with
cl main.cpp ggml.c utils.cpp /std:c++20 /DEBUG /EHsc
, same for quantize.cpp.Run with the following
main.exe -m ./LLaMA/7B/ggml-model-q4_0.bin -t 32 -n 512 -p "Building a website can be done in 10 simple steps:\n"
Produced the following output:
Not run to completion, but running with the same seed produces identical results. Will give it a poke around but unsure where to begin.
The text was updated successfully, but these errors were encountered: