I followed the instructions outlined in the KL Divergence PR to the best of my ability:
perplexity.exe -m "C:\Users\Kalo\Documents\GitHub\llamacpp_git\examples\dpo_7b_quant\DPO_7b_q8_0.gguf" -f "C:\Users\Kalo\Downloads\8k_redo.txt" --kl-divergence-base 8k_redo_data.dat -ngl 33
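For context, my understanding from the PR (my paraphrase; the notation here is mine, not taken from the code) is that this first run stores the base model's logits so that a later run can compute the per-token KL divergence against a quantized model:

D_{KL}(P \,\|\, Q) = \sum_i P(i) \log \frac{P(i)}{Q(i)}

where P is the q8_0 base model's probability distribution over the vocabulary at a given position and Q is the quantized model's distribution at the same position.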

On disk, the data file is saved as expected.

Then I use the data file as specified in the PR, and it fails silently:
perplexity.exe -m "C:\Users\Kalo\Documents\GitHub\llamacpp_git\examples\dpo_7b_quant\DPO_7b_IQ2_XXS_combrandt1.gguf" --kl-divergence-base "C:\Users\Kalo\Documents\GitHub\llamacpp_git\cmake_build\bin\Release\8k_redo_data.dat" --kl-divergence -ngl 33
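Since the failure is silent, the exit status immediately after the run may be the only signal; it can be read with standard cmd.exe (nothing llama.cpp-specific):

echo %errorlevel%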

Attempting to repeat this process from start to finish on WSL rather than Windows caused a segfault after hanging for ~20 seconds.
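If a backtrace would help, the WSL crash should be reproducible under gdb (standard invocation; the model and data paths below are just my local ones):

gdb --args ./perplexity -m DPO_7b_IQ2_XXS_combrandt1.gguf --kl-divergence-base 8k_redo_data.dat --kl-divergence -ngl 33
# inside gdb: "run", then "bt" after the segfault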
Attempting to remove --kl-divergence instead produces a message that the data "only tokenizes to one token":
perplexity: saving all logits to C:\Users\Kalo\Documents\GitHub\llamacpp_git\cmake_build\bin\Release\8k_redo_data.dat
perplexity: tokenizing the input ..
perplexity: tokenization took 0.459 ms
perplexity: you need at least 1024 tokens to evaluate perplexity with a context of 512
perplexity: the data file you provided tokenizes to only 1 tokens
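A file that tokenizes to a single token in half a millisecond looks like the tokenizer saw essentially no text. One thing worth ruling out (speculation on my part, not a confirmed cause) is the text file having been written as UTF-16 by a Windows tool; the first bytes can be inspected from WSL:

file 8k_redo.txt
head -c 4 8k_redo.txt | xxd   # ff fe at the start would mean a UTF-16 LE BOM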
I tried cleaning the build files in Visual Studio, reconfiguring CMake, rebuilding from scratch, etc., and I get the same issue.
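For reference, the clean rebuild followed the standard CMake flow (flags approximate; -DLLAMA_CUBLAS=ON is just the GPU backend option I normally build with):

cmake -B cmake_build -DLLAMA_CUBLAS=ON
cmake --build cmake_build --config Release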
Perplexity and imatrix calculations otherwise work as expected; I have no issues with those.