Be nice to CI machines by not allocating buffers #682

sw · 2023-04-01T15:05:22Z

...for vocab_only=true

Unless I'm misunderstanding the code base completely, the huge buffers are not needed for tokenizing.

Fixes #582.

~~convert-pth-to-ggml.py with vocab_only=1 produces identical files.~~ nvm, that doesn't use the C/C++ code.
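For illustration only, here is a minimal sketch of the idea in this description: skip the large allocation when only the vocabulary is requested. All names (`sketch_context`, `sketch_params`, `sketch_init`, `compute_buf_size`) are hypothetical stand-ins and not the actual llama.cpp structs or functions. The placeholder vocabulary is invented for the example.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a model context; NOT the real llama.cpp struct.
struct sketch_context {
    std::vector<std::string> vocab;       // tokens, needed for tokenizing
    std::vector<uint8_t>     buf_compute; // large scratch buffer, only needed for evaluation
};

// Hypothetical parameters; only `vocab_only` mirrors a name from the PR text.
struct sketch_params {
    bool        vocab_only       = false;
    std::size_t compute_buf_size = 0;     // assumed size of the scratch buffer in bytes
};

sketch_context sketch_init(const sketch_params & params) {
    sketch_context ctx;

    // The vocabulary is loaded in every mode (placeholder data here).
    ctx.vocab = {"<s>", "</s>", "hello"};

    // Allocate the big buffer only when the model will actually be evaluated.
    if (!params.vocab_only) {
        ctx.buf_compute.resize(params.compute_buf_size);
    }
    return ctx;
}
```

With `vocab_only` set, the returned context still has a usable vocabulary but an empty `buf_compute`, which is the memory saving a tokenizer-only CI test would benefit from.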



Be nice to CI machines by not allocating buffers

3ef7478

...for vocab_only=true

sw marked this pull request as ready for review April 1, 2023 15:19

sw requested a review from ggerganov April 1, 2023 15:19

slaren approved these changes Apr 1, 2023


ggerganov merged commit 81040f1 into ggml-org:master Apr 2, 2023

sw deleted the ci-tokenizer branch April 2, 2023 07:55

Deadsg pushed a commit to Deadsg/llama.cpp that referenced this pull request Dec 19, 2023

Merge pull request ggml-org#682 from abetlen/dependabot/pip/pytest-7.4.2

feee7da

Bump pytest from 7.4.0 to 7.4.2

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open
