llama : sanitize invalid tokens #9357


Merged
ggerganov merged 3 commits into master from gg/warmup-fx on Sep 7, 2024

Conversation

ggerganov (Member)

Should fix the server CI failure noticed in #9354

slaren (Member) commented Sep 7, 2024

This will probably also need checks to be added to llama.cpp and/or the server to avoid the possibility of crashing the server from client inputs.

#9333 is similar to this.

ggerganov (Member, Author)

Maybe better to add the checks in the server, since adding to libllama might incur some measurable overhead for checking all input tokens.

slaren (Member) commented Sep 7, 2024

I wouldn't expect the overhead to be significant; it's little more than two comparisons per token.
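
For reference, a minimal sketch of the kind of check being discussed: validating that every token id in a batch lies in [0, n_vocab) before decoding. The function name and the exact validity rule here are illustrative assumptions, not the code that was merged:

```cpp
#include <cstdint>
#include <cstdio>

typedef int32_t llama_token;

// Illustrative sketch of a per-token sanity check: roughly the "two
// comparisons per token" mentioned above. The name and the exact rule
// (0 <= id < n_vocab) are assumptions, not the merged implementation.
static bool batch_tokens_are_valid(const llama_token * tokens, int32_t n_tokens, int32_t n_vocab) {
    for (int32_t i = 0; i < n_tokens; ++i) {
        if (tokens[i] < 0 || tokens[i] >= n_vocab) {
            fprintf(stderr, "invalid token id %d at position %d\n", (int) tokens[i], (int) i);
            return false;
        }
    }
    return true;
}
```

Whether such a check lives in libllama or in the server only changes where bad input is rejected; either way a malformed client request fails the call instead of crashing the process.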

ggerganov changed the title from "common : do not add null tokens during warmup" to "llama : sanitize invalid tokens" on Sep 7, 2024
ggerganov merged commit faf69d4 into master on Sep 7, 2024
59 checks passed
ggerganov deleted the gg/warmup-fx branch on September 7, 2024 at 21:33
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
* common : do not add null tokens during warmup

ggml-ci

* llama : check that the input tokens are valid

ggml-ci

* tests : fix batch size of bert model

ggml-ci
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
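
For context on the first commit in the squash, "common : do not add null tokens during warmup": a minimal sketch of the idea, assuming the llama.cpp convention that llama_token_bos()/llama_token_eos() return -1 when a model has no such token; the merged change may differ in detail:

```cpp
#include <vector>
#include "llama.h"

// Hypothetical sketch of the warmup guard described by the commit message
// "common : do not add null tokens during warmup". Assumes that
// llama_token_bos()/llama_token_eos() return -1 for models without a
// BOS/EOS token; the actual merged change may differ.
static std::vector<llama_token> make_warmup_tokens(const llama_model * model) {
    std::vector<llama_token> tmp;

    const llama_token bos = llama_token_bos(model);
    const llama_token eos = llama_token_eos(model);

    // some models have no BOS/EOS token; skip the -1 sentinel instead of
    // adding a null token to the warmup batch
    if (bos != -1) { tmp.push_back(bos); }
    if (eos != -1) { tmp.push_back(eos); }

    return tmp;
}
```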