remove token functions with `context` args in favor of `model` #3720
Conversation
I have some recollection that we had these functions before, but I decided to remove them for some reason. However, now I don't see what could have been the reason.
Will leave this open for a day or two and if we don't see a problem, we can merge and potentially deprecate the `context` alternatives.
In #3301 I moved the tokenization functions to take a `llama_model` (a usage sketch follows the list below):
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
- `llama_token_get_text`
- `llama_token_get_score`
- `llama_token_get_type`
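A minimal usage sketch of these accessors keyed off the model; the prototypes are paraphrased from `llama.h` of this period and the helper name `inspect_token` is ours:

```c
#include "llama.h"

// Token attribute accessors keyed off the model (paraphrased from llama.h
// after this change; exact prototypes may differ between versions).
void inspect_token(const struct llama_model * model, llama_token token) {
    const char *          text  = llama_token_get_text (model, token); // token text
    float                 score = llama_token_get_score(model, token); // vocab score
    enum llama_token_type type  = llama_token_get_type (model, token); // e.g. normal/control
    (void)text; (void)score; (void)type; // silence unused-variable warnings
}
```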
Add `llama_model_token_*` variants to all the `llama_token_*` functions, taking in `model` instead of `context`.
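A sketch of what such parallel variants could have looked like; these names are hypothetical, following the suggestion only, since the PR ultimately changed the existing functions in place:

```c
// Hypothetical parallel declarations following the suggested
// llama_model_token_* naming; NOT the final API -- the PR instead changed
// the existing llama_token_* functions to take the model directly.
llama_token llama_model_token_bos(const struct llama_model * model);
llama_token llama_model_token_eos(const struct llama_model * model);
llama_token llama_model_token_nl (const struct llama_model * model);
```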
After @slaren's suggestions there are small changes to nearly all the examples and this becomes an API-breaking change - is there any extra work that should be done to accommodate that? (doc updates? / release notes? / hot topics?)
* master: (350 commits)
  speculative : ensure draft and target model vocab matches (ggerganov#3812)
  llama : correctly report GGUFv3 format (ggerganov#3818)
  simple : fix batch handling (ggerganov#3803)
  cuda : improve text-generation and batched decoding performance (ggerganov#3776)
  server : do not release slot on image input (ggerganov#3798)
  batched-bench : print params at start
  log : disable pid in log filenames
  server : add parameter -tb N, --threads-batch N (ggerganov#3584) (ggerganov#3768)
  server : do not block system prompt update (ggerganov#3767)
  sync : ggml (conv ops + cuda MSVC fixes) (ggerganov#3765)
  cmake : add missed dependencies (ggerganov#3763)
  cuda : add batched cuBLAS GEMM for faster attention (ggerganov#3749)
  Add more tokenizer tests (ggerganov#3742)
  metal : handle ggml_scale for n%4 != 0 (close ggerganov#3754)
  Revert "make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)"
  issues : separate bug and enhancement template + no default title (ggerganov#3748)
  Update special token handling in conversion scripts for gpt2 derived tokenizers (ggerganov#3746)
  llama : remove token functions with `context` args in favor of `model` (ggerganov#3720)
  Fix baichuan convert script not detecing model (ggerganov#3739)
  make : add optional CUDA_NATIVE_ARCH (ggerganov#2482)
  ...
Changed the following to take in `llama_model` instead of `llama_context` (paraphrased prototypes follow the list):

- `llama_token_get_text`
- `llama_token_get_score`
- `llama_token_get_type`
- `llama_token_bos`
- `llama_token_eos`
- `llama_token_nl`
- `llama_token_prefix`
- `llama_token_middle`
- `llama_token_suffix`
- `llama_token_eot`
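A sketch of the resulting special-token prototypes, paraphrased from `llama.h` of this period (the exact declarations and comments may differ slightly; the attribute accessors shown earlier follow the same pattern):

```c
// Paraphrased prototypes after the change: every accessor now takes the
// model rather than the context (exact llama.h declarations may differ).
llama_token llama_token_bos(const struct llama_model * model); // beginning-of-sentence
llama_token llama_token_eos(const struct llama_model * model); // end-of-sentence
llama_token llama_token_nl (const struct llama_model * model); // newline

// infill (fill-in-the-middle) tokens
llama_token llama_token_prefix(const struct llama_model * model);
llama_token llama_token_middle(const struct llama_model * model);
llama_token llama_token_suffix(const struct llama_model * model);
llama_token llama_token_eot   (const struct llama_model * model);
```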
Special tokens are a property of the model, not the context (which is how it is currently expressed in `llama_token_bos` and co.). As a model is always obtainable when one has a context (via `llama_get_model`), these new variants supersede the old ones.
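For callers that only hold a context, a minimal migration sketch using `llama_get_model` as described above; the helper name `print_special_tokens` is ours:

```c
#include <stdio.h>
#include "llama.h"

// Call-site migration sketch: where a context used to be passed to the
// special-token accessors, fetch the model via llama_get_model() first.
static void print_special_tokens(struct llama_context * ctx) {
    const struct llama_model * model = llama_get_model(ctx);

    // old (removed by this PR): llama_token_bos(ctx);
    // new:
    const llama_token bos = llama_token_bos(model);
    const llama_token eos = llama_token_eos(model);

    printf("bos = %d, eos = %d\n", bos, eos);
}
```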