Bump llama_tokenize APIs to latest specs #730

manishshettym · 2023-09-18T03:54:50Z

The llama_tokenize and llama_tokenize_with_model APIs are out-dated with respect to the equivalent APIs in llama.cpp (as shown below):

ISSUE: Particularly, they are missing an argument text_len which denotes the length of the text/prompt being tokenized. This causes segmentation faults, as the wrong arguments get passed on to the C++ library.

This PR adds the required arguments to llama_cpp.py. cc: @abetlen

manishshettym · 2023-09-18T04:05:51Z

Ref to the relevant change in llama.cpp's API (two days ago): ggerganov/llama.cpp#3170

manishshettym · 2023-09-19T01:47:04Z

closing this PR since the changes were made directly by @abetlen! Thank you.

bump llama_tokenize API to latest specs

39704ec

manishshettym mentioned this pull request Sep 18, 2023

Segfaults now with latest llama.cpp commits #727

Closed

manishshettym added 2 commits September 17, 2023 21:23

bump high level API

6bb3522

bug fix

90e274b

manishshettym closed this Sep 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bump llama_tokenize APIs to latest specs #730

Bump llama_tokenize APIs to latest specs #730

manishshettym commented Sep 18, 2023 •

edited

Loading

manishshettym commented Sep 18, 2023

manishshettym commented Sep 19, 2023

Bump llama_tokenize APIs to latest specs #730

Bump llama_tokenize APIs to latest specs #730

Conversation

manishshettym commented Sep 18, 2023 • edited Loading

manishshettym commented Sep 18, 2023

manishshettym commented Sep 19, 2023

manishshettym commented Sep 18, 2023 •

edited

Loading