### Prerequisites - [x] I am running the latest code. Mention the version if possible as well. - [x] I carefully followed the [README.md](https://github.com/ggml-org/llama.cpp/blob/master/README.md). - [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed). - [x] I reviewed the [Discussions](https://github.com/ggml-org/llama.cpp/discussions), and have a new and useful enhancement to share. ### Feature Description regarding the issue I posted here: https://github.com/ggml-org/llama.cpp/issues/14234 ### Motivation it's unintuitive to have to manually append the EOS token to inputs. it's not like this for transformers/onnx versions ### Possible Implementation _No response_