
Fix color getting reset before prompt output done #65

Merged (1 commit) on Mar 12, 2023

Conversation

blackhole89 (Contributor)
This should fix the issue where the last few tokens of the initial prompt would sometimes not be colored correctly, because the ANSI color-reset code was emitted too early.
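The deferred-reset idea can be sketched as follows (an illustrative helper, not the actual llama.cpp code): count the prompt tokens still to be echoed, and append the ANSI reset only after the last one has been printed, never mid-prompt.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Standard ANSI escape sequences (SGR codes).
static const char *ANSI_COLOR_YELLOW = "\x1b[33m";
static const char *ANSI_COLOR_RESET  = "\x1b[0m";

// Hypothetical helper: render the echoed prompt so the color reset
// lands strictly after the final prompt token.
std::string render_colored_prompt(const std::vector<std::string> &prompt_tokens) {
    if (prompt_tokens.empty()) {
        return std::string();
    }
    std::string out = ANSI_COLOR_YELLOW;
    std::size_t remaining = prompt_tokens.size();
    for (const auto &tok : prompt_tokens) {
        out += tok;
        if (--remaining == 0) {
            out += ANSI_COLOR_RESET;  // reset only once the whole prompt is out
        }
    }
    return out;
}
```

Emitting the reset inside the token loop but gated on the remaining count (rather than unconditionally after consuming the prompt) is what keeps the trailing tokens colored.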

@ggerganov ggerganov merged commit 404fac0 into ggml-org:master Mar 12, 2023
44670 pushed a commit to 44670/llama.cpp that referenced this pull request on Aug 2, 2023
* Long-range falcon upgrade (16k context)
Default context is now 2048
The embedding rotation has been adapted to react to the context size and the expected generation length.
Uses "NTK" Fourier-aware scaling of the rotation space.

7B and 40B have been tested to work well up to a context of 8k.
Tests at > 8k are incoming once performance at those sizes improves.

RAM requirements for K/V caches:
Falcon 7B at 8k context: ~2 GB RAM
Falcon 40B at 8k context: ~5.5 GB RAM
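Figures like these follow from the K/V cache growing linearly with context length. A generic sizing helper (a sketch; the real numbers also depend on Falcon's multi-query layout and the cache element precision):

```cpp
#include <cstddef>

// Bytes for the K and V caches:
//   2 (K and V) * layers * context length * cached width * bytes per element.
std::size_t kv_cache_bytes(std::size_t n_layer, std::size_t n_ctx,
                           std::size_t n_embd_kv, std::size_t bytes_per_elem) {
    return 2 * n_layer * n_ctx * n_embd_kv * bytes_per_elem;
}
```

Since n_ctx enters linearly, doubling the context doubles the cache footprint.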

In addition, falcon_eval() now uses a configuration struct instead of passing many parameters through multiple abstraction layers.
This makes it much easier to pass new features from main into libfalcon.
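The parameter-struct pattern described above can be sketched like this (the field names are hypothetical, not the real falcon_eval interface): a new option becomes a new struct field with a default, so existing call sites keep working without threading an extra argument through every layer.

```cpp
// Hypothetical sketch of the configuration-struct pattern; the fields
// are illustrative, not the actual libfalcon falcon_eval parameters.
struct falcon_eval_params {
    int  n_tokens  = 0;          // tokens to evaluate in this call
    int  n_past    = 0;          // tokens already in the K/V cache
    int  n_threads = 4;          // CPU threads to use
    bool debug_timings = false;  // a later-added flag: no signature change
};

// Call sites pass one struct instead of a long argument list.
int falcon_eval(const falcon_eval_params &params) {
    // (stub) report how many tokens this call would process
    return params.n_tokens;
}
```

Defaulted fields are what make the struct forward-compatible: adding `debug_timings` later does not break callers that never mention it.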

* perplexity bugfix

---------

Co-authored-by: John <cmt-nct@users.noreply.github.com>