Nvidia Power Management Mode makes Koboldcpp 50% Slower #1057

hentaitaku · 2024-08-11T00:04:36Z

hentaitaku
Aug 11, 2024

Hello i like to know why is continuous text generating faster.

When i translate text with a gguf model "lmg-anon/vntl-llama3-8b-gguf".
Is one after another text prompt with no wait, faster then one with 10sec wait between.

Continuous: 80ms wait time.
Wait between: 250ms wait time.

My Specs:
Intel i9-12900K
RTX 4070 Ti Super
64GB DDR4 Ram

Many Thanks

Answered by hentaitaku

Aug 12, 2024

Have found out why is the "Power management mode" from nvidia when i set it from Normal to Maximum Performance is my Generation 50% faster.
https://nvidia.custhelp.com/app/answers/detail/a_id/3130/~/setting-power-management-mode-from-normal-to-maximum-performance

I use gguf just to translate small text from games so not more then 20-40 tokens to Generate.

I dont use the Power management mode to much power use when i dont use koboldccp.
I use msi afterburner to load a profile when i need it.
Howto make a profile with full mhz load "curve editor > select biggest mhz your gpu can use and press L to lock it > save new profile".

View full answer

hentaitaku · 2024-08-12T01:05:11Z

hentaitaku
Aug 12, 2024
Author

Have found out why is the "Power management mode" from nvidia when i set it from Normal to Maximum Performance is my Generation 50% faster.
https://nvidia.custhelp.com/app/answers/detail/a_id/3130/~/setting-power-management-mode-from-normal-to-maximum-performance

I use gguf just to translate small text from games so not more then 20-40 tokens to Generate.

I dont use the Power management mode to much power use when i dont use koboldccp.
I use msi afterburner to load a profile when i need it.
Howto make a profile with full mhz load "curve editor > select biggest mhz your gpu can use and press L to lock it > save new profile".

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nvidia Power Management Mode makes Koboldcpp 50% Slower #1057

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Nvidia Power Management Mode makes Koboldcpp 50% Slower #1057

hentaitaku Aug 11, 2024

Replies: 1 comment

hentaitaku Aug 12, 2024 Author

hentaitaku
Aug 11, 2024

hentaitaku
Aug 12, 2024
Author