
How to make it faster? #53

Closed
theytookcake opened this issue Jan 26, 2024 · 3 comments

@theytookcake

Hello!

What settings should I tweak to make it faster? I don't know if it's even possible.

I know that smaller models answer faster, but I'm using a Mistral 7B Instruct model and it takes around 10 seconds to answer. Is there anything I can tweak to make it respond faster?

@amakropoulos
Collaborator

Hi! 10 seconds is too long.
Is this the time it takes to get the first response, or every response?
On an 8-year-old 6-core CPU, it takes ~2-3 seconds for the response to arrive.
Can you build the project and see if it takes the same amount of time?

@theytookcake
Author

Hello! I'm now running the Warmup function in Start and it's much faster. Using streaming, it takes around a second for text to start arriving!

@amakropoulos
Collaborator

Perfect! Yes, that was the reason I was asking :).

The first response needs to process the character prompt, and if the prompt is large, that takes some time.
That's why Warmup helps: it processes the character prompt once and caches the result.
I should add it to all the samples!
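For reference, a minimal sketch of the pattern discussed above: warm up in `Start` so the character prompt is processed once at startup, then use the streaming callback so text appears as it is generated. The names `LLMCharacter`, `Warmup`, and `Chat`, and their signatures, are assumptions based on the LLM for Unity package and may differ between versions — check the package's samples for the exact API.

```csharp
using UnityEngine;
using LLMUnity; // assumed namespace of the LLM for Unity package

public class ChatExample : MonoBehaviour
{
    public LLMCharacter llmCharacter; // assumed component name; assign in the Inspector

    async void Start()
    {
        // Warm up at startup: the character prompt is processed once
        // and cached, so the first user message doesn't pay that cost.
        await llmCharacter.Warmup();
        Debug.Log("Model warmed up, ready to chat.");
    }

    public void OnUserMessage(string message)
    {
        // Streaming: the partial-reply callback fires repeatedly as
        // tokens arrive, so text starts appearing almost immediately
        // instead of waiting for the full answer.
        _ = llmCharacter.Chat(message, HandlePartialReply, ReplyCompleted);
    }

    void HandlePartialReply(string partialReply) => Debug.Log(partialReply);
    void ReplyCompleted() => Debug.Log("Reply finished.");
}
```

With this in place, only the warmup call pays the prompt-processing cost; subsequent chats reuse the cached prompt state.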

I'll close the issue, let me know if you get stuck with anything else!
