Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Llama 7B (4-bit) speed on Intel 12th or 13th generation #1157

Closed
Oxi84 opened this issue Apr 24, 2023 · 0 comments
Closed

Llama 7B (4-bit) speed on Intel 12th or 13th generation #1157

Oxi84 opened this issue Apr 24, 2023 · 0 comments

Comments

@Oxi84
Copy link

Oxi84 commented Apr 24, 2023

Hello,

What is an average token generation speed on intel 12-13th generation CPUs?
I am sure somebody has it.

I only read here (#39), that speed for old intel with 4 cores is around 165 s/token and for AMD 5700G is around 100 ms/token.

So i thought that maybe Intel CPUs run faster than AMD ones.

Thanks.

@ggml-org ggml-org locked and limited conversation to collaborators Apr 24, 2023
@prusnak prusnak converted this issue into discussion #1165 Apr 24, 2023

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant