Can someone explain what are the meanings of these timings #1320

Tommy787576 · 2023-05-04T14:49:34Z

What do the load / sample / prompt eval / eval / total times mean? What does 0.58 ms per run mean? Why don't I see the timings report in the format the README shows?

This is much clearer! What is the mapping between the two? I want to know the predict figure: tokens per ms.
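On the tokens-per-ms part, here is a minimal sketch of the arithmetic only. It assumes a timing line gives a total time in ms and a count of tokens (or runs) for one stage. The function name `per_unit_rates` and the numbers are hypothetical; they are not taken from llama.cpp or from any real run.

```python
def per_unit_rates(total_ms: float, count: int) -> tuple[float, float]:
    """Return (ms per unit, units per ms) for `count` units that took `total_ms`."""
    if total_ms <= 0 or count <= 0:
        raise ValueError("total_ms and count must be positive")
    ms_per_unit = total_ms / count   # e.g. "ms per token" or "ms per run"
    units_per_ms = count / total_ms  # the reciprocal: "tokens per ms"
    return ms_per_unit, units_per_ms


# Hypothetical values for illustration only.
ms_per_token, tokens_per_ms = per_unit_rates(total_ms=5000.0, count=100)
print(f"{ms_per_token:.2f} ms per token")
print(f"{tokens_per_ms:.4f} tokens per ms")
print(f"{tokens_per_ms * 1000:.2f} tokens per second")
```

The two rates are reciprocals, so a report that prints only "ms per token" can be turned into "tokens per ms" by taking 1 divided by that value.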

ggml-org locked and limited conversation to collaborators May 4, 2023

prusnak converted this issue into discussion #1323 May 4, 2023
