MiniCPM-v 2.6 and llama-cpp can not work and accelerated on A770 dGPU? #11982

yangqing-yq · 2024-08-31T11:32:51Z

No description provided.

JinheTang · 2024-09-02T01:56:46Z

Hi @yangqing-yq , upgrading to ipex-llm[cpp]>=2.2.0b20240827 may solve this problem. Then you may run

./llama-minicpmv-cli -m ../MiniCPM-V-2_6-gguf/ggml-model-Q4_0.gguf --mmproj ../MiniCPM-V-2_6-gguf/mmproj-model-f16.gguf -c 4096 --temp 0.7 --top-p 0.8 --top-k 100 --repeat-penalty 1.05 --image xx.jpg  -p "What is in the image?" -ngl 99

model page:
openbmb/MiniCPM-V-2_6-gguf

yangqing-yq · 2024-09-14T08:00:19Z

this is the result for A750. Can you help to confirm if these values are correct?
especially the TTFT is 4689 ms?!
input image is 1920x1080
"
llama_print_timings: load time = 6392.73 ms
llama_print_timings: sample time = 43.04 ms / 73 runs ( 0.59 ms per token, 1696.29 tokens per second)
llama_print_timings: prompt eval time = 4689.01 ms / 904 tokens ( 5.19 ms per token, 192.79 tokens per second)
llama_print_timings: eval time = 1709.74 ms / 72 runs ( 23.75 ms per token, 42.11 tokens per second)
llama_print_timings: total time = 8175.84 ms / 976 tokens
"

@qiuxin2012

yangqing-yq · 2024-09-20T06:07:11Z

@JinheTang @qiuxin2012

JinheTang · 2024-09-24T07:11:16Z

Hi @yangqing-yq , we tested it on our A750 machine and our results were similar to yours. It should be correct.

qiuxin2012 added the user issue label Sep 2, 2024

rnwang04 assigned JinheTang Sep 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MiniCPM-v 2.6 and llama-cpp can not work and accelerated on A770 dGPU? #11982

MiniCPM-v 2.6 and llama-cpp can not work and accelerated on A770 dGPU? #11982

yangqing-yq commented Aug 31, 2024

JinheTang commented Sep 2, 2024

yangqing-yq commented Sep 14, 2024

yangqing-yq commented Sep 20, 2024 •

edited

Loading

JinheTang commented Sep 24, 2024

MiniCPM-v 2.6 and llama-cpp can not work and accelerated on A770 dGPU? #11982

MiniCPM-v 2.6 and llama-cpp can not work and accelerated on A770 dGPU? #11982

Comments

yangqing-yq commented Aug 31, 2024

JinheTang commented Sep 2, 2024

yangqing-yq commented Sep 14, 2024

yangqing-yq commented Sep 20, 2024 • edited Loading

JinheTang commented Sep 24, 2024

yangqing-yq commented Sep 20, 2024 •

edited

Loading