-
Notifications
You must be signed in to change notification settings - Fork 10.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ggml : move FP16 <-> FP32 stuff to ggml-impl.h #3861
Conversation
Any ideas how to fix the Windows CI? I guess something fails with the initialization of the |
This fixes the issue under Windows with MSVC:
build: 223696c (1449)
build: 6e08281 (1445)
build: ff3bad8 (1441) |
On a side note, I wasted a lot of time realizing that the default cmake build options under Windows no longer include AVX. This seems to be because |
#809 would fix this. |
I just tried adding #809 to this PR, but the CMake stuff fails on Linux:
I'll proceed to merge this PR, but we should find a way to fix the default build performance with MSVC + CMake |
* ggml : move FP16 <-> FP32 stuff to ggml-impl.h ggml-ci * tests : fix ARM build * ggml : explicitly initialize deprecated type traits * ggml : add math.h to ggml-impl.h * ggml : remove duplicate static assert macros * ggml : prefix lookup tables with ggml_ ggml-ci * ggml-impl : move extern "C" to start of file
close #3858
Alternative approach to fix FP16 <-> FP32 performance degradation due to #3833
Introduce
ggml-impl.h
and move common stuff (mostly macros) fromggml.c
into it for reuse inggml-quants.c