
fix crash on non-AVX systems dynamically loading GGML CPU backends #11780

Open · wants to merge 1 commit into master

Conversation

jmorganca (Contributor) commented:
Thanks for the awesome work by @slaren in #10469 (and a few follow-up PRs) to enable dynamic GGML backend loading. This made supporting different CPU instruction sets in GGML much, much easier.

I noticed a small hitch with the llamafile code: a machine with a non-AVX CPU would crash when trying to dlopen CPU backend libraries built with GGML_LLAMAFILE=ON. This moves the AVX-dependent code into a member variable, fixing the crash on dlopen. I'm not sure how sgemm.cpp is vendored, so let me know the best way/place to suggest a change.
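For context, here is a minimal sketch of the failure mode and the shape of the fix. All names are hypothetical stand-ins (the real change is to a lookup table in sgemm.cpp), and the snippet assumes the translation unit is compiled with AVX flags enabled:

```cpp
#include <cstdint>
#include <immintrin.h>

static const std::int8_t kvalues[16] = {1, 2,  3,  4,  5,  6,  7,  8,
                                        9, 10, 11, 12, 13, 14, 15, 16};

// BEFORE: a namespace-scope global with a dynamic initializer. Because this
// file is built with AVX enabled, the compiler may emit VEX-encoded
// instructions for the load. The initializer runs as part of dlopen() on
// the shared library, i.e. before ggml can check whether the CPU supports
// AVX, so a non-AVX machine gets an illegal-instruction fault at load time.
static const __m128i g_table = _mm_loadu_si128((const __m128i *)kvalues);

// AFTER: the same table as a member variable, initialized in the
// constructor. The constructor only runs when a kernel object is actually
// created, which happens after the backend has been selected for this CPU.
class tinyBLAS_sketch {
  public:
    tinyBLAS_sketch() : table(_mm_loadu_si128((const __m128i *)kvalues)) {}
  private:
    const __m128i table;
};
```

This is what makes the bug subtle: ggml probes each CPU backend variant at runtime, but global constructors in a shared library execute inside dlopen itself, before the probe gets a chance to reject the variant.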

github-actions bot added the ggml label (changes relating to the ggml tensor library for machine learning) on Feb 10, 2025
slaren (Collaborator) commented:


Thanks, I missed this global. The fix looks ok, but if the code is not inlined it may add some overhead to the other types. I will leave this open for a while in case someone knowledgeable about llamafile/tinyblas wants to propose a better solution.
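For intuition on the overhead concern: as a member, the table is reloaded in every constructor call and accessed through `this`, whereas the old global could be resolved to a single address. One alternative shape (an assumption on my part, not something proposed in this PR) would be a function-local static, which defers the AVX load to first use without per-object cost, at the price of a thread-safe initialization guard on each call:

```cpp
// Hypothetical alternative: lazy, shared initialization. The AVX load runs
// on first call rather than at dlopen() time, so a non-AVX CPU never
// executes it as long as the AVX code paths are never entered. Each call
// still pays the (usually well-predicted) init-guard branch.
static inline __m128i lookup_table() {
    static const __m128i t = _mm_loadu_si128((const __m128i *)kvalues);
    return t;
}
```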
