quant_cuda_kernel.cu(212): error: identifier "__hfma2" is undefined #23

HueCheng1021 · 2023-05-11T02:37:41Z

An error is reported when compiling the quant_cuda kernel.

In my case, `nvcc --version` reports:
Cuda compilation tools, release 12.0, V12.0.140

efrantar · 2023-07-10T12:46:05Z

Our kernels were developed with CUDA 11.4. However, `__hfma2` still seems to exist in the newest CUDA API, so unfortunately I am not sure what's causing the error. If you don't need our fastest FP16 kernels (e.g. if you aren't on an A100, for which they were actually developed), you could try commenting out the corresponding code in quant_cuda.cpp and quant_cuda_kernel.cu and using the FP32 version instead (i.e. omitting the option --faster-kernel).
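
One possible cause, not confirmed in this thread: the CUDA half-precision arithmetic intrinsics such as `__hfma2` are only available for device code compiled for compute capability 5.3 or higher, so building for an older target architecture can make the identifier undefined even though `cuda_fp16.h` is included. The sketch below shows one way to guard the call. The helper `fma2_compat` and the `demo` kernel are hypothetical names for illustration and are not part of the repository. The fallback path computes in FP32, so its results may differ from the native path in the last bit.

```cuda
#include <cstdio>
#include <cuda_fp16.h>  // declares __half2, __hfma2 and the conversion helpers

// Hypothetical helper (not from the repository): fused multiply-add on two
// packed halves, falling back to FP32 math where native FP16 arithmetic
// is not available.
__device__ __forceinline__ __half2 fma2_compat(__half2 a, __half2 b, __half2 c) {
#if defined(__CUDA_ARCH__) && __CUDA_ARCH__ >= 530
  return __hfma2(a, b, c);  // native path: a * b + c on both lanes
#else
  // Fallback: widen each lane to float, do the FMA, narrow back to half.
  float2 fa = __half22float2(a);
  float2 fb = __half22float2(b);
  float2 fc = __half22float2(c);
  return __floats2half2_rn(fmaf(fa.x, fb.x, fc.x), fmaf(fa.y, fb.y, fc.y));
#endif
}

// Tiny kernel that computes 1.5 * 2.0 + 0.5 in both lanes.
__global__ void demo(float2* out) {
  __half2 r = fma2_compat(__float2half2_rn(1.5f),
                          __float2half2_rn(2.0f),
                          __float2half2_rn(0.5f));
  *out = __half22float2(r);
}

int main() {
  float2* d_out = nullptr;
  float2 h_out = {0.0f, 0.0f};
  cudaMalloc(&d_out, sizeof(float2));
  demo<<<1, 1>>>(d_out);
  cudaMemcpy(&h_out, d_out, sizeof(float2), cudaMemcpyDeviceToHost);
  cudaFree(d_out);
  printf("%f %f\n", h_out.x, h_out.y);
  return 0;
}
```

If the build goes through PyTorch's extension tooling, it may also be worth checking which architectures are being targeted (for example via the `TORCH_CUDA_ARCH_LIST` environment variable), since a target below 5.3 would take the fallback branch here and would fail outright on an unguarded `__hfma2` call.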
