docs: add more details to the cuda-fp16 issue
danbev committed Jan 2, 2025
1 parent 28e40be commit adcbf3c
Showing 1 changed file with 5 additions and 0 deletions.
5 changes: 5 additions & 0 deletions notes/llama.cpp/cuda-fp16-release-build-issue.md
@@ -124,3 +124,8 @@ Notice that my GPU has a compute capability of 8.9 which is greater than the
reported 520.
So there is something wrong with the way the compute capability is being
detected.

Configuring the build with the following CMake options worked for me:
```console
cmake -S . -B build -DGGML_CUDA=ON -DGGML_CUDA_F16=ON -DGGML_NATIVE=OFF -DCMAKE_BUILD_TYPE=Release
```
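
As a possible follow-up check, the compute capability the driver reports can be
compared with what the build detects, and the target architecture can be pinned
explicitly. This is an untested sketch: it assumes a driver recent enough for
`nvidia-smi` to support the `compute_cap` query field, and `89` is my assumption
for a GPU with compute capability 8.9, not something verified in these notes.
```console
# Print the compute capability reported by the driver (for example 8.9)
nvidia-smi --query-gpu=name,compute_cap --format=csv,noheader

# Same configuration as above, but with the CUDA architecture set explicitly
# (89 is assumed to correspond to compute capability 8.9)
cmake -S . -B build -DGGML_CUDA=ON -DGGML_CUDA_F16=ON -DGGML_NATIVE=OFF -DCMAKE_BUILD_TYPE=Release -DCMAKE_CUDA_ARCHITECTURES=89
cmake --build build --config Release
```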
