[GPU] Optimized operations in the blas kernels with the latest buffer changes.

    Updated the pipeline for both fp32 and fp16.
    SwiGLU, RmsNorm and Concat ops updated.

        **Self evaluation:**
        1. Build test:   [X]Passed [ ]Failed [ ]Skipped
        2. Run test:     [X]Passed [ ]Failed [ ]Skipped

Signed-off-by: Niket Agarwal <[email protected]>
niket-agarwal committed Jan 9, 2025
1 parent 32621ff commit 7e17653
Showing 5 changed files with 245 additions and 213 deletions.