Stars
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
The official repo of Qwen (通义千问), the chat and pretrained large language models proposed by Alibaba Cloud.
📚 200+ Tensor/CUDA Core kernels, ⚡️ flash-attn-mma, ⚡️ hgemm with WMMA, MMA, and CuTe (reaching 98%~100% of cuBLAS/FlashAttention-2 TFLOPS 🎉🎉).
FlashInfer: Kernel Library for LLM Serving
SGLang is a fast serving framework for large language models and vision language models.
RouteLLM: a framework for serving and evaluating LLM routers, reducing LLM costs without compromising response quality.
FlashAttention: fast and memory-efficient exact attention.
vLLM: a high-throughput and memory-efficient inference and serving engine for LLMs (see the minimal usage sketch after this list).
📖 A curated list of awesome LLM/VLM inference papers with code: WINT8/4, Flash-Attention, Paged-Attention, parallelism, etc. 🎉🎉
Awesome-LLM: a curated list of Large Language Model resources.
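
Several of the inference engines above (vLLM, SGLang, MII) expose similar Python APIs for offline batch generation. Below is a minimal sketch using vLLM, assuming a small placeholder model; it illustrates the shape of the API, not a tuned deployment:

    # Minimal vLLM offline-inference sketch.
    # The model name below is a placeholder; any supported HF causal LM works.
    from vllm import LLM, SamplingParams

    prompts = ["Explain paged attention in one sentence."]
    sampling = SamplingParams(temperature=0.8, max_tokens=64)

    llm = LLM(model="facebook/opt-125m")  # placeholder model
    for output in llm.generate(prompts, sampling):
        print(output.outputs[0].text)

For online serving, these engines also ship OpenAI-compatible HTTP servers (e.g. vLLM's `vllm serve <model>`), which is where the throughput and latency claims in the descriptions above apply.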