vokkko

Follow

vokkko vokkko

Follow

contributions to the community.

2 followers · 23 following

Popular repositories Loading

Awesome-Efficient-LLM Awesome-Efficient-LLM Public

Forked from horseee/Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python
auto-round auto-round Public

Forked from intel/auto-round

SOTA Weight-only Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

Python
EfficientDM EfficientDM Public

Forked from ThisisBillhe/EfficientDM

[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"

Jupyter Notebook
Quest Quest Public

Forked from mit-han-lab/Quest

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda
llmc llmc Public

Forked from ModelTC/llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

Python
bilivideos bilivideos Public

Forked from cauyxy/bilivideos

Jupyter Notebook