Skip to content
View rk119's full-sized avatar
🤺
Im tryna get it together OKAY?
🤺
Im tryna get it together OKAY?
  • Dubai
  • 05:16 - 4h ahead
  • LinkedIn in/rk119

Highlights

  • Pro

Block or report rk119

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…

2,016 217 Updated Mar 4, 2025

Done in a safe environment for educational purposes :)

2 Updated Oct 5, 2024

Visualizing various metrics collected from various cryptographies

Jupyter Notebook 2 Updated Oct 4, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 24,252 2,103 Updated Mar 7, 2025

The Hugging Face course on Transformers

MDX 2,662 837 Updated Mar 7, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 36,659 2,777 Updated Mar 8, 2025
Python 779 38 Updated Mar 5, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,698 617 Updated Mar 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 40,786 6,137 Updated Mar 8, 2025

FuriosaAI SDK

Python 40 10 Updated Aug 7, 2024
Python 1,028 144 Updated Feb 12, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,883 2,433 Updated Mar 8, 2025

Open weights LLM from Google DeepMind.

Python 2,636 344 Updated Mar 8, 2025
1 Updated Oct 30, 2024

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,574 468 Updated Mar 8, 2025

The website for PyTorch

HTML 244 299 Updated Mar 7, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,476 242 Updated Feb 20, 2025

Official inference framework for 1-bit LLMs

C++ 12,790 899 Updated Feb 18, 2025

A curated list of neural network pruning resources.

2,417 331 Updated Apr 4, 2024

ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.

Python 322 59 Updated Mar 7, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,568 2,926 Updated Mar 8, 2025

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python 2,345 263 Updated Mar 7, 2025

Dataflow compiler for QNN inference on FPGAs

Python 792 251 Updated Mar 7, 2025

OpenBao exists to provide a software solution to manage, store, and distribute sensitive data including secrets, certificates, and keys.

Go 3,423 174 Updated Mar 8, 2025

The Open Source Feature Store for AI/ML

Python 5,852 1,054 Updated Mar 8, 2025

Monocle is a framework for tracing GenAI app code. This repo contains implementation of Monocle for GenAI apps written in Python.

Python 22 14 Updated Mar 7, 2025

GenAI components at micro-service level; GenAI service composer to create mega-service

Python 122 177 Updated Mar 7, 2025

A scikit-learn compatible neural network library that wraps PyTorch

Jupyter Notebook 5,980 395 Updated Feb 4, 2025
Next
Showing results