Skip to content
View alikolling's full-sized avatar

Highlights

  • Pro

Block or report alikolling

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 88 14 Updated Feb 5, 2025
Python 5 1 Updated Nov 24, 2024

the resources I use to learn computer science in my spare time

3,818 354 Updated Feb 14, 2023

Introduction to automated task planning using the Unified Planning library

Jupyter Notebook 27 1 Updated Feb 2, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 4,723 606 Updated Dec 26, 2024
Verilog 10 Updated Nov 11, 2024

Pytorch implementation of diffusion models on Lie Groups for 6D grasp pose generation https://sites.google.com/view/se3dif/home

Jupyter Notebook 280 33 Updated Jul 3, 2024

Train transformer language models with reinforcement learning.

Python 11,102 1,484 Updated Feb 5, 2025

Fine-tune LLM agents with online reinforcement learning

Python 1,055 48 Updated Mar 19, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 36,396 5,504 Updated Feb 6, 2025

MR.Q is a general-purpose model-free reinforcement learning algorithm.

Python 37 Updated Feb 5, 2025

Rex is a JAX-powered framework for sim-to-real robotics.

Python 16 Updated Jan 28, 2025

Awesome Equinox - A curated list of resources of https://github.com/patrick-kidger/equinox

14 1 Updated Jun 24, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 8,199 1,065 Updated Feb 1, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,461 2,268 Updated Aug 5, 2024

An open-source library for GPU-accelerated robot learning and sim-to-real transfer.

Jupyter Notebook 581 47 Updated Jan 29, 2025

A Wadler--Lindig pretty printer for Python

Python 31 1 Updated Feb 3, 2025

References on Optimal Control, Reinforcement Learning and Motion Planning

939 206 Updated Feb 26, 2022
Python 21 6 Updated Oct 28, 2020
Jupyter Notebook 12 1 Updated Feb 4, 2025
Python 35 3 Updated Dec 17, 2024

Near Zero-Overhead Python Code Coverage

Python 512 22 Updated Dec 17, 2024

Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

Python 12,401 402 Updated Jan 27, 2025

Flow Matching implemented in PyTorch

Python 24 2 Updated Jan 13, 2025

We are on a mission to make robotics available to the regular software engineers, by decoupling it from ROS and physical hardware.

Python 125 9 Updated Dec 30, 2024
Next
Showing results