-
the University of Edinburgh
- https://xinhuajian.wordpress.com/
- https://scholar.google.com/citations?hl=en&user=E5M9x8wAAAAJ
Highlights
- Pro
Stars
SGLang is a fast serving framework for large language models and vision language models.
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
Notebooks for the O'Reilly book "Learning Ray"
A repo lists papers related to LLM based agent
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
📃 A better UX for chat, writing content, and coding with LLMs.
aider is AI pair programming in your terminal
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?
Python client to interact with the lean4 language server.
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
CP4 Free Source Code Project (C++17, Java11, Python3 and OCaml)
32 projects in the framework of Deep Reinforcement Learning algorithms: Q-learning, DQN, PPO, DDPG, TD3, SAC, A2C and others. Each project is provided with a detailed training log.
Tactics for discharging Lean goals into SMT solvers.
A Foreign Function Interface (FFI) to cvc5 solver in Lean.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Auriga is a minimalist LaTeX beamer presentation theme 📽
A proof assistant adapter designed for machine learning
A bibliography and survey of the papers surrounding o1
The mirror of RL_Coding_Exercise.
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."