Stars
ComfyUI implementation for timestep shift used in NitroFusion
NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training
Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
ControlNet++: All-in-one ControlNet for image generation and editing!
Code for Photo-Sketching: Inferring Contour Drawings from Images 🐶
Stanford NLP Python library for Representation Finetuning (ReFT)
VMamba: Visual State Space Models; code is based on mamba
[NeurIPS 2024] Official implementation of "Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models"
[ICLR 2024] GitHub repo for "HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion"
[IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
Controlling diffusion-based image generation with just a few strokes
A gradio web UI demo for Stable Diffusion XL 1.0, with refiner and MultiGPU support
Generative Models by Stability AI
Implementation of Key-Locked Rank One Editing, from Nvidia AI
A linear estimator on top of clip to predict the aesthetic quality of pictures
[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
CLIP+MLP Aesthetic Score Predictor
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture"
The official PyTorch implementation of L2CS-Net for gaze estimation and tracking
Official PyTorch implementation of 'RoI Tanh-polar Transformer Network for Face Parsing in the Wild.'
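Provides a practical interaction interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other languages; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…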
[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.
Official PyTorch implementation of Fully Attentional Networks