knickknack/DDPM_MNIST at main · slatter666/knickknack

History

Name		Name	Last commit message	Last commit date
parent directory ..
gen		gen
dataset.py		dataset.py
model.py		model.py
readme.md		readme.md
run.py		run.py
utils.py		utils.py

readme.md

Denoising Diffusion Probabilistic Model for Generating Handwriting Numbers

1. Introduction

Here we will train a simple diffusion model to generate handwriting number
The dataset is MNIST, it will be downloaded under the folder dataset using torchvision, the dataset folder structure looks like this:

dataset
├── mnist
│   └── MNIST
│   │   └── raw
│   │       ├── t10k-images-idx3-ubyte
│   │       ├── t10k-images-idx3-ubyte.gz
│   │       ├── t10k-labels-idx1-ubyte
│   │       ├── t10k-labels-idx1-ubyte.gz
│   │       ├── train-images-idx3-ubyte
│   │       ├── train-images-idx3-ubyte.gz
│   │       ├── train-labels-idx1-ubyte
│   │       └── train-labels-idx1-ubyte.gz

2. Load dataset, Build model, Train model

Actually I try to use a simplest model which is MLP to do this task, but I find that doesn't work
For this task, we build a simple U-Net which contains convolution and residual connection
Here I use a NVIDIA GeForce RTX 3090 to train, each epoch will cost about 15 seconds
If you want to train from scratch, you don't have to modify anything. If you finish training and want to generate number picture, modify mode, simply run program and wait for your generated numbers

python run.py

Of course, you can modify the model architecture or try some other hyper-parameters, do anything you want
In fact this dataset is very easy, so you'll find the result is pretty good after only 100 epochs' training

3. Check the quality of generated image

First of all, we will use random Gaussian Noise to sample some images, here are 256 examples

Then let's check the diffusion process, here we show first six diffusion process
I think the quality is pretty good and note that our model's parameters are only 2.3M(of course this model is more complex than a simple MLP)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DDPM_MNIST

DDPM_MNIST

readme.md

Denoising Diffusion Probabilistic Model for Generating Handwriting Numbers

1. Introduction

2. Load dataset, Build model, Train model

3. Check the quality of generated image

4. Some references

Files

DDPM_MNIST

Directory actions

More options

Directory actions

More options

Latest commit

History

DDPM_MNIST

Folders and files

parent directory

readme.md

Denoising Diffusion Probabilistic Model for Generating Handwriting Numbers

1. Introduction

2. Load dataset, Build model, Train model

3. Check the quality of generated image

4. Some references