ReSwapper aims to reproduce the implementation of inswapper. This repository provides code for training and inference, and includes pretrained weights.
Here is a comparison of the outputs of Inswapper and Reswapper.
Target | Source | Inswapper Output | Reswapper Output (256 resolution, Step 1399500) | Reswapper Output (Step 1019500) | Reswapper Output (Step 429500)
---|---|---|---|---|---
git clone https://github.com/somanchiu/ReSwapper.git
cd ReSwapper
python -m venv venv
venv\scripts\activate
pip install -r requirements.txt
pip install torch torchvision --force --index-url https://download.pytorch.org/whl/cu121
pip install onnxruntime-gpu --force --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/
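After installation, a quick sanity check (not part of the original setup steps) can confirm that the GPU builds of PyTorch and ONNX Runtime are usable:

```python
# Optional check that the CUDA-enabled packages installed above are picked up.
import torch
import onnxruntime

print("PyTorch CUDA available:", torch.cuda.is_available())
print("ONNX Runtime providers:", onnxruntime.get_available_providers())
```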
The inswapper model architecture can be visualized in Netron and compared with the ReSwapper implementation to see the architectural similarities. Exporting the model with opset_version=10 makes the graph easier to compare in Netron; however, it causes issue #8.
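As a rough sketch of such an export (the model object and input shapes are assumptions based on the input list below, not the exact call used in this repository):

```python
# Hedged sketch: export a swapper-style model to ONNX for inspection in Netron.
import torch

dummy_target = torch.randn(1, 3, 128, 128)  # aligned target face (see the input list below)
dummy_latent = torch.randn(1, 512)          # source identity latent

torch.onnx.export(
    model,                        # assumed: a loaded ReSwapper model instance
    (dummy_target, dummy_latent),
    "test.onnx",
    opset_version=11,             # opset 10 compares more easily in Netron but triggers issue #8
    input_names=["target", "source"],
    output_names=["output"],
)
```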
We can also use the following Python code to get more details:

import onnx

model = onnx.load('test.onnx')
printable_graph = onnx.helper.printable_graph(model.graph)
print(printable_graph)
The model architectures of InSwapper and SimSwap are extremely similar, which is worth paying attention to.
- target: [1, 3, 128, 128] shape image in RGB format with face alignment, normalized to [-1, 1] range
- source (latent): [1, 512] shape vector, the features of the source face
- To calculate the latent, "emap" can be extracted from the original inswapper model:

  latent = source_face.normed_embedding.reshape((1, -1))
  latent = np.dot(latent, emap)
  latent /= np.linalg.norm(latent)
- The latent can also be used to measure the similarity between two faces via cosine similarity (a sketch follows this list).
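For reference, here is a minimal sketch of that cosine-similarity check (function and variable names are illustrative):

```python
import numpy as np

def cosine_similarity(latent_a: np.ndarray, latent_b: np.ndarray) -> float:
    """Cosine similarity between two face latents; values near 1.0 indicate the same identity."""
    a, b = latent_a.ravel(), latent_b.ravel()
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# If both latents are already L2-normalized (as above), np.dot(a, b) alone is enough.
```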
The inswapper_128 model changes not only the facial features but also the body shape.
Target | Source | Inswapper Output | Reswapper Output (Step 429500)
---|---|---|---
There is no information released by insightface about this important part of the training. However, there are many articles and papers that can be referenced. By reading a substantial number of articles and papers on face swapping, ID fidelity, and style transfer, you will frequently encounter the following keywords (a sketch of how such losses are often combined follows the list):
- content loss
- style loss/id loss
- perceptual loss
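Since insightface has not published its training objective, the following is only a hedged illustration of how such terms are commonly combined; the extractors, weights, and loss choices are assumptions rather than ReSwapper's actual implementation:

```python
import torch.nn.functional as F

def combined_loss(output, reference, source_latent, identity_encoder, feature_extractor,
                  w_content=1.0, w_id=1.0, w_perceptual=1.0):
    """Illustrative weighted sum of the loss terms listed above (weights are arbitrary)."""
    # Content loss: pixel-level reconstruction against the reference swapped image.
    content = F.l1_loss(output, reference)

    # ID/style loss: cosine distance between the identity embedding of the output
    # and the source latent.
    output_id = F.normalize(identity_encoder(output), dim=-1)
    id_loss = 1.0 - F.cosine_similarity(output_id, F.normalize(source_latent, dim=-1)).mean()

    # Perceptual loss: distance in the feature space of a pretrained network (e.g. VGG).
    perceptual = F.l1_loss(feature_extractor(output), feature_extractor(reference))

    return w_content * content + w_id * id_loss + w_perceptual * perceptual
```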
Face alignment is handled incorrectly at resolutions other than 128. To resolve this issue, add an offset to "dst" in both x and y directions in the function "face_align.estimate_norm". The offset is approximately given by the formula: Offset = 0.0039 * Resolution - 0.5
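A hedged sketch of that correction (the exact place where the offset is applied inside face_align.estimate_norm may differ):

```python
def alignment_offset(resolution: int) -> float:
    # Empirical offset from the formula above: 0.0039 * resolution - 0.5
    return 0.0039 * resolution - 0.5

# Inside face_align.estimate_norm, shift the destination landmarks "dst" by this
# offset in both directions before solving for the transform, e.g.:
#   dst[:, 0] += alignment_offset(image_size)
#   dst[:, 1] += alignment_offset(image_size)
```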
If you don't want to train the model from scratch, you can download the pretrained weights and pass model_path into the train function in train.py.
Download FFHQ to use as the target and source images. For the swapped face images, the inswapper output can be used.
- Optimizer: Adam
- Learning rate: 0.0001
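In PyTorch this corresponds to something like the following (a sketch; `model` stands for the ReSwapper generator):

```python
import torch

optimizer = torch.optim.Adam(model.parameters(), lr=0.0001)  # Adam with the learning rate above
```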
Modify the code in train.py if needed. Then, execute:
python train.py
The model will be saved as "reswapper-<total steps>.pth". You can also save the model as ONNX using the ModelFormat.save_as_onnx_model function. The ONNX model can then be used with the original INSwapper class.
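For example, loading an exported ONNX file through insightface's INSwapper class might look like this (the file name is illustrative):

```python
from insightface.model_zoo.inswapper import INSwapper

swapper = INSwapper(model_file="reswapper-429500.onnx")
# result = swapper.get(frame, target_face, source_face, paste_back=True)
```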
All losses will be logged into TensorBoard.
Using images with different resolutions simultaneously to train the model will enhance its generalization ability. To apply this strategy, you can pass "resolutions" into the train function.
Generalization ability of the model trained with resolutions of 128 and 256:

Output resolution | 128 | 160 | 256
---|---|---|---
Enhancing data diversity will improve the output quality. You can pass "enableDataAugmentation" into the train function to perform data augmentation.
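A hedged example of combining these options (the argument names come from this README; the exact train() signature and values should be checked in train.py):

```python
from train import train

train(
    model_path="reswapper-429500.pth",   # optional pretrained weights (illustrative file name)
    resolutions=[128, 256],              # multi-resolution training for better generalization
    enableDataAugmentation=True,         # enable data augmentation
)
```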
Target | Source | Inswapper Output | Reswapper Output (Step 1567500) | Reswapper Output (Step 1399500)
---|---|---|---|---
- Do not stop the training too early.
- I'm using an RTX 3060 12GB for training. It takes around 12 hours for 50,000 steps.
- The optimizer may need to be changed to SGD for the final training, as many articles show that SGD can result in lower loss.
- To get inspiration for improving the model, you might want to review the commented code and unused functions in commit c2a12e10021ecd1342b9ba50570a16b18f9634b9.
python swap.py
If you downloaded the ONNX format model before 2024/11/25, please download the model again or export the model with opset_version=11. This is related to issue #8.
- Create a 512-resolution model (alternative to inswapper_512)