In Generalizable Few-Shot Anomaly Detection (GFSAD), a single model must be learned and shared across several categories while remaining adaptable to new categories given only a limited number of normal images. Although CNN-transformer architectures have achieved strong results in many vision tasks, their potential in GFSAD remains largely unexplored. In this paper, we introduce ADFormer, a dual CNN-transformer architecture that combines the strengths of CNNs and transformers to learn discriminative features with both local and global receptive fields. We also incorporate a self-supervised bipartite matching approach into ADFormer that reconstructs query images from support images and then detects anomalies in regions with high reconstruction loss. Additionally, we present a consistency-enhanced loss that improves the spatial and semantic consistency of features, thereby reducing the dependence on a large anomaly detection dataset for training. Experimental results show that ADFormer with the consistency-enhanced loss significantly improves GFSAD performance and considerably outperforms other anomaly detection methods on the MVTec AD, MPDD, and VisA datasets.
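The core idea of matching-based reconstruction can be illustrated with a minimal sketch (not the paper's implementation): pair each query patch feature with its best-matching support patch feature via bipartite matching, reconstruct the query from the matched support patches, and score anomalies by the per-patch reconstruction error. All function and variable names below are hypothetical.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def anomaly_scores(query_patches, support_patches):
    """Score query patches by reconstruction error after bipartite matching.

    query_patches: (Nq, D) array of query patch features.
    support_patches: (Ns, D) array of normal support patch features (Ns >= Nq).
    Returns an (Nq,) array; higher values indicate likely anomalies.
    """
    # Pairwise Euclidean distances between query and support features.
    cost = np.linalg.norm(
        query_patches[:, None, :] - support_patches[None, :, :], axis=-1
    )
    # One-to-one matching that minimizes total matching cost.
    q_idx, s_idx = linear_sum_assignment(cost)
    # "Reconstruct" each query patch as its matched support patch.
    recon = np.empty_like(query_patches)
    recon[q_idx] = support_patches[s_idx]
    # Patches that no normal support patch explains well get a high error.
    return np.linalg.norm(query_patches - recon, axis=-1)
```

An image-level decision can then be made by thresholding, e.g., the maximum patch score; the actual model instead matches learned CNN-transformer features end-to-end.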
We follow RegAD for dataset preparation.

Download the support dataset for few-shot anomaly detection from Google Drive or Baidu Disk (i9rx) and unzip it. If you have trouble downloading the full support set, you can optionally download the capsule and grid categories separately from Baidu Disk (pll9) and Baidu Disk (ns0n).
Train:

```shell
python train.py
```

Test:

```shell
python test.py
```
- Thanks to MaskFormer and RegAD for their wonderful work and code!
```bibtex
@article{zhu2024adformer,
  title={ADFormer: Generalizable Few-Shot Anomaly Detection with Dual CNN-Transformer Architecture},
  author={Zhu, Bingke and Gu, Zhaopeng and Zhu, Guibo and Chen, Yingying and Tang, Ming and Wang, Jinqiao},
  journal={IEEE Transactions on Instrumentation and Measurement},
  year={2024},
  publisher={IEEE}
}
```