Stable-Diffusion-Anomaly-Detection

ABSTRACT

Visual anomaly detection is essential for industrial quality inspection and medical diagnosis. Previous research in this field has focused on training custom models for each specific task, which requires thousands of images and annotation. In this work, we depart from this approach, drawing inspiration from reconstruction based methodologies and leveraging the remarkable zero-shot generalization capabilities of foundation models. We propose a novel framework, Stable Diffusion Anomaly Detection (SDAD), which operates by reconstructing target images using pre-trained diffusion models and employs Segment Anything to enhance the adaptability of modern foundation models to anomaly detection. In VisA dataset, SDAD achieves pixel level state-of-the-art results in zero-shot visual anomaly detection without further tuning. This highlights the effectiveness of our framework in achieving superior anomaly detection performance without the task-specific constraints of traditional approaches.

SDAD Framework

Our proposed anomaly detection framework, Stable Diffusion Anomaly Detection (SDAD), consists of three components which are background remover, image reconstructor, and change detector, as shown in Figure. The initial step in the SDAD framework involves the use of a background remover to generate a mask that isolates the object from the background, allowing the image reconstructor to focus more attentively on the object itself. This strategic step is essential, as defects are expected to manifest on the objects rather than on the background. Second, we employ a denoising U-Net as an image reconstructor to transform the defective image into a defect-free version. Leveraging that image reconstructor generates samples representing the entire distribution of normal samples while being incapable of generating samples deviating from that distribution. This enables the detection of anomalies by comparing the anomalous input with its predicted flawless reconstruction. Lastly, the change detector component is employed to compare the input image with the reconstructed image to identify and highlight the differences, thereby producing the resultant mask highlighting detected anomalies. Through this multi-stage process, our framework offers a comprehensive solution for anomaly detection, leveraging the strengths of each component to effectively identify and delineate anomalies within the input images.

Results

Installation

python -m pip install -r requirements.txt

Download the pretrained weights

mkdir models
cd models
wget -q https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth

Inference

python scripts/main.py

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
assets		assets
data		data
diffusers @ 7d4a257		diffusers @ 7d4a257
ip_adapter @ ba40563		ip_adapter @ ba40563
scripts		scripts
segment-anything @ 6fdee8f		segment-anything @ 6fdee8f
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stable-Diffusion-Anomaly-Detection

ABSTRACT

SDAD Framework

Results

Installation

Inference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Stable-Diffusion-Anomaly-Detection

ABSTRACT

SDAD Framework

Results

Installation

Inference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages