Back-in-Time Diffusion (BTD): Unsupervised Detection of Medical Deepfakes

Overview

Back-in-Time Diffusion (BTD) is a novel approach for detecting medical deepfakes using Denoising Diffusion Probabilistic Models (DDPMs). This repository contains the implementation of BTD, which is designed to detect tampered CT and MRI medical images, specifically focusing on the injection and removal of tumors.

BTD leverages the generative abilities of DDPMs in a reverse diffusion process to reveal synthetic content by analyzing residuals. This approach allows for robust detection of deepfakes in medical imagery, outperforming other state-of-the-art unsupervised detection methods.

For more details, please see our paper: Back-in-Time Diffusion: Unsupervised Detection of Medical Deepfakes

Introduction

Recent advancements in generative AI have made it possible to create highly realistic deepfakes, raising significant concerns in the medical field. The proliferation of these technologies allows for the manipulation of medical images, posing threats such as false diagnoses and insurance fraud.

An illustration of the danger: slices from real medical scans (left) have tumors injected into or removed from them (center).

BTD introduces a new paradigm in unsupervised anomaly detection for medical images. By reversing the diffusion process, BTD can identify the subtle forensic artifacts left behind by generative models, even when these artifacts are not apparent to the human eye or to conventional AI-based detectors.

Methodology

BTD operates by applying a single backward (reverse) diffusion step to a given image and measuring the residual between the original and the partially denoised image. The model is trained in an unsupervised manner on genuine medical images only, which lets it generalize to deepfake technologies it has never seen.

Overview of the Back-in-Time Diffusion framework: on the left, the model architecture is depicted, where the U-Net predicts the noise added to the images during training. The center shows the training process, where noise is progressively added to an image (x0) to create noisy versions (xt) and the model learns to predict the noise at each step. On the right, the detection process is outlined: the model computes the difference between the original image (x0) and the partially denoised image (x-1) after one reverse step, with this error serving as the detection signal.
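
To make the detection signal concrete, here is a minimal sketch, not the repository's exact scoring code: it assumes a noise-prediction U-Net like the ones loaded later in this README, a linear beta schedule (the library's default may differ), inputs already scaled to [-1, 1], and an arbitrarily chosen small step index t.

import torch

def btd_score(unet, x0, t=2, timesteps=1000, beta_start=1e-4, beta_end=0.02):
    # Treat the (already [-1, 1]-scaled) input as x_t at a small step t and take
    # one deterministic reverse DDPM step; the residual is the detection signal.
    device = x0.device
    betas = torch.linspace(beta_start, beta_end, timesteps, device=device)   # linear schedule (assumption)
    alphas = 1.0 - betas
    alphas_cumprod = torch.cumprod(alphas, dim=0)

    t_batch = torch.full((x0.shape[0],), t, device=device, dtype=torch.long)
    pred_noise = unet(x0, t_batch)                       # noise predicted by the U-Net

    # Posterior mean of one reverse step (stochastic noise term omitted)
    x_prev = (x0 - betas[t] / torch.sqrt(1.0 - alphas_cumprod[t]) * pred_noise) / torch.sqrt(alphas[t])

    # Per-image mean squared residual between the input and its one-step denoised version
    return ((x0 - x_prev) ** 2).flatten(1).mean(dim=1)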

Key Features:

  • Unsupervised Learning: BTD does not require labeled datasets, making it practical for real-world applications.
  • Robust Detection: The model outperforms existing methods in detecting both injection and removal deepfakes in CT and MRI scans.

Dataset

The test sets used in this repository are specifically curated to evaluate the performance of the Back-in-Time Diffusion (BTD) model in detecting medical deepfakes in CT and MRI scans. These datasets are available for download on Kaggle:

Dataset Description

The test sets consist of both original and tampered images, where fake tumors were either injected into or removed from the scans. These manipulations were performed using advanced generative models such as CT-GAN and fine-tuned Stable Diffusion.

  • CT Test Set: Contains slices of lung CT scans with injected and removed tumors. The tampered images were generated using CT-GAN and fine-tuned Stable Diffusion models.
  • MRI Test Set: Contains breast MRI slices with injected and removed tumors. The tampered images were generated using fine-tuned Stable Diffusion models.

Provenance

The MRI scans used in this work were sourced from the Duke Breast Cancer MRI Dataset (found here), and the CT scans from the LIDC-IDRI CT Dataset (found here).

BTD Code

The BTD codebase provides the implementation of the Back-in-Time Diffusion (BTD) models for detecting deepfakes in medical images, including training scripts, pretrained model loading, and evaluation tools. Below, you'll find detailed instructions for training the models, loading pretrained weights, and evaluating their performance on MRI and CT datasets.

Implementation Notes:

  • Tested on an RTX 4090 GPU with 24GB VRAM for training and evaluation of BTD.
  • Tested on a Linux system with 128GB RAM and Intel Xeon E5-1650 v4 CPU (12 cores).
  • Tested using Anaconda 2023.03 with Python 3.10.6, PyTorch 2.0, and CUDA 12.4.

Python dependencies:

  • Usually already installed: numpy, tqdm, pillow
  • May need to be installed: torch, einops, ema-pytorch, accelerate, pytorch-fid, pandas, denoising_diffusion_pytorch==1.8.3

To install the dependencies, run this in the terminal:

pip install torch torchvision einops pillow tqdm ema-pytorch accelerate numpy pytorch-fid pandas denoising_diffusion_pytorch==1.8.3
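
Optionally, you can verify the installation and GPU visibility with a quick import check:

python -c "import torch, denoising_diffusion_pytorch; print(torch.__version__, torch.cuda.is_available())"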

Model Weights

Pretrained model weights for both CT and MRI models are available for download.

Download Links:

Usage

To load the pretrained weights in your project, you can use the following code:

import torch
from denoising_diffusion_pytorch import Unet, GaussianDiffusion

# Paths to the downloaded release weights
CT_weights = "path_to_downloaded/CT_model.pt"
MRI_weights = "path_to_downloaded/MRI_model.pt"

device = 'cuda' if torch.cuda.is_available() else 'cpu'

# Initialize the UNet model for CT
CT_unet = Unet(
    dim = 32,
    dim_mults = (1, 2, 4, 8),
    channels = 1
)

# Wrap it in the diffusion model and load the CT weights
CT_model = GaussianDiffusion(
    CT_unet,
    objective = "pred_noise",
    image_size = 96,
    timesteps = 1000,
    sampling_timesteps = 250
).to(device)

CT_model.load_state_dict(torch.load(CT_weights, map_location=device)['model'])
CT_model.eval()

# Initialize the UNet model for MRI
MRI_unet = Unet(
    dim = 32,
    dim_mults = (1, 2, 4, 8),
    channels = 1
)

# Wrap it in the diffusion model and load the MRI weights
MRI_model = GaussianDiffusion(
    MRI_unet,
    objective = "pred_noise",
    image_size = 128,
    timesteps = 1000,
    sampling_timesteps = 250
).to(device)

MRI_model.load_state_dict(torch.load(MRI_weights, map_location=device)['model'])
MRI_model.eval()
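
As a hedged usage example (the file name, the 96x96 grayscale preprocessing, and the [-1, 1] scaling are assumptions; adapt them to your data), a single CT slice can then be scored with the btd_score sketch from the Methodology section:

from PIL import Image
import torchvision.transforms as T

# Hypothetical test slice; replace with a file from the downloaded test set.
transform = T.Compose([T.Grayscale(), T.Resize((96, 96)), T.ToTensor()])
slice_img = transform(Image.open("path_to_test_slice.png")).unsqueeze(0).to(device)
slice_img = slice_img * 2 - 1        # scale [0, 1] -> [-1, 1] (the library's default internal normalization)

with torch.no_grad():
    score = btd_score(CT_unet, slice_img)    # one-step reverse residual (sketch from the Methodology section)
print(f"BTD residual score: {score.item():.6f}")    # larger residuals are more suspicious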

BTD Evaluation Notebook

For a ready-to-use evaluation of the Back-in-Time Diffusion (BTD) models on the provided test sets, please refer to the BTD Evaluation Notebook.

This notebook demonstrates how to:

  1. Load the test sets from the Kaggle dataset.
  2. Load the pretrained models.
  3. Evaluate the models on the test sets.

This notebook is a comprehensive example for users looking to quickly assess the performance of BTD on medical deepfake detection tasks.
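
In outline, the evaluation reduces to scoring genuine and tampered slices and comparing the two score distributions. The sketch below is an illustration only: it reuses btd_score from the Methodology section and assumes scikit-learn for the ROC AUC (not in the dependency list above); the notebook's exact procedure may differ.

import numpy as np
import torch
from sklearn.metrics import roc_auc_score   # extra dependency used only in this sketch

def evaluate_auc(unet, real_images, fake_images):
    # real_images / fake_images: tensors of shape (N, 1, H, W), scaled to [-1, 1]
    with torch.no_grad():
        real_scores = btd_score(unet, real_images).cpu().numpy()
        fake_scores = btd_score(unet, fake_images).cpu().numpy()
    labels = np.concatenate([np.zeros(len(real_scores)), np.ones(len(fake_scores))])
    scores = np.concatenate([real_scores, fake_scores])
    return roc_auc_score(labels, scores)     # 1.0 means perfect separation of real vs. tampered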

Training

MRI Training

To train the BTD model on MRI data, use the MRI_trainer.py script. Before starting the training, ensure you have downloaded the MRI dataset from the following link:

Usage:
from denoising_diffusion_pytorch import Unet, GaussianDiffusion
from MRI_trainer import Trainer

# Initialize the UNet model
model = Unet(
    dim = 32,
    dim_mults = (1, 2, 4, 8),
    channels = 1
)

# Set up the diffusion process
diffusion = GaussianDiffusion(
    model,
    image_size = 128,
    objective = 'pred_noise',
    timesteps = 1000,           # number of steps
    sampling_timesteps = 250    # number of sampling timesteps (fewer than timesteps enables DDIM sampling for faster inference)
)

# Initialize the trainer
trainer = Trainer(
    diffusion,
    'path/to/MRI_images',
    train_batch_size = 64,
    train_lr = 8e-6,
    train_num_steps = 700000,         # total training steps
    gradient_accumulate_every = 2,    # gradient accumulation steps
    ema_decay = 0.995,                # exponential moving average decay
    #amp = True,                       # turn on mixed precision
    calculate_fid = True,             # whether to calculate fid during training
    save_and_sample_every = 20000,
    results_folder = 'path/to/MRI/checkpoints'
)

# Start training
trainer.train()

CT Training

To train the BTD model on CT data, use the CT_trainer.py script. Make sure to download the CT dataset from the following link before beginning the training:

Usage:
from denoising_diffusion_pytorch import Unet, GaussianDiffusion
from CT_trainer import Trainer

# Initialize the UNet model
model = Unet(
    dim = 32,
    dim_mults = (1, 2, 4, 8),
    channels = 1
)

# Set up the diffusion process
diffusion = GaussianDiffusion(
    model,
    image_size = 96,
    objective = 'pred_noise',
    timesteps = 1000,           # number of steps
    sampling_timesteps = 250    # number of sampling timesteps (fewer than timesteps enables DDIM sampling for faster inference)
)

# Initialize the trainer
trainer = Trainer(
    diffusion,
    "path/to/CT_images",
    train_batch_size = 64,
    train_lr = 8e-6,
    train_num_steps = 700000,         # total training steps
    gradient_accumulate_every = 2,    # gradient accumulation steps
    ema_decay = 0.995,                # exponential moving average decay
    #amp = True,                       # turn on mixed precision
    calculate_fid = True,             # whether to calculate fid during training
    save_and_sample_every = 20000,
    results_folder = 'path/to/CT/checkpoints'
)

# Start training
trainer.train()

This setup will allow you to train the BTD models on MRI and CT datasets efficiently. The training process will save the model checkpoints at specified intervals.
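
If the bundled trainers keep the upstream denoising_diffusion_pytorch Trainer interface (an assumption; check MRI_trainer.py / CT_trainer.py), a run can also be resumed from a saved milestone:

# Hedged sketch: with save_and_sample_every = 20000, milestone 10 corresponds to step 200000.
trainer.load(10)     # loads 'model-10.pt' from results_folder
trainer.train()      # continues training from the restored step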

Citation

If you use this code, the pretrained models, or any of the provided datasets in your research, please cite our paper:

@article{grabovski2024back,
  title={Back-in-Time Diffusion: Unsupervised Detection of Medical Deepfakes},
  author={Grabovski, Fred and Yasur, Lior and Amit, Guy and Elovici, Yuval and Mirsky, Yisroel},
  journal={arXiv preprint arXiv:2407.15169},
  year={2024}
}
