Skip to content

Filters ink out of whole slide images. Potentially can also correct it

License

Notifications You must be signed in to change notification settings

Vishwesh4/Ink-WSI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Ink Removal in Whole Slide Images using Hallucinated Data

This repository contains the code for Ink Removal in Whole Slide Images using Hallucinated Data. The trained model weights are at this link

Description

This project is about identifying and removing ink markings from histopathology whole slides for aiding downstream computational analysis. The algorithm requires no annotation or manual curation of data and requires only clean slides, making it easy to adapt and deploy in new set of histopathology slides.

Methodology

The methodlogy consists of two networks:-

  1. Ink filter: A binary classifier with Resnet 18 backbone
  2. Ink corrector: Pix2pix module for removing ink from a patch by image to image translation An overview of the methodology and its results are shown below

Fig.1 - Methodology overview

Fig.2 - Ink filter output

Fig.3 - Pix2pix output

Getting Started

Dependencies

opencv
dominate
visdom
trainer - https://github.com/Vishwesh4/TrainerCode
pytorch-gpu
wandb
openslide
scikit-learn
scipy
scikit-image

Modules

The project has 6 modules:-

  1. Ink filter module - ./train_filter
  2. Ink removal module (Pix2pix) - ./ink_removal
  3. Patch Extraction - ./modules/patch_extraction
  4. Image Metric Calculate - ./modules/metrics
  5. Registration - ./modules/register
  6. Deployment of methodology over new slides - ./deploy

Ink Filter module

  1. The model can be trained by modifying config.yml file, specifying the location of path of clean slides to be used, and set of colors to be used
  2. The training can be done by using
python train.py -c [CONFIG FILE LOCATION]

Ink Removal module

  1. The code has been taken from the original repository link
  2. For training with your own dataset, please follow a similar code structure to ./ink_removal/data/dcisink_dataset.py or ./ink_removal/data/tiger_dataset.py. Mixture of the two datasets was used for the given model ./ink_removal/data/mixed_dataset.py
  3. The model can be trained by using
./train_pix2pix.sh
  1. The model can be tested by using
./test_pix2pix.sh

For testing, corresponding ink and clean slides should be available 5. The image metrics can be calculated by using

./run_calc_metrics.sh

The test model name has to be specified

Deploy module

  1. The modules can be deployed using the class Ink_deploy. An example is shown in ./deploy/process.py. It also has a script ./deploy/construct_wsi.py for running algorithm over a whole slide image, however it expects sedeen annotation.
ink_deploy = Ink_deploy(filter_path:str=INK_PATH,
                        output_dir:str=None, 
                        pix2pix_path:str=PIX2PIX_PATH, 
                        device=torch.device("cpu"))

Authors

Contact

If you want to contact, you can reach the authors by raising an issue or email at [email protected]

Acknowledgments

  • The registeration code ./modules/register/register.py was developed by Wenchao Han at Sunnybrook Research Institute ([email protected])
  • The pix2pix code was taken from link
  • The ./modules/metrics/quality_metrics.py code was taken from link

Cite

@inproceedings{ramanathan2023ink,
  title={Ink removal in whole slide images using hallucinated data},
  author={Ramanathan, Vishwesh and Han, Wenchao and Bassiouny, Dina and Rakovitch, Eileen and Martel, Anne L},
  booktitle={Medical Imaging 2023: Digital and Computational Pathology},
  volume={12471},
  pages={230--238},
  year={2023},
  organization={SPIE}
}

About

Filters ink out of whole slide images. Potentially can also correct it

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published