Arxiv Link: TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp Segmentation
Colorectal cancer is one of the most common causes of cancer and cancer-related mortality worldwide. Colonoscopy is the primary technique to diagnose colon cancer. However, the polyp miss rate is significantly high. Early detection of polyp at the precancerous stage can help reduce the mortality rate. Deep learning based computer-aided diagnosis system may help colonoscopists to identify neoplastic polyps and act as a second pair of eyes. Additionally, CADX system could be a cost-effective solution and might contribute to long-term colorectal cancer prevention. In this study, we proposed deep learning-based architecture, Transformer ResU-Net (TransResU-Net), for automatic polyp segmentation. TresResU-Net is an encoder-decoder based architecture built upon residual block and takes the advantage of transformer self-attention mechanism and dilated convolution. Experimental result on two publicly available polyp segmentation benchmark datasets shows that TransResU-Net obtained a promising dice coefficient and a real-time speed.
The proposed architecture is implemented using the PyTorch framework (1.9.0+cu111) with a single GeForce RTX 3090 GPU of 24 GB memory.
We have used the following datasets:
BKAI dataset follows an 80:10:10 split for training, validation and testing, while the Kvasir-SEG follows an official split of 880/120.
Qualitative results comparison along with the heatmap@article{tomar2022transresu, title={TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy Polyp Segmentation}, author={Tomar, Nikhil Kumar and Shergill, Annie and Rieders, Brandon and Bagci, Ulas and Jha, Debesh}, journal={arXiv preprint arXiv:2206.08985}, year={2022} }
The source code is free for research and education use only. We allow comercial use, however a prior permission is required.
Please contact [email protected] for any further questions.