TSMSA

The Pytorch implementation of the our paper of UESTC-nnLab [TSMSA: Temporal Segmentation Modeling with Sample Augmentation for Moving Infrared Small Target Detection]

Abstract

Infrared Small Target Detection (IRSTD) has emerged as a critical and hot research topic within the broader field of object detection in recent years. Most sophisticated approaches rely on U-shaped neural networks to address the challenges with small target sizes and low contrast against the background. However, these methods predominantly focus on extracting semantic information from single images and often neglect the temporal relationships across multiple frames. Additionally, the targets in infrared images are typically sparse, leading to issues of insufficient sample data. To address the challenge of temporal modeling in infrared small target segmentation, this paper proposes a new scheme of Temporal Segmentation Modeling with Sample Augmentation (TSMSA). Our temporal data augmentation strategy includes two algorithms: one for target augmentation, randomly cloning target representations over time to generate sufficient training samples, and the other for batch augmentation, further diversifying training scenes. Moreover, we introduce a Convolutional LSTM-based Network that leverages Long Short-Term Memory (LSTM) cells for temporal modeling, effectively utilizing temporal relationships for improving segmentation. In our TSMSA scheme, we modify a cross-slice ConvLSTM node to capture spatio-temporal features from input video clips. A Motion-Coupling Module is designed o fuse the spatial features of the key frame with the spatio-temporal output from the ConvLSTM node, enhancing the integration of information across both spatial and temporal domains. Finally, the enhanced spatio-temporal features are progressively fused with multi-scale spatial features of the key frame to generate the final feature map.

Datasets

-MWIRDST and NUDT-MIRSDT

Usage

Train

-Single-frame-based methods

CUDA_VISIBLE_DEVICES=0 python train_{dataset}_single.py

-Multi-frame-based methods

CUDA_VISIBLE_DEVICES=0 python train_{dataset}.py

Test

CUDA_VISIBLE_DEVICES=0 python test.py

Comparision method

Results

Contact

IF any questions, please contact with Shuang Peng via email: [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
model		model
readme		readme
README.md		README.md
complexity.py		complexity.py
compute_metirc_image.py		compute_metirc_image.py
dataset.py		dataset.py
evulate.mat		evulate.mat
loss.py		loss.py
metrics.py		metrics.py
mshf_loss.py		mshf_loss.py
net.py		net.py
rename_img.py		rename_img.py
seq_dataset.py		seq_dataset.py
test.py		test.py
test_MWIRSTD.txt		test_MWIRSTD.txt
test_NUDTMIRSDT.txt		test_NUDTMIRSDT.txt
test_block.py		test_block.py
train.py		train.py
train_MWIRDST.py		train_MWIRDST.py
train_MWIRDST_single.py		train_MWIRDST_single.py
train_MWIRSTD.txt		train_MWIRSTD.txt
train_NUDTMIRSDT.py		train_NUDTMIRSDT.py
train_NUDTMIRSDT.txt		train_NUDTMIRSDT.txt
train_NUDTMIRSDT_single.py		train_NUDTMIRSDT_single.py
train_time.py		train_time.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TSMSA

Abstract

Datasets

Usage

Train

Test

Comparision method

Results

Contact

About

Uh oh!

Releases

Packages

Languages

UESTC-nnLab/TSMSA

Folders and files

Latest commit

History

Repository files navigation

TSMSA

Abstract

Datasets

Usage

Train

Test

Comparision method

Results

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages