Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community

Jiancheng Pan*, Yanxing Liu*, Yuqian Fu✉, Muyuan Ma,

Jiaohao Li, Danda Pani Paudel, Luc Van Gool, Xiaomeng Huang✉

* Equal Contribution, ✉ Corresponding Author

News | Abstract | Dataset | Model | Statement

TODO

Release LAE-Label Engine
Release LAE-1M Dataset
Release LAE-DINO Model

News

[2024/8/17] Our paper "Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community" is now available on arXiv.

Abstract

Object detection, particularly open-vocabulary object detection, plays a crucial role in Earth sciences, in applications such as environmental monitoring, natural disaster assessment, and land-use planning. However, existing open-vocabulary detectors, primarily trained on natural-world images, struggle to generalize to remote sensing images due to a significant data domain gap. This paper therefore aims to advance open-vocabulary object detection in the remote sensing community. To achieve this, we first reformulate the task as Locate Anything on Earth (LAE), with the goal of detecting any novel concept on Earth. We then develop the LAE-Label Engine, which collects, auto-annotates, and unifies up to 10 remote sensing datasets to create LAE-1M, the first large-scale remote sensing object detection dataset with broad category coverage. Using LAE-1M, we further propose and train the novel LAE-DINO model, the first open-vocabulary foundation object detector for the LAE task, featuring Dynamic Vocabulary Construction (DVC) and Visual-Guided Text Prompt Learning (VisGT) modules. DVC dynamically constructs the vocabulary for each training batch, while VisGT maps visual features to the semantic space to enhance text features. We conduct comprehensive experiments on the established remote sensing benchmarks DIOR and DOTAv2.0, as well as on our newly introduced 80-class LAE-80C benchmark. The results demonstrate the advantages of the LAE-1M dataset and the effectiveness of the LAE-DINO method.

Dataset

The LAE-1M dataset covers abundant categories and is composed of the coarse-grained LAE-COD and the fine-grained LAE-FOD. LAE-1M samples from these datasets by category, and duplicate instances that arise from overlapping regions when slicing images are not counted.
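The counting rule above can be illustrated with a minimal sketch. It assumes that every annotation in a sliced tile keeps the id of its source image and the id of the original instance; this record layout and the function name `count_unique_instances` are hypothetical and are not taken from the LAE-Label Engine.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

# Hypothetical record layout: (source_image_id, instance_id, category).
TileAnnotation = Tuple[str, int, str]


def count_unique_instances(tile_annotations: Iterable[TileAnnotation]) -> Dict[str, int]:
    """Count instances per category, counting each original instance once.

    An instance that falls into the overlap of several tiles appears
    several times in the input but contributes only one count.
    """
    seen = set()
    counts: Counter = Counter()
    for image_id, instance_id, category in tile_annotations:
        key = (image_id, instance_id)
        if key in seen:
            continue  # same instance seen again in an overlapping tile
        seen.add(key)
        counts[category] += 1
    return dict(counts)


# Instance 0 of image "P0001" appears in two overlapping tiles.
tiles = [
    ("P0001", 0, "ship"),
    ("P0001", 0, "ship"),
    ("P0001", 1, "harbor"),
    ("P0002", 0, "ship"),
]
unique_counts = count_unique_instances(tiles)
```

Here `unique_counts` holds two ships and one harbor, because the repeated ship from the overlap is skipped.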

Model

The pipeline for solving the LAE task has two parts. The LAE-Label Engine expands the vocabulary for open-vocabulary pre-training. LAE-DINO is a DINO-based open-vocabulary detector with Dynamic Vocabulary Construction (DVC) and Visual-Guided Text Prompt Learning (VisGT); it follows a pre-training and fine-tuning paradigm for open-set and closed-set detection.
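As a rough illustration of what constructing a vocabulary per training batch could look like, the sketch below keeps every category annotated in the batch and pads the vocabulary with randomly sampled categories from the full pool. The padding strategy, the fixed `vocab_size`, and the function name `build_batch_vocabulary` are assumptions made for this example; they are not the released LAE-DINO implementation.

```python
import random
from typing import Iterable, List, Sequence


def build_batch_vocabulary(
    batch_labels: Iterable[Sequence[str]],
    category_pool: Sequence[str],
    vocab_size: int,
    rng: random.Random,
) -> List[str]:
    """Build the category list used as text prompt for one batch (assumed scheme)."""
    # Positives: every category annotated in at least one image of the batch.
    positives = set()
    for labels in batch_labels:
        positives.update(labels)
    positive_list = sorted(positives)

    # Candidate negatives: pool categories absent from the batch, de-duplicated.
    candidates = [c for c in dict.fromkeys(category_pool) if c not in positives]

    # Pad with random negatives up to vocab_size (never drop a positive).
    n_negatives = max(0, min(vocab_size - len(positive_list), len(candidates)))
    vocabulary = positive_list + rng.sample(candidates, n_negatives)
    rng.shuffle(vocabulary)  # avoid a fixed positive-first ordering
    return vocabulary


pool = ["airplane", "ship", "bridge", "harbor", "windmill", "stadium"]
batch = [["ship", "harbor"], ["airplane"]]
vocab = build_batch_vocabulary(batch, pool, vocab_size=5, rng=random.Random(0))
```

With these inputs `vocab` contains the three batch categories plus two sampled negatives, so the prompt length stays bounded even when the full category pool is large.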

Statement

Acknowledgement

This project references and uses the following open source models and datasets.

Related Open Source Models

MM-Grounding-DINO
segment-anything
InternVL
MTP

Related Open Source Datasets

DOTA Dataset
DIOR Dataset
FAIR1M Dataset
AID Dataset
RSICD Dataset
NWPU Dataset

Citation

If you find this work useful, please cite the following paper.

@misc{pan2024locateearthadvancingopenvocabulary,
    title={Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community}, 
    author={Jiancheng Pan and Yanxing Liu and Yuqian Fu and Muyuan Ma and Jiaohao Li and Danda Pani Paudel and Luc Van Gool and Xiaomeng Huang},
    year={2024},
    eprint={2408.09110},
    archivePrefix={arXiv},
    primaryClass={cs.CV},
    url={https://arxiv.org/abs/2408.09110}, 
}
