Xavier Bou, Gabriele Facciolo, Rafael Grompone, Jean-Michel Morel, Thibaud Ehret
Centre Borelli, ENS Paris-Saclay
This repository is the official implementation of the paper Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery.
🎉 Our Paper Has Been Accepted to the EarthVision Workshop at CVPR24! 🌍
The goal of this paper is to perform object detection in satellite imagery with only a few examples, thus enabling users to specify any object class with minimal annotation. To this end, we explore recent methods and ideas from open-vocabulary detection for the remote sensing domain. We develop a few-shot object detector based on a traditional two-stage architecture, where the classification block is replaced by a prototype-based classifier. A large-scale pre-trained model is used to build class-reference embeddings or prototypes, which are compared to region proposal contents for label prediction. In addition, we propose to fine-tune prototypes on available training images to boost performance and learn differences between similar classes, such as aircraft types. We perform extensive evaluations on two remote sensing datasets containing challenging and rare objects. Moreover, we study the performance of both visual and image-text features, namely DINOv2 and CLIP, including two CLIP models specifically tailored for remote sensing applications. Results indicate that visual features are largely superior to vision-language models, as the latter lack the necessary domain-specific vocabulary. Lastly, the developed detector outperforms fully supervised and few-shot methods evaluated on the SIMD and DIOR datasets, despite minimal training parameters.
- Overview
- Requirements
- Data preparation
- Create prototypes
- Fine-tune prototypes
- Evaluate
- Citation
- License and Acknowledgement
Detect your desired objects in optical remote sensing data via a few simple steps:
- Prepare the data with N labelled examples per category (we provide examples for N={5, 10, 30})
- Create class-reference prototypes and background prototypes
- Fine-tune class-reference embeddings
- Detect objects via RPN and the learned embeddings!
Create a conda environment and install the required packages as follows. You might need to adapt the package versions depending on your hardware:
conda create -n ovdsat python=3.9 -y
conda activate ovdsat
pip install torch==1.13.0+cu116 torchvision==0.14.0+cu116 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu116
python -m pip install 'git+https://github.com/facebookresearch/detectron2.git'
pip install opencv-python albumentations transformers
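Optionally, you can verify the environment before continuing. The following snippet is only a quick sanity check (it is not part of the repository scripts):

```python
# Quick sanity check of the environment (not part of the repository scripts).
import torch
import detectron2
import cv2
import albumentations
import transformers

print("torch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("detectron2:", detectron2.__version__)
```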
To set up the data and pre-trained weights, download the contents of the following Google Drive folder. We provide the same splits and labels we use in our article for the SIMD dataset (N = {5, 10, 30}). Add the data/ and weights/ directories into the project directory. The data path should follow the structure below for each dataset, e.g. simd, dior or your own:
data/
│
├── simd/
│ ├── train_coco_subset_N5.json
│ ├── train_coco_subset_N10.json
│ ├── train_coco_subset_N30.json
│ ├── train_coco_finetune_val.json
│ ├── val_coco.json
│ ├── train/
│ │ ├── image1.jpg
│ │ ├── image2.jpg
│ │ └── ...
│ └── val/
│ ├── image1.jpg
│ ├── image2.jpg
│ └── ...
│
├── dior/
│ ├── train_coco_subset_N5.json
│ ├── train_coco_subset_N10.json
│ ├── train_coco_subset_N30.json
│ └── ...
...
Note that train_coco_finetune_val.json is a small subset of the training data that serves as the validation set during fine-tuning, so no actual validation data is used for model selection.
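If you want to check that your data is laid out correctly, the COCO-style annotation files can be inspected with pycocotools (installed as a detectron2 dependency). This is an optional sanity check, not part of the repository scripts; the paths below assume the SIMD layout shown above:

```python
# Optional sanity check of the COCO-style annotation files (assumes the SIMD layout above).
import os
from pycocotools.coco import COCO

ann_file = "data/simd/train_coco_subset_N10.json"
img_dir = "data/simd/train"

coco = COCO(ann_file)
print("categories:", [c["name"] for c in coco.loadCats(coco.getCatIds())])
print("images:", len(coco.getImgIds()), "| annotations:", len(coco.getAnnIds()))

# Verify that every referenced image actually exists on disk.
missing = [img["file_name"] for img in coco.loadImgs(coco.getImgIds())
           if not os.path.isfile(os.path.join(img_dir, img["file_name"]))]
print("missing images:", len(missing))
```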
For the RPN, we pre-trained a Faster R-CNN model on DOTA using the code from DroneDetectron2. The pre-trained checkpoints can be found in the Google Drive directory. If you plan to use any of the remote sensing CLIP models tested in the paper, download the pre-trained weights (RemoteCLIP and GeoRSCLIP) and add them to the weights/ directory.
To generate the class-reference and background prototypes using DINOv2 features, run the following command:
bash scripts/init_prototypes.sh
Important: Set the DATA_DIR variable in the bash scripts to the path of your data. You can also adapt the datasets used and the value of N. If you run on other data, or your files/paths differ from ours, adapt the contents of the bash file to your own structure.
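The script implements the prototype-building step described in the paper: backbone features are pooled inside the annotated boxes and averaged per class to obtain class-reference embeddings. The sketch below illustrates the idea only and is not the repository's exact implementation; the DINOv2 variant, input size, and helper names are assumptions:

```python
# Minimal sketch of class-reference prototype initialisation from DINOv2 features.
# Illustration of the idea only; backbone variant and input size are assumptions.
import torch
import torch.nn.functional as F

model = torch.hub.load("facebookresearch/dinov2", "dinov2_vitb14").eval()  # assumed backbone
patch, img_size = 14, 518                                                  # 518 / 14 = 37 patches per side

@torch.no_grad()
def box_embedding(image, box):
    """Average the DINOv2 patch tokens that fall inside an (x1, y1, x2, y2) box."""
    x = F.interpolate(image[None], size=(img_size, img_size), mode="bilinear")
    tokens = model.forward_features(x)["x_norm_patchtokens"][0]        # (37*37, C)
    grid = tokens.view(img_size // patch, img_size // patch, -1)
    sx, sy = img_size / image.shape[2], img_size / image.shape[1]
    x1, y1, x2, y2 = box
    cols = slice(int(x1 * sx) // patch, max(int(x2 * sx) // patch, int(x1 * sx) // patch + 1))
    rows = slice(int(y1 * sy) // patch, max(int(y2 * sy) // patch, int(y1 * sy) // patch + 1))
    return grid[rows, cols].mean(dim=(0, 1))

def class_prototype(embeddings):
    """A class prototype is the normalised mean of its box embeddings over the N shots."""
    return F.normalize(torch.stack(embeddings).mean(0), dim=-1)
```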
Train the pre-initialised class-reference prototypes on the available data:
bash scripts/train_prototypes_bbox.sh
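Under the hood, the classifier is prototype-based: region features are compared to the class-reference embeddings, and the prototypes themselves are the trainable parameters while the backbone stays frozen. The following is a minimal sketch of that training objective, not the repository's exact training loop; the class name, temperature, and optimiser settings are assumptions:

```python
# Illustrative sketch of prototype fine-tuning: cosine-similarity logits + cross-entropy.
# Not the repository's exact code; temperature and optimiser settings are assumptions.
import torch
import torch.nn.functional as F

class PrototypeClassifier(torch.nn.Module):
    def __init__(self, init_prototypes, temperature=0.1):
        super().__init__()
        # Prototypes initialised from the class-reference embeddings, then fine-tuned.
        self.prototypes = torch.nn.Parameter(init_prototypes.clone())    # (num_classes, C)
        self.temperature = temperature

    def forward(self, box_features):                                      # (num_boxes, C)
        sims = F.normalize(box_features, dim=-1) @ F.normalize(self.prototypes, dim=-1).T
        return sims / self.temperature                                    # class logits

def train_step(classifier, optimizer, box_features, labels):
    # Only the prototypes receive gradients; the feature extractor is frozen.
    logits = classifier(box_features)
    loss = F.cross_entropy(logits, labels)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```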
Evaluate the learned prototypes on unseen data:
bash scripts/eval_detection.sh
If you find our work useful, please cite it as follows:
@article{Bou:2024,
title={Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery},
author={Bou, Xavier and Facciolo, Gabriele and von Gioi, Rafael Grompone and Morel, Jean-Michel and Ehret, Thibaud},
journal={arXiv preprint arXiv:2403.05381},
year={2024}
}
This project is licensed under the GNU Affero General Public License v3.0 - see the LICENSE file for details.