[Official Repo] CrossEarth: Geospatial Vision Foundation Model for Promoting Cross-Domain Generalization in Remote Sensing Semantic Segmentation

Cuzyoung/CrossEarth

CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation

Ziyang Gong1 ∗, Zhixiang Wei2 ∗, Di Wang3 ∗, Xianzheng Ma3, Hongruixuan Chen4, Yuru Jia5,6, Yupeng Deng1, Zhenming Ji1 †, Xiangwei Zhu1 †, Naoto Yokoya4, Jing Zhang3, Bo Du3, Liangpei Zhang3

1 Sun Yat-sen University, 2 University of Science and Technology of China, 3 Wuhan University,

4 The University of Tokyo, 5 KU Leuven, 6 KTH Royal Institute of Technology

∗ Equal contribution, † Corresponding author



🔥🔥🔥 News

  • [2024/11/06] Most checkpoints have been uploaded; you can access them through the Hugging Face badges.

  • For the environment setup and inference steps, please refer to the installation section below. The inference code and weights are coming soon.

  • The benchmark collection from the paper is being released, and you can access it here.

  • 🎉🎉🎉 CrossEarth is the first VFM for Remote Sensing Domain Generalization (RSDG) semantic segmentation. We have just released the arXiv paper of CrossEarth. You can access CrossEarth here.


Visualization

In the radar figure:

  • CrossEarth achieves SOTA performance on 23 evaluation benchmarks across various segmentation scenes, demonstrating strong generalizability.

In the UMAP figures:

  • CrossEarth extracts features that cluster closely for the same class across different domains, forming well-defined groups in feature space. This demonstrates its ability to learn robust, domain-invariant features.

  • Moreover, CrossEarth features exhibit high inter-class separability, forming a distinct cluster for each class and underscoring its strong representational ability to distinguish different categories.

Environment Requirements:

conda create -n CrossEarth -y
conda activate CrossEarth
conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.7 -c pytorch -c nvidia -y
pip install -U openmim
mim install mmengine
mim install "mmcv>=2.0.0"
pip install "mmsegmentation>=1.0.0"
pip install "mmdet>=3.0.0"
pip install xformers=='0.0.20' 
pip install -r requirements.txt
pip install future tensorboard
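
After running the commands above, it can help to confirm that the pinned versions actually landed in the active environment. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `check_environment` are hypothetical, and it assumes the installed distributions are visible to Python's standard `importlib.metadata`. Packages that the commands above install with only a lower bound are checked for presence only.

```python
from importlib import metadata

# Exact pins copied from the install commands above. Packages that are
# installed there with only a lower bound are mapped to None and are
# checked for presence only.
EXPECTED = {
    "torch": "2.0.1",
    "torchvision": "0.15.2",
    "torchaudio": "2.0.2",
    "xformers": "0.0.20",
    "mmengine": None,
    "mmcv": None,
    "mmsegmentation": None,
    "mmdet": None,
}


def installed_version(dist_name):
    """Return the installed version string, or None if not installed."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_environment(expected=EXPECTED):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    for name, pin in expected.items():
        found = installed_version(name)
        if found is None:
            problems.append(f"{name}: not installed")
        # Ignore local build suffixes such as "+cu117" when comparing.
        elif pin is not None and found.split("+")[0] != pin:
            problems.append(f"{name}: found {found}, expected {pin}")
    return problems


if __name__ == "__main__":
    issues = check_environment()
    print("\n".join(issues) if issues else "Environment matches the pinned versions.")
```

Running the script inside the activated CrossEarth environment prints one line per mismatch, or a single confirmation line if nothing is off.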

Inference steps:

First, download the model weights from Hugging Face or Baidu Netdisk via the badges above. Note that the checkpoints dinov2_converted.pth and dinov2_converted_1024x1024.pth are required for inference. Please download them and put them in the CrossEarth/checkpoints folder.
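
A quick way to confirm that the two required DINOv2 checkpoints are in place before launching inference is sketched below. This is an illustrative helper rather than repository code: the function name `missing_checkpoints` is hypothetical, and the default folder assumes the script is run from the CrossEarth repository root.

```python
from pathlib import Path

# File names taken from the inference instructions above.
REQUIRED_CHECKPOINTS = (
    "dinov2_converted.pth",
    "dinov2_converted_1024x1024.pth",
)


def missing_checkpoints(folder="checkpoints", required=REQUIRED_CHECKPOINTS):
    """Return the required checkpoint files that are absent from `folder`."""
    root = Path(folder)
    return [name for name in required if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_checkpoints()
    if missing:
        print("Missing checkpoints:", ", ".join(missing))
    else:
        print("All required checkpoints found.")
```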

Second, change the file paths in the experiment config files (configs/base/datasets/xxx.py and configs/CrossEarth_dinov2/xxx.py), then run the following command to perform inference (taking 512x512 inference as an example):

python tools/test.py configs/CrossEarth_dinov2/CrossEarth_dinov2_mask2former_512x512_bs1x4.py ./checkpoints/xxx.pth

Note that the save path for pseudo labels is set in the experiment config file. When testing CrossEarth on different benchmarks, you also need to change the number of classes in the CrossEarth_dinov2_mask2former.py file.
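
When evaluating several checkpoints in a row, the command above can be wrapped in a small script. The sketch below is an assumption-laden convenience wrapper, not part of the repository: `build_test_command` and `run_inference` are hypothetical names, and it only reproduces the two positional arguments shown above (config path, then checkpoint path). It must be run from the repository root so that tools/test.py resolves, and the number of classes still has to be edited in the config by hand for each benchmark.

```python
import subprocess
import sys


def build_test_command(config, checkpoint, python=sys.executable):
    """Assemble the tools/test.py invocation shown above as an argument list."""
    return [python, "tools/test.py", config, checkpoint]


def run_inference(config, checkpoint):
    """Run the test script and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_test_command(config, checkpoint), check=True)


# Example call (the checkpoint name is a placeholder, as in the command above):
# run_inference(
#     "configs/CrossEarth_dinov2/CrossEarth_dinov2_mask2former_512x512_bs1x4.py",
#     "./checkpoints/xxx.pth",
# )
```

Passing the arguments as a list avoids shell quoting issues if a checkpoint path contains spaces.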

Training steps:

Coming soon.

Model Weights with Configs

| Dataset | Benchmark | Model | Config | Log |
| --- | --- | --- | --- | --- |
| ISPRS Potsdam and Vaihingen | P(i)2V | Coming Soon | Coming Soon | Coming Soon |
| - | P(i)2P(r) | - | - | - |
| - | P(r)2P(i) | Coming Soon | Coming Soon | Coming Soon |
| - | P(r)2V | - | - | - |
| - | V2P(i) | Coming Soon | Coming Soon | Coming Soon |
| - | V2P(r) | - | - | - |
| LoveDA | Urban2Rural | Coming Soon | Coming Soon | Coming Soon |
| - | Rural2Urban | Coming Soon | Coming Soon | Coming Soon |
| WHU Building | A2S | Coming Soon | Coming Soon | Coming Soon |
| - | S2A | Coming Soon | Coming Soon | Coming Soon |
| DeepGlobe and Massachusetts | D2M | Coming Soon | Coming Soon | Coming Soon |
| ISPRS Potsdam and RescueNet | P(r)2Res | Coming Soon | Coming Soon | Coming Soon |
| - | P(i)2Res | Coming Soon | Coming Soon | Coming Soon |
| CASID | Sub2Sub | Coming Soon | Coming Soon | Coming Soon |
| - | Sub2Tem | - | - | - |
| - | Sub2Tms | - | - | - |
| - | Sub2Trf | - | - | - |
| - | Tem2Sub | Coming Soon | Coming Soon | Coming Soon |
| - | Tem2Tem | - | - | - |
| - | Tem2Tms | - | - | - |
| - | Tem2Trf | - | - | - |
| - | Tms2Sub | Coming Soon | Coming Soon | Coming Soon |
| - | Tms2Tem | - | - | - |
| - | Tms2Trf | - | - | - |
| - | Trf2Sub | Coming Soon | Coming Soon | Coming Soon |
| - | Trf2Tem | - | - | - |
| - | Trf2Tms | - | - | - |
| - | Trf2Trf | - | - | - |

Citation

If you find CrossEarth helpful, please consider giving this repo a ⭐ and citing:

@article{crossearth,
  title={CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation},
  author={Gong, Ziyang and Wei, Zhixiang and Wang, Di and Ma, Xianzheng and Chen, Hongruixuan and Jia, Yuru and Deng, Yupeng and Ji, Zhenming and Zhu, Xiangwei and Yokoya, Naoto and Zhang, Jing and Du, Bo and Zhang, Liangpei},
  journal={arXiv preprint arXiv:2410.22629},
  year={2024}
}

