This repository contains the official implementation of the Unsupervised Domain Adaptation (UDA) framework for Semantic Segmentation described in:
Ruben Mascaro, Lucas Teixeira, and Margarita Chli. Domain-Adaptive Semantic Segmentation with Memory-Efficient Cross-Domain Transformers. The British Machine Vision Conference (BMVC), 2023.
An overview of the proposed method is illustrated in the following diagram. For more details, please check our [Paper] and [Video].
This project relies on the MIM and MMSegmentation toolboxes. For the experiments in the paper, we specifically used MMSegmentation v0.16.0. Other requirements can be found in the `requirements.txt` file.
We recommend setting up a working conda environment as follows:
```shell
conda create -n memcdt python=3.8.5 pip=22.3.1
conda activate memcdt
pip install -r requirements.txt -f https://download.pytorch.org/whl/torch_stable.html
pip install mmcv-full==1.3.7 -f https://download.openmmlab.com/mmcv/dist/cu110/torch1.7/index.html
pip install openmim==0.1.5
pip install mmsegmentation==0.16.0
```
Note: The codebase should be compatible with newer versions of MMSegmentation (up to v0.30.0). If not using MMSegmentation v0.16.0, please check for compatible versions of PyTorch, CUDA, MMCV and MIM. Be aware that using different versions of these libraries may lead to changes in performance.
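To verify the environment, a minimal sanity check such as the following (run inside the activated `memcdt` environment) can confirm that the core dependencies are installed and that CUDA is visible:

```python
# Print the versions of the core dependencies and check CUDA visibility.
import torch
import mmcv
import mmseg

print(f"PyTorch:        {torch.__version__}")
print(f"CUDA available: {torch.cuda.is_available()}")
print(f"MMCV:           {mmcv.__version__}")
print(f"MMSegmentation: {mmseg.__version__}")
```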
We follow the same data preparation procedure as in DAFormer.
Cityscapes: Please download `leftImg8bit_trainvaltest.zip` and `gt_trainvaltest.zip` from here and extract them to `data/cityscapes`.
GTA: Please download all image and label packages from here and extract them to `data/gta`.
Synthia (Optional): Please download SYNTHIA-RAND-CITYSCAPES from here and extract it to `data/synthia`.
ACDC (Optional): Please download `rgb_anon_trainvaltest.zip` and `gt_trainval.zip` from here and extract them to `data/acdc`. Further, please restructure the folders from `condition/split/sequence/` to `split/` using the following commands:
```shell
rsync -a data/acdc/rgb_anon/*/train/*/* data/acdc/rgb_anon/train/
rsync -a data/acdc/rgb_anon/*/val/*/* data/acdc/rgb_anon/val/
rsync -a data/acdc/gt/*/train/*/*_labelTrainIds.png data/acdc/gt/train/
rsync -a data/acdc/gt/*/val/*/*_labelTrainIds.png data/acdc/gt/val/
```
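As a quick sanity check (assuming the commands above completed without errors), the flattened split folders should now contain the images and their `*_labelTrainIds.png` annotations directly, e.g.:

```python
# Count the files that now sit directly under the flattened split folders.
import os

rgb_train = os.listdir("data/acdc/rgb_anon/train")
gt_train = os.listdir("data/acdc/gt/train")
print(f"{len(rgb_train)} training images, {len(gt_train)} training labels")
```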
The final folder structure should look like this:
```
MemCDT
├── ...
├── data
│   ├── acdc (optional)
│   │   ├── gt
│   │   │   ├── train
│   │   │   ├── val
│   │   ├── rgb_anon
│   │   │   ├── train
│   │   │   ├── val
│   ├── cityscapes
│   │   ├── leftImg8bit
│   │   │   ├── train
│   │   │   ├── val
│   │   ├── gtFine
│   │   │   ├── train
│   │   │   ├── val
│   ├── gta
│   │   ├── images
│   │   ├── labels
│   ├── synthia (optional)
│   │   ├── RGB
│   │   ├── GT
│   │   │   ├── LABELS
├── ...
```
Data Preprocessing: Finally, please run the following scripts to convert the label IDs to the train IDs and to generate the class index for the DAFormer Rare Class Sampling (RCS) strategy:
```shell
python tools/convert_datasets/gta.py data/gta --nproc 8
python tools/convert_datasets/cityscapes.py data/cityscapes --nproc 8
python tools/convert_datasets/synthia.py data/synthia/ --nproc 8
```
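To spot-check the conversion, a converted annotation should only contain Cityscapes train IDs (0-18) plus the ignore index 255. The sketch below assumes the conversion scripts write `*_labelTrainIds.png` files next to the original labels; the exact file name is a placeholder:

```python
# Inspect the IDs present in one converted annotation.
import numpy as np
from PIL import Image

# Placeholder path: substitute any *_labelTrainIds.png produced above.
label = np.array(Image.open("data/gta/labels/00001_labelTrainIds.png"))
ids = np.unique(label)
print("Label IDs found:", ids)
# Valid Cityscapes train IDs are 0-18; 255 marks ignored pixels.
assert set(ids) <= set(range(19)) | {255}, "unexpected label ID"
```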
Please download the MiT weights (Google Drive | OneDrive) pretrained on ImageNet-1K, provided by the official SegFormer repository, and put them in a folder `pretrained/` within this project. For the experiments in this work, only `mit_b5.pth` is necessary.
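A quick way to confirm that the download is intact is to load the checkpoint and inspect it (a minimal sketch; SegFormer releases typically store the weights either directly as a state dict or under a `state_dict` key):

```python
# Load the backbone checkpoint on CPU and report its contents.
import torch

state = torch.load("pretrained/mit_b5.pth", map_location="cpu")
state = state.get("state_dict", state)  # unwrap if nested
print(f"{len(state)} parameter tensors, e.g. '{next(iter(state))}'")
```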
A training job can be launched using the `train.sh` script. The header of the file provides detailed instructions on how to use it.
Example: Train on GTA→Cityscapes using the proposed method (i.e. with the cross-domain TS branch enabled):
```shell
./tools/train.sh configs/daformer/gta2cityscapes/gta2cityscapes_dacs_daformer_mitb5_memtsbranch.py --seed 0 --work-dir /path/to/work_dir
```
A trained model can be evaluated using the `test.sh` script. The header of the file provides detailed instructions on how to use it.
Example: Evaluate a model trained using the proposed method on GTA→Cityscapes:
```shell
./tools/test.sh configs/daformer/gta2cityscapes/gta2cityscapes_dacs_daformer_mitb5_memtsbranch.py /path/to/checkpoint_file --eval mIoU
```
Our trained models for the three evaluated benchmarks can be downloaded using the links below. Since the results in the paper are reported as the mean over three random seeds, we provide the checkpoint with the median performance on the validation set.
- MemCD(DAFormer) for GTA→Cityscapes
- MemCD(DAFormer) for Synthia→Cityscapes
- MemCD(DAFormer) for Cityscapes→ACDC
This project builds on several open-source projects, including MMSegmentation, SegFormer, and DAFormer. We thank their authors for making the source code publicly available.
This project is released under the Apache License 2.0. However, some specific features in this repository are covered by other licenses. Please check `LICENSES.md` carefully if you are using this code for commercial purposes.
If you use this code in your academic work, please consider citing:
```bibtex
@inproceedings{mascaro2023memcdt,
  title={Domain-Adaptive Semantic Segmentation with Memory-Efficient Cross-Domain Transformers},
  author={Mascaro, Ruben and Teixeira, Lucas and Chli, Margarita},
  booktitle={The British Machine Vision Conference (BMVC)},
  year={2023}
}
```