Authors: Haoran Zhang, Shuanghao Bai, Wanqi Zhou, Jingwen Fu, Badong Chen.
This repo contains implementations of several source-free domain generalization methods: zero-shot CLIP, PromptStyler, and PromptTA (ours).
Abstract: Source-free domain generalization (SFDG) tackles the challenge of adapting models to unseen target domains without access to source domain data. To deal with this challenging task, recent advances in SFDG have primarily focused on leveraging the text modality of vision-language models such as CLIP. These methods involve developing a transferable linear classifier based on diverse style features extracted from the text and learned prompts or deriving domain-unified text representations from domain banks. However, both style features and domain banks have limitations in capturing comprehensive domain knowledge. In this work, we propose the Prompt-Driven Text Adapter (PromptTA) method, which is designed to better capture the distribution of style features and employ resampling to ensure thorough coverage of domain knowledge. To further leverage this rich domain information, we introduce a text adapter that learns from these style features for efficient domain information storage. Extensive experiments conducted on four benchmark datasets demonstrate that PromptTA achieves state-of-the-art performance.
Main Contributions
- We propose PromptTA, a novel adapter-based framework for SFDG that incorporates a text adapter to effectively leverage rich domain information.
- We introduce style feature resampling, which ensures comprehensive coverage of textual domain knowledge (a minimal sketch of both components follows this list).
- Extensive experiments demonstrate that our PromptTA achieves the state of the art on DG benchmarks.
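Below is a minimal, illustrative PyTorch sketch of the two components above. It is not the repository's actual implementation: it assumes pre-extracted CLIP text style features, a residual MLP text adapter (in the spirit of CLIP-Adapter), and Gaussian resampling of per-dimension style statistics; all names, shapes, and hyperparameters are placeholders.
# Illustrative sketch only (not the repo's code): a residual text adapter over
# CLIP-sized text features, plus Gaussian resampling of style feature statistics.
import torch
import torch.nn as nn

class TextAdapter(nn.Module):
    """Lightweight residual adapter applied to pre-extracted text features."""
    def __init__(self, dim=1024, hidden=256, ratio=0.2):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(dim, hidden),
            nn.ReLU(inplace=True),
            nn.Linear(hidden, dim),
        )
        self.ratio = ratio  # blend between adapted and original features

    def forward(self, feats):
        adapted = self.mlp(feats)
        return self.ratio * adapted + (1 - self.ratio) * feats

def resample_style_features(style_bank, num_samples):
    """Draw new style features from a per-dimension Gaussian fitted to the bank,
    so training sees broader coverage of the style distribution."""
    mean = style_bank.mean(dim=0)
    std = style_bank.std(dim=0) + 1e-6
    return mean + std * torch.randn(num_samples, style_bank.shape[-1])

# Toy usage with random tensors standing in for CLIP text style features.
bank = torch.randn(80, 1024)            # e.g., 80 diverse style features
adapter = TextAdapter(dim=1024)
resampled = resample_style_features(bank, num_samples=32)
refined = adapter(torch.cat([bank, resampled], dim=0))
print(refined.shape)                    # torch.Size([112, 1024])
At inference, source-free methods of this kind pair the learned text-side classifier with features from the frozen CLIP image encoder; see the paper for the exact formulation.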
For installation and package requirements, follow the instructions below. This codebase is tested on Ubuntu 20.04 LTS with Python 3.7.
- Set up the conda environment.
# Create a conda environment
conda create -y -n prompt_ta python=3.7
# Activate the environment
conda activate prompt_ta
# Install torch (requires version >= 1.8.1) and torchvision
# Please refer to https://pytorch.org/get-started/previous-versions/ if your CUDA version is different
conda install pytorch==1.12.0 torchvision==0.13.0 torchaudio==0.12.0 cudatoolkit=11.3 -c pytorch
- Install dassl library.
# Instructions borrowed from https://github.com/KaiyangZhou/Dassl.pytorch#installation
# Clone the Dassl repo
git clone https://github.com/KaiyangZhou/Dassl.pytorch.git
cd Dassl.pytorch
# Install dependencies
pip install -r requirements.txt
# Install dassl library (no need to re-build if the source code is modified)
python setup.py develop
cd ..
- Install clip library.
# These dependencies may already have been installed in the previous steps
pip install ftfy regex tqdm
# Install the clip library from GitHub
pip install git+https://github.com/openai/CLIP.git
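As an optional sanity check (not part of the original instructions), the short Python snippet below, saved for example as a hypothetical check_install.py, verifies that torch, CUDA, and the clip package are importable:
# Optional sanity check: confirm torch, CUDA, and clip are usable
import torch
import clip

print(torch.__version__, torch.cuda.is_available())
print(clip.available_models())  # e.g. ['RN50', 'RN101', ..., 'ViT-B/32', ...]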
- Clone the PromptTA repository and install its requirements.
# Clone the PromptTA code base
git clone https://github.com/zhanghr2001/PromptTA.git
cd PromptTA
# Install requirements
pip install -r requirements.txt
Download datasets:
For PACS, VLCS, and OfficeHome, unzip the files and keep the original folder names (pacs, VLCS, office_home_dg). For DomainNet, place the extracted image folders and train/test splits in the structure below, or modify the configuration in the datasets folder to fit your own file structure.
your_directory
├─domainnet
│ ├─images
│ │ ├─clipart
│ │ └─infograph
│ └─splits
│ ├─clipart_train.txt
│ ├─clipart_test.txt
│ ├─infograph_train.txt
│ └─infograph_test.txt
├─office_home_dg
├─pacs
└─VLCS
Scripts for training and evaluation are in the scripts folder. Set DATA to your dataset directory before running.
# Example: train and evaluate on the PACS dataset with a ResNet-50 backbone on GPU 0
bash scripts/prompt_ta/main_ta_all.sh pacs b128_ep50_pacs RN50 0
If our code is helpful to your research or projects, please consider citing:
@misc{zhang2024prompttapromptdriventextadapter,
title={PromptTA: Prompt-driven Text Adapter for Source-free Domain Generalization},
author={Haoran Zhang and Shuanghao Bai and Wanqi Zhou and Jingwen Fu and Badong Chen},
year={2024},
eprint={2409.14163},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2409.14163},
}
Our README style follows PDA, and our code is based on CoOp and PromptStyler. We thank the authors for their great work.