🔮 Readout Guidance: Learning Control from Diffusion Features

Grace Luo, Trevor Darrell, Oliver Wang, Dan B Goldman, Aleksander Holynski

This repository contains the PyTorch implementation of Readout Guidance: Learning Control from Diffusion Features.

This is not an officially supported Google product.

[Project Page][arXiv]

Releases

🚀 2024/04/26: Additional code for pose estimation with readout heads in the readout_pose directory.
🚀 2024/01/31: Initial codebase release with demos for drag-based manipulation and spatial control, as well as readout head training code. Includes weights for SDXL and SDv1-5 readout heads for appearance, correspondence, depth, edge, pose.

Setup

This code was tested with Python 3.8. To install the necessary packages, please run:

conda env create -f environment.yml
conda activate readout

Readout Heads

All model weights can be found on our HuggingFace page. To automatically download the weights run:

./download_weights.sh

Readout Head Type	SDv1-5	SDXL
Pose Head	download	download
Depth Head	download	download
Edge Head	download	download
Correspondence Feature Head	download	download
Appearance Similarity Head	download	download

Demos

Note that the generation process is non-deterministic, even without Readout Guidance, so re-running the same cell or script with the exact same settings can yield better results.

demo_drag.ipynb: This demo walks through drag-based manipulation on either real images or generated images, where the user can also annotate the desired drags.
demo_spatial.ipynb: This demo walks through spatial control with the pose head on pose inputs derived from MSCOCO images.

Generation Scripts

You can also automatically generate many samples using the following scripts.

conda activate readout

# Run drag-based manipulation on samples in data/drag/real
python3 script_drag.py configs/drag_real.yaml

# Run spatial control on samples in data/spatial/pose
python3 script_spatial.py configs/spatial.yaml

Training Code

To train your own readout heads, please check out readout_training/README.md.

Citing

@inproceedings{luo2024readoutguidance,
    title={Readout Guidance: Learning Control from Diffusion Features},
    author={Grace Luo and Trevor Darrell and Oliver Wang and Dan B Goldman and Aleksander Holynski},
    journal={CVPR},
    year={2024}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔮 Readout Guidance: Learning Control from Diffusion Features

Releases

Setup

Readout Heads

Demos

Generation Scripts

Training Code

Citing

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
annotations/spatial		annotations/spatial
configs		configs
data		data
dhf		dhf
readout_guidance		readout_guidance
readout_pose		readout_pose
readout_training		readout_training
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
demo_drag.ipynb		demo_drag.ipynb
demo_spatial.ipynb		demo_spatial.ipynb
download_weights.sh		download_weights.sh
environment.yml		environment.yml
script_drag.py		script_drag.py
script_spatial.py		script_spatial.py

License

google-research/readout_guidance

Folders and files

Latest commit

History

Repository files navigation

🔮 Readout Guidance: Learning Control from Diffusion Features

Releases

Setup

Readout Heads

Demos

Generation Scripts

Training Code

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages