RealCustom++

Existing text-to-image customization (or subject-driven generation) methods follow the pseudo-word paradigm, which involves representing given subjects as pseudo-words and combining them with given texts to collectively guide the generation. However, the inherent conflict and entanglement between the pseudo-words and texts result in a dual-optimum paradox, where subject similarity and text controllability cannot be optimal simultaneously. In this paper, we present RealCustom++, for the first time, disentangles subject similarity from text controllability and thereby allows both to be optimized simultaneously without any conflicts. The core idea of RealCustom++ is to represent given subjects as real words that can be seamlessly integrated with given texts, and further leveraging the relevance between real words and image regions to disentangle subjects from texts.

Enjoy on Dreamina at Two Steps

RealCustom++ has now been commercially applied in Dreamina, ByteDance. You can enjoy the customized generation for any subjects you like following the two steps:

Step 1: Create A Character:

Create character images and corresponding appearance descriptions through prompt descriptions, uploading reference images. Specifically: 1. Character Image: Best in clean background, close-up, prominent subject, high-quality resolution. 2. Character Description: Brief, includes the subject and key appearance elements.

Step 2: Character-Driven Generation:

Input prompts where the subject is replaced by the selected character, guiding the character to make corresponding changes such as style, actions, expressions, scenes, and modifiers. There is no need to add descriptions of the subject in the prompt. "Face Reference Strength" is the weight for ID retention, and "Body Reference Strength" is the weight for IP retention.

About Code Release

Unfortunately, according to company policy, considering the model's performance is closely aligned with Dreamina's online effects, we are currently unable to open-source the code and model. We plan to open-source RealCustom++ after the next version update in Dreamina. Please stay tuned!

Reference

@inproceedings{huang2024realcustom,
  title={RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization},
  author={Huang, Mengqi and Mao, Zhendong and Liu, Mingcong and He, Qian and Zhang, Yongdong},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={7476--7485},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
assets		assets
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RealCustom++

Enjoy on Dreamina at Two Steps

Step 1: Create A Character:

Step 2: Character-Driven Generation:

About Code Release

Reference

About

Releases

Packages

License

Corleone-Huang/RealCustomProject

Folders and files

Latest commit

History

Repository files navigation

RealCustom++

Enjoy on Dreamina at Two Steps

Step 1: Create A Character:

Step 2: Character-Driven Generation:

About Code Release

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages