-
Notifications
You must be signed in to change notification settings - Fork 737
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Work plan and enhancement / 工作计划和用户诉求 #194
Comments
#182 #191 #47 #84 have similar content, all looking forward to the open-source CogVideoX I2V model. We are conducting research and evaluation. #111 #186 are similar, both expecting to provide fine-tuning work for VAE. We will try to place it in the fine-tuning version, and there is a probability that it can be adapted in diffusers fine-tuning, but it will consume relatively high resources |
5b image to video please! I2V would be lovely! |
The 3D VAE model consumes significantly more memory compared to diffusion models, which is severely limiting the batch size for fine-tuning. Any suggestions or optimizations to reduce memory usage would be greatly appreciated. |
You make a very good point. We will work together with the Diffusers team to modify the fake quantization (fakecp) process in the VAE section to optimize it for lower memory usage. Please give us some time, as we will collaborate with the Diffusers team to develop a version of the model that is fine-tuned specifically for Diffusers, which is expected to save a significant amount of memory. |
First of all, thank you for your excellent work! The dataset format used SAT way for fine tuning & full training be the same as the format that will be used for fine-tuning Diffusers version models? + wrong discord link |
We are currently completing several tasks
Work that has been completed
|
When will vertical video generation be supported? |
The current model cannot generate vertical videos, such as 480x720 resolution. We are working on fine-tuning to reach this capability, but it’s still in progress. Once we have any updates, we will share them as soon as possible. |
Two related issues working now:
Many thanks to @a-r-r-o-w for the help with these two tasks! |
when will CogVideoX-2B-I2V be released? |
|
@zRzRzRzRzRzRzR Many thanks to you and the team! I know fine-tuning vae is not very useful, but I'm curious is there any way I can just fine-tuning decoder part? |
Our publicly available fine-tuning code is for the fine-tuning of the transformers part, not for vae. We indeed have not updated the training and fine-tuning parts of vae (because I have not received the corresponding permissions either). |
@zRzRzRzRzRzRzR thank you but actually I'm asking how to fine-tuning VAE decoder, any advice? |
Hi @zRzRzRzRzRzRzR, what's your plan about diffuser I2V lora fine-tune code? Thanks! |
Thank you for your great works! I would like to covert a full-finetuned 2b model weight in sat into a model weight in diffusers. How can I do it? |
Tasks that have been identified and scheduled:
已经明确并排期的任务:
如果你有更多诉求,欢迎在这里提出
The text was updated successfully, but these errors were encountered: