-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why the shift is 17 in the distill_hunyuan.sh? #133
Comments
We follow Figure 11 of the Hunyuan Video paper |
This comment was marked as duplicate.
This comment was marked as duplicate.
Thanks for your reply, i have tried the same distilling pipeline on my video model, and the core distill configs are: --num_euler_timesteps: 50 but the validation results after training 18k are like these: 0000-A.stylish.woman.walks.down.a.Tokyo.street.filled.w.2.mp4Why did the student model degrade? |
Can you share your exact script & hardware & batch size, etc, for us to reproduce? |
My hardward is 64 H100, context parallel is 2, so batch size is 32. |
This email has rejected my requestion, like this: [email protected] | 投递失败,已退信 |
Motivation
Hello, why the shift is 17 in the distill_hunyuan.sh, i know the shift param is used to transform timestep to high noise area, but why it is 17 ? and i found the shift is actually 7 in hunyuanvideo git
Another question, i do not find any timestep transformation operations in finetune_hunyuan.sh because the weighting_scheme is set as "uniform", so why there is no timestep transfer?
Related resources
No response
The text was updated successfully, but these errors were encountered: