
Clarification on Number of Training Steps #65

Open
alberthli opened this issue Jan 2, 2025 · 6 comments

@alberthli

In the paper and in the config comment, you state that you train the model for 1M steps each for the discriminator and generator. However, the config itself uses

```yaml
max_steps: 20000000  # 20M, not 2M!
```

Could you clarify which of these numbers is a typo?

@jishengpeng
Owner

> In the paper and in the config comment, you state that you train the model for 1M steps each for the discriminator and generator. However, the config itself uses
>
> ```yaml
> max_steps: 20000000  # 20M, not 2M!
> ```
>
> Could you clarify which of these numbers is a typo?

Thank you for your attention. We set an upper limit of 2 million steps for training; in practice, the run is often terminated earlier based on observations from TensorBoard.
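
For context, a minimal sketch of what this arrangement might look like, assuming a PyTorch Lightning-style trainer with TensorBoard logging (the trainer wiring, log directory, and the `model`/`datamodule` names are assumptions for illustration, not the repo's actual code):

```python
# Hypothetical sketch: max_steps acts only as a hard ceiling on training.
# The run is typically stopped earlier by hand (e.g. Ctrl+C) once the
# TensorBoard curves plateau, rather than by an automated condition.
import pytorch_lightning as pl

trainer = pl.Trainer(
    max_steps=2_000_000,  # upper limit; often not reached in practice
    logger=pl.loggers.TensorBoardLogger("logs/"),  # assumed log directory
)
# trainer.fit(model, datamodule=datamodule)  # interrupted manually when metrics flatten
```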

@alberthli
Author

So you mean to say that the number 20000000 (20 million, not 2 million) in the config is always set, but you terminate early, usually around 2 million steps instead? And do you just manually terminate the run rather than using some automated condition?

@jishengpeng
Owner

> So you mean to say that the number 20000000 (20 million, not 2 million) in the config is always set, but you terminate early, usually around 2 million steps instead? And do you just manually terminate the run rather than using some automated condition?

2 million, not 20 million.

@alberthli
Author

I'm not sure I understand. The number in your config is not 2 million; it is 20 million (there are 7 zeros, not 6). Are you saying that the number specified in the paper and the comments is wrong, or that the config is wrong? These two numbers are not consistent with each other.

@jishengpeng
Owner

> I'm not sure I understand. The number in your config is not 2 million; it is 20 million (there are 7 zeros, not 6). Are you saying that the number specified in the paper and the comments is wrong, or that the config is wrong? These two numbers are not consistent with each other.

We have updated the config for clarity. Thank you.

@alberthli
Author

Thank you for updating the config - did you use 2M or 20M when training the models presented in the paper? This affects things like the learning rate scheduler.
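
For illustration, a minimal sketch of why the value of `max_steps` matters here, assuming a step-based cosine decay tied to the total step budget (the `cosine_lr` helper and the base learning rate are hypothetical; the repo's actual scheduler may differ):

```python
# Hypothetical sketch: a cosine LR decay whose period is set by max_steps.
import math

def cosine_lr(step, max_steps, base_lr=2e-4, min_lr=0.0):
    """Cosine decay from base_lr to min_lr over max_steps."""
    progress = min(step / max_steps, 1.0)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))

# At step 1M, the two candidate budgets give very different learning rates:
print(cosine_lr(1_000_000, max_steps=2_000_000))   # ~1.0e-4, halfway through the decay
print(cosine_lr(1_000_000, max_steps=20_000_000))  # ~1.99e-4, barely decayed
```

With a 2M cap, the learning rate at step 1M is halfway through its decay; with a 20M cap it has barely moved, so the two settings would train differently even if both runs were stopped at 2M steps.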
