Epoch Starts from 1 When Using --continue_train #1686

Open
H-skyfxxcker opened this issue Jan 16, 2025 · 0 comments

Hello everyone,

I have a question about the --continue_train flag. When I pass this flag, the terminal shows the epoch count starting from 1, so I can't tell whether training is actually resuming from my previous run or starting over from scratch.

I had saved the training state from the earlier run, but with this command the training appears to restart. Is this normal behavior, or have I made an error somewhere? Which settings or files should I check to troubleshoot this?
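
To make the question concrete, here is a minimal sketch of the behavior I suspect. It is not the project's actual code: `save_checkpoint`, `start_training`, the JSON checkpoint format, and the `epoch_count` parameter are all hypothetical names I made up for illustration. It assumes the resume flag restores only the saved weights, while the epoch number that gets printed comes from a separate setting that defaults to 1.

```python
import json
import os
import tempfile


def save_checkpoint(path, weights, epoch):
    """Write the weights and the epoch they were saved at to a JSON file."""
    with open(path, "w") as f:
        json.dump({"weights": weights, "epoch": epoch}, f)


def start_training(path, continue_train, epoch_count=1):
    """Return (weights, first_epoch) for a new run.

    Hypothetical behavior: continue_train restores the weights only;
    the epoch counter comes from epoch_count, not from the file.
    """
    weights = [0.0, 0.0]  # fresh initialization
    if continue_train and os.path.exists(path):
        with open(path) as f:
            weights = json.load(f)["weights"]
    return weights, epoch_count


with tempfile.TemporaryDirectory() as d:
    ckpt = os.path.join(d, "latest.json")
    save_checkpoint(ckpt, [0.5, -1.2], epoch=20)

    # Resuming: the weights come from the checkpoint, but the counter still says 1.
    weights, first_epoch = start_training(ckpt, continue_train=True)
    print(weights, first_epoch)  # [0.5, -1.2] 1
```

If the real implementation works along these lines, an epoch counter of 1 would not by itself mean the saved state was ignored, and the thing to verify would be whether the checkpoint files were actually found and loaded.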

Thank you for your assistance!
