-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Add timing tracker to fms-hf-tuning #378
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: Will Johnson <[email protected]>
Signed-off-by: Will Johnson <[email protected]>
Signed-off-by: Will Johnson <[email protected]>
Signed-off-by: Will Johnson <[email protected]>
Thanks for making a pull request! 😃 |
Signed-off-by: Will Johnson <[email protected]>
Signed-off-by: Will Johnson <[email protected]>
@willmj should we consider using https://github.com/foundation-model-stack/hf-resource-scanner for tracking? Cc: @ChanderG |
Yes definitely, thanks for the suggestion @ashokponkumar - I had not heard of this scanner. I can look into this implementation. |
Signed-off-by: Will Johnson <[email protected]>
Signed-off-by: Will Johnson <[email protected]>
Signed-off-by: Will Johnson <[email protected]>
@will Johnson few questions for you...
|
By putting a space in between will and Johnson it sends an alert to me and not Mr Johnson |
Hey @anhuong sorry for the late response:
|
Description of the change
This change will add another tracker to
fms-hf-tuning
which will, per process, log the training time. Additionally, if tuning is run usingaccelerate_launch.py
, it will log the entire run time from beginning to end (including pre and post processing).Here is an example log of a run with two processes:
Questions:
Related issue number
How to verify the PR
Run these changes in a dev branch on any model and view the output directory once training is complete.
Was the PR tested