Skip to content

Commit

Permalink
add pre-emption callback to esm2 train (#433)
Browse files Browse the repository at this point in the history
Adds the callback to trigger checkpoint saving to the ESM2 pretraining
script. Now we just need to figure out how to add the `sbatch
--signal=TERM@300` to our virtualpipe script
  • Loading branch information
pstjohn authored Nov 15, 2024
1 parent 5c04442 commit 47ba1ec
Showing 1 changed file with 1 addition and 0 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -190,6 +190,7 @@ def main(
PerplexityLoggingCallback(log_train=False, log_val=True),
RichModelSummary(max_depth=4),
LearningRateMonitor(),
nl_callbacks.PreemptionCallback(),
]
if nsys_profiling:
if nsys_end_step is None:
Expand Down

0 comments on commit 47ba1ec

Please sign in to comment.