@liujiqiang999 Do not register the hooks.
# accelerator.register_save_state_pre_hook(save_model_hook)
# accelerator.register_load_state_pre_hook(load_model_hook)
The source code of the Accelerate library shows that `weights` in the hooks is empty if the training task is launched via DeepSpeed: https://github.com/huggingface/accelerate/blob/b8c85839531ded28efb77c32e0ad85af2062b27a/src/accelerate/accelerator.py#L2778-L2824
Therefore, an `IndexError` is raised in `save_model_hook`:
e5-mistral-7b-instruct/peft_lora_embedding_semantic_search.py
Lines 158 to 162 in 9902191
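
To illustrate the failure, here is a minimal sketch that does not depend on Accelerate or DeepSpeed. `DummyModel`, `naive_save_model_hook`, and `guarded_save_model_hook` are hypothetical names; the naive hook only assumes that the real hook indexes into `weights` once per model, and it is not a copy of the referenced lines. Whether saving with `state_dict=None` gives a usable checkpoint under DeepSpeed has not been verified here, so not registering the hooks remains the safer workaround.

```python
class DummyModel:
    """Hypothetical stand-in for a PEFT model; records each save request."""

    def __init__(self):
        self.saved = []

    def save_pretrained(self, output_dir, state_dict=None):
        self.saved.append((output_dir, state_dict))


def naive_save_model_hook(models, weights, output_dir):
    # Assumed shape of the failing hook: one `weights` entry per model.
    for i, model in enumerate(models):
        model.save_pretrained(output_dir, state_dict=weights[i])


def guarded_save_model_hook(models, weights, output_dir):
    # Hypothetical variant: tolerate an empty `weights` list by passing None.
    for i, model in enumerate(models):
        state_dict = weights[i] if i < len(weights) else None
        model.save_pretrained(output_dir, state_dict=state_dict)


model = DummyModel()

# Simulate the DeepSpeed case: models are passed, but `weights` is empty.
try:
    naive_save_model_hook([model], [], "ckpt")
    raised = False
except IndexError:
    raised = True

guarded_save_model_hook([model], [], "ckpt")
```

With an empty `weights` list the naive hook fails on `weights[0]`, while the guarded variant still calls `save_pretrained` once per model.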
Another error: if `--checkpointing_steps` is set to "epoch", `accelerator.save_state()` times out, but it works if an integer is set.