Skip to content

Commit

Permalink
Set teacher ckptr output_dir to match student in KD configs (#2185)
Browse files Browse the repository at this point in the history
  • Loading branch information
ebsmothers authored Dec 19, 2024
1 parent 0528c11 commit 46a1ef0
Show file tree
Hide file tree
Showing 4 changed files with 4 additions and 4 deletions.
2 changes: 1 addition & 1 deletion recipes/configs/llama3_2/8B_to_1B_KD_lora_distributed.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -59,7 +59,7 @@ teacher_checkpointer:
model-00004-of-00004.safetensors
]
recipe_checkpoint: null
output_dir: /tmp/Meta-Llama-3.1-8B-Instruct/
output_dir: ${output_dir}
model_type: LLAMA3

# Dataset and Sampler
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -59,7 +59,7 @@ teacher_checkpointer:
model-00004-of-00004.safetensors
]
recipe_checkpoint: null
output_dir: /tmp/Meta-Llama-3.1-8B-Instruct/
output_dir: ${output_dir}
model_type: LLAMA3

# Dataset and Sampler
Expand Down
2 changes: 1 addition & 1 deletion recipes/configs/qwen2/1.5_to_0.5B_KD_lora_distributed.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,7 @@ teacher_checkpointer:
hf_model_0001_0.pt
]
recipe_checkpoint: null
output_dir: /tmp/Qwen2-1.5B-Instruct-lora-finetune
output_dir: ${output_dir}
model_type: QWEN2

resume_from_checkpoint: False
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -51,7 +51,7 @@ teacher_checkpointer:
model.safetensors
]
recipe_checkpoint: null
output_dir: /tmp/Qwen2-1.5B-Instruct
output_dir: ${output_dir}
model_type: QWEN2

resume_from_checkpoint: False
Expand Down

0 comments on commit 46a1ef0

Please sign in to comment.