Q: is the auto-saved model to be used instead of lag-llama.ckpt ? #86

kenadianu · 2024-06-27T03:24:51Z

During fine-tuning the models are automatically saved, the best being, say
\lag-llama-main\lightning_logs\version_12\checkpoints\epoch=34-step=5678.ckpt'
(this is displayed on the console)

For subsequent prediction, with zero-shot, should I provide LagLlamaEstimator(...) with
epoch=34-step=5678.ckpt instead of lag-llama.ckpt ? Thank you.

The text was updated successfully, but these errors were encountered:

ashok-arjun · 2024-07-12T19:55:18Z

Hi, if you're using finetuning, then the evaluation is no longer "zero-shot". And yes, you should provide the right (latest) checkpoint if you want to evaluate results on that checkpoint.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Q: is the auto-saved model to be used instead of lag-llama.ckpt ? #86

Q: is the auto-saved model to be used instead of lag-llama.ckpt ? #86

kenadianu commented Jun 27, 2024

ashok-arjun commented Jul 12, 2024

Q: is the auto-saved model to be used instead of lag-llama.ckpt ? #86

Q: is the auto-saved model to be used instead of lag-llama.ckpt ? #86

Comments

kenadianu commented Jun 27, 2024

ashok-arjun commented Jul 12, 2024