[bug] Failed to load pretrained model with huggingface transformers #1317

Open
kehuanfeng opened this issue Nov 6, 2024 · 1 comment · May be fixed by #1335
Labels
bug Something isn't working

Comments

kehuanfeng commented Nov 6, 2024

torch 2.4.0a0+3bcc3cddb5.nv24.7
transformer-engine 1.11.0+c27ee60
transformers 4.45.0

[rank5]: Traceback (most recent call last):
[rank5]:   File "/data/kehuan/LLaMA-Factory/src/train.py", line 28, in <module>
[rank5]:     main()
[rank5]:   File "/data/kehuan/LLaMA-Factory/src/train.py", line 19, in main
[rank5]:     run_exp()
[rank5]:   File "/data/kehuan/LLaMA-Factory/src/llamafactory/train/tuner.py", line 50, in run_exp
[rank5]:     run_sft(model_args, data_args, training_args, finetuning_args, generating_args, callbacks)
[rank5]:   File "/data/kehuan/LLaMA-Factory/src/llamafactory/train/sft/workflow.py", line 48, in run_sft
[rank5]:     model = load_model(tokenizer, model_args, finetuning_args, training_args.do_train)
[rank5]:   File "/data/kehuan/LLaMA-Factory/src/llamafactory/model/loader.py", line 162, in load_model
[rank5]:     model = load_class.from_pretrained(**init_kwargs)
[rank5]:   File "/usr/local/lib/python3.10/dist-packages/transformers/models/auto/auto_factory.py", line 559, in from_pretrained
[rank5]:     return model_class.from_pretrained(
[rank5]:   File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 4008, in from_pretrained
[rank5]:     ) = cls._load_pretrained_model(
[rank5]:   File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 4272, in _load_pretrained_model
[rank5]:     if param.device == torch.device("meta"):
[rank5]: AttributeError: '_io.BytesIO' object has no attribute 'device'

I know it's caused by the fp8-related _extra_state entries, but I have no idea how to fix this kind of issue.
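
For context, a minimal sketch of the failure mode (assuming the BytesIO objects come from Transformer Engine's _extra_state entries, which carry serialized FP8 scaling metadata): the meta-device check in transformers assumes every state-dict value is a tensor with a .device attribute, so a non-tensor entry raises the AttributeError above. The module names below are made up for illustration; this only shows why an isinstance guard (or a tensor-based _extra_state representation) avoids the crash, and it is not the fix from #1335.

# Illustration of the failure mode, not the actual fix from #1335.
import io
import torch

state_dict = {
    "linear.weight": torch.empty(4, 4, device="meta"),  # ordinary parameter
    "linear._extra_state": io.BytesIO(),                 # serialized FP8 metadata blob
}

for key, param in state_dict.items():
    # This mirrors the check that crashes in modeling_utils.py:
    #     if param.device == torch.device("meta"):
    # io.BytesIO has no .device attribute, hence the AttributeError.
    if not isinstance(param, torch.Tensor):
        print(f"skipping non-tensor entry: {key}")
        continue
    print(key, "on meta device:", param.device == torch.device("meta"))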

timmoon10 linked a pull request Nov 15, 2024 that will close this issue: #1335
timmoon10 added the bug label Nov 15, 2024
timmoon10 (Collaborator) commented

Can you try running with #1335?
