
FP8 support #171

Open
markoarnauto opened this issue Jul 24, 2024 · 3 comments

Comments

@markoarnauto

No description provided.

btakeya commented Dec 2, 2024

Hi CoreWeave team, could you please take a look at this? I've just hit an unknown quantization type error, shown below:

ValueError: Unknown quantization type, got fp8 - supported types are: ['awq', 'bitsandbytes_4bit', 'bitsandbytes_8bit', 'gptq', 'aqlm', 'quanto', 'eetq', 'hqq', 'compressed-tensors', 'fbgemm_fp8', 'torchao', 'bitnet']

It would help a lot if fp8 were supported. Thanks in advance.

sangstar (Contributor) commented Dec 3, 2024

Hi @btakeya

Where did you get that error from? Can you send the full traceback? This looks like it might be an error from transformers.

btakeya commented Dec 3, 2024

@sangstar Thanks for your attention!
I tried with this code, using my private fp8-quantized model (I replaced L16 with its HF repo name).
The full traceback is attached as well:

Traceback (most recent call last):
  File "tensorize.py", line 40, in <module>
    model = original_model(model_ref)
  File "tensorize.py", line 20, in original_model
    return AutoModelForCausalLM.from_pretrained(ref)
  File "/home/juhwan/venv/lib/python3.8/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
  File "/home/juhwan/venv/lib/python3.8/site-packages/transformers/modeling_utils.py", line 3647, in from_pretrained
    config.quantization_config = AutoHfQuantizer.merge_quantization_configs(
  File "/home/juhwan/venv/lib/python3.8/site-packages/transformers/quantizers/auto.py", line 173, in merge_quantization_configs
    quantization_config = AutoQuantizationConfig.from_dict(quantization_config)
  File "/home/juhwan/venv/lib/python3.8/site-packages/transformers/quantizers/auto.py", line 97, in from_dict
    raise ValueError(
ValueError: Unknown quantization type, got fp8 - supported types are: ['awq', 'bitsandbytes_4bit', 'bitsandbytes_8bit', 'gptq', 'aqlm', 'quanto', 'eetq', 'hqq', 'compressed-tensors', 'fbgemm_fp8', 'torchao', 'bitnet']

(It does seem to come from transformers, as you expected.)
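For reference, a minimal sketch of the failing call, assuming a hypothetical repo id in place of the private fp8-quantized model above:

from transformers import AutoModelForCausalLM

# Hypothetical repo id; substitute the fp8-quantized checkpoint's HF repo name.
model_ref = "my-org/my-fp8-model"

# from_pretrained reads the checkpoint's quantization_config; presumably because
# this transformers version has no handler for quant_method "fp8", it raises:
# ValueError: Unknown quantization type, got fp8 - supported types are: [...]
model = AutoModelForCausalLM.from_pretrained(model_ref)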
