Skip to content

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #3867

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #3867

Annotations

1 error

build (3.9, ubuntu-20.04)

failed Jan 7, 2025 in 4m 32s