Skip to content

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #3866

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #3866

build (3.9, ubuntu-20.04)

succeeded Jan 7, 2025 in 28m 29s