Skip to content

Actions: ZX-ModelCloud/GPTQModel

Actions

Ruff Check

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
289 workflow runs
289 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Merge branch 'ModelCloud:main' into main
Ruff Check #164: Commit 443db44 pushed by ZX-ModelCloud
December 13, 2024 05:17 22s main
December 13, 2024 05:17 22s
Revert "IPEXQuantLinear supports group_size=-1"
Ruff Check #163: Commit 1305c2e pushed by ZX-ModelCloud
December 13, 2024 05:17 15s main
December 13, 2024 05:17 15s
IPEXQuantLinear supports group_size=-1
Ruff Check #161: Commit fba1efb pushed by ZX-ModelCloud
December 13, 2024 03:59 16s main
December 13, 2024 03:59 16s
add patch_vllm() (#829)
Ruff Check #160: Commit 7c376a9 pushed by ZX-ModelCloud
December 13, 2024 03:56 19s main
December 13, 2024 03:56 19s
fix GPTQMarlinLinearMethod load weights
Ruff Check #159: Commit 8f3532b pushed by ZX-ModelCloud
December 13, 2024 03:37 22s zx_monkey_patch_vllm
December 13, 2024 03:37 22s
update gptq_marlin.py to vllm latest
Ruff Check #158: Commit 14e7cef pushed by ZX-ModelCloud
December 13, 2024 01:24 17s zx_monkey_patch_vllm
December 13, 2024 01:24 17s
add patch_vllm()
Ruff Check #157: Commit ba337bc pushed by ZX-ModelCloud
December 13, 2024 00:55 24s zx_monkey_patch_vllm
December 13, 2024 00:55 24s
[MODEL] add qwen2_vl support (#826)
Ruff Check #156: Commit 59cb59b pushed by ZX-ModelCloud
December 12, 2024 14:25 15s main
December 12, 2024 14:25 15s
fix module was skipped but still be looped (#806)
Ruff Check #155: Commit 023300d pushed by ZX-ModelCloud
December 9, 2024 06:38 16s main
December 9, 2024 06:38 16s
only AUTO will try other quant linears (#797)
Ruff Check #154: Commit a2cbcfa pushed by ZX-ModelCloud
December 7, 2024 09:05 15s main
December 7, 2024 09:05 15s
Fixed ipex linear param check and logging once (#795)
Ruff Check #153: Commit 26961ce pushed by ZX-ModelCloud
December 6, 2024 05:42 19s main
December 6, 2024 05:42 19s
[CI] max parallel 12 (#789)
Ruff Check #152: Commit d4d0fde pushed by ZX-ModelCloud
December 6, 2024 02:04 16s main
December 6, 2024 02:04 16s
receive checkpoint_format argument (#747)
Ruff Check #151: Commit eb75129 pushed by ZX-ModelCloud
December 4, 2024 12:36 26s main
December 4, 2024 12:36 26s
add test_asym_gptq_v1.py (#740)
Ruff Check #150: Commit b11d112 pushed by ZX-ModelCloud
December 4, 2024 08:01 26s main
December 4, 2024 08:01 26s
add TorchQuantLinear (#735)
Ruff Check #149: Commit 1995602 pushed by ZX-ModelCloud
December 4, 2024 05:45 16s main
December 4, 2024 05:45 16s
add test_q4_torch.py
Ruff Check #148: Commit 35dfd75 pushed by ZX-ModelCloud
December 4, 2024 05:19 16s zx_add_torch_qlinear
December 4, 2024 05:19 16s
Explicitly specify SUPPORTS_DEVICES
Ruff Check #147: Commit 9b44f80 pushed by ZX-ModelCloud
December 4, 2024 04:18 19s zx_add_torch_qlinear
December 4, 2024 04:18 19s
add TorchQuantLinear
Ruff Check #146: Commit c0f8b11 pushed by ZX-ModelCloud
December 4, 2024 04:06 15s zx_add_torch_qlinear
December 4, 2024 04:06 15s
Update README.md (#733)
Ruff Check #145: Commit 0538d79 pushed by ZX-ModelCloud
December 4, 2024 02:54 24s main
December 4, 2024 02:54 24s
Start 1.3.2-dev cycle (#711)
Ruff Check #144: Commit 3767247 pushed by ZX-ModelCloud
November 29, 2024 08:59 17s main
November 29, 2024 08:59 17s
hymba quant needs desc_act=False
Ruff Check #143: Commit 22a7436 pushed by ZX-ModelCloud
November 29, 2024 03:42 17s zx_fix_hymba_desc_act
November 29, 2024 03:42 17s
use lm_eval (#704)
Ruff Check #142: Commit b3a5fc3 pushed by ZX-ModelCloud
November 29, 2024 01:54 17s main
November 29, 2024 01:54 17s
use lm_eval
Ruff Check #141: Commit 7a970bb pushed by ZX-ModelCloud
November 28, 2024 17:13 17s zx_fix_test_hymba
November 28, 2024 17:13 17s
[CI] fix gpu selector (#703)
Ruff Check #140: Commit 6a98dbc pushed by ZX-ModelCloud
November 28, 2024 12:44 17s main
November 28, 2024 12:44 17s