Actions: HabanaAI/vllm-fork

clang-format

328 workflow run results

Support loading checkpoints quantized using Autofp8
clang-format #328: Pull request #286 synchronize by Yantom1
September 16, 2024 12:41 20s yan_autofp8
Support loading checkpoints quantized using Autofp8
clang-format #327: Pull request #286 synchronize by Yantom1
September 16, 2024 12:39 17s yan_autofp8
Support loading checkpoints quantized using Autofp8
clang-format #326: Pull request #286 opened by Yantom1
September 16, 2024 12:28 22s yan_autofp8
Support loading checkpoints quantized using Autofp8
clang-format #323: Pull request #225 synchronize by Yantom1
September 16, 2024 11:00 16s autofp8
Add Dockerfile.hpu (#200)
clang-format #317: Commit f4ac1f9 pushed by michalkuligowski
September 13, 2024 13:24 12s habana_main
September 13, 2024 13:24 19s
fix rotary embedding rotary_dim not equal head_size case (#245)
clang-format #315: Commit 8a92591 pushed by michalkuligowski
September 13, 2024 13:24 14s habana_main
[Bugfix][Habana_main] fix guided_decode HPU failing issue (#236)
clang-format #314: Commit 54c1688 pushed by michalkuligowski
September 13, 2024 13:00 20s habana_main
Optimize LoRA mask creation
clang-format #313: Pull request #285 opened by SanjuCSudhakaran
September 13, 2024 11:36 14s optimize-lora-mask
Increase garbage collector's threshold (#281)
clang-format #312: Commit 88b06c2 pushed by kwisniewski98
September 13, 2024 10:36 21s habana_main
optimize qwen2 model on Gaudi
clang-format #311: Pull request #233 synchronize by czhu15
September 13, 2024 00:52 21s czhu15:optimize_qwen2
Remove hardcoded value from softmax in flat_pa (#280)
clang-format #307: Commit 35a4a98 pushed by szutenberg
September 12, 2024 13:53 15s habana_main
Increase garbage collector's threshold
clang-format #306: Pull request #281 synchronize by kwisniewski98
September 12, 2024 12:08 19s private/kwisniewski/gc_frequency_fix
Increase garbage collector's threshold
clang-format #305: Pull request #281 synchronize by kwisniewski98
September 12, 2024 12:03 14s private/kwisniewski/gc_frequency_fix
Increase garbage collector's threshold
clang-format #304: Pull request #281 synchronize by kwisniewski98
September 12, 2024 12:00 18s private/kwisniewski/gc_frequency_fix
Remove hardcoded value from softmax in flat_pa
clang-format #303: Pull request #280 synchronize by madamczykhabana
September 12, 2024 11:50 16s dev/madamczyk/flat_pa_acc