Skip to content

Actions: HabanaAI/vllm-fork

clang-format

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
895 workflow run results
895 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add HPU specific changes to benchmark_latency.py (#436)
clang-format #814: Commit 4fd5c4c pushed by michalkuligowski
October 28, 2024 09:47 25s habana_main
October 28, 2024 09:47 25s
Add fp8 test to jenkins CI
clang-format #812: Pull request #429 synchronize by afierka-intel
October 28, 2024 09:02 18s dev/afierka/add-ci-fp8-scenarios
October 28, 2024 09:02 18s
HPU: offload logits processing to CPU
clang-format #811: Pull request #358 synchronize by madamczykhabana
October 28, 2024 08:56 23s dev/madamczyk/offload_logits
October 28, 2024 08:56 23s
Support long contexts with LoRA (#418)
clang-format #810: Commit 3a55e77 pushed by michalkuligowski
October 28, 2024 08:47 25s habana_main
October 28, 2024 08:47 25s
Add fp8 test to jenkins CI
clang-format #809: Pull request #429 synchronize by afierka-intel
October 28, 2024 08:15 18s dev/afierka/add-ci-fp8-scenarios
October 28, 2024 08:15 18s
Fix one_hot bug in torch compile mode
clang-format #808: Pull request #427 synchronize by yuwenzho
October 28, 2024 07:49 18s yuwenzho:yuwen/tc_one_hot
October 28, 2024 07:49 18s
Support long contexts with LoRA
clang-format #806: Pull request #418 synchronize by SanjuCSudhakaran
October 28, 2024 05:52 19s lora-long-contexts
October 28, 2024 05:52 19s
Add fp8 test to jenkins CI
clang-format #804: Pull request #429 synchronize by afierka-intel
October 26, 2024 09:26 16s dev/afierka/add-ci-fp8-scenarios
October 26, 2024 09:26 16s
Add fp8 test to jenkins CI
clang-format #803: Pull request #429 synchronize by afierka-intel
October 26, 2024 09:20 17s dev/afierka/add-ci-fp8-scenarios
October 26, 2024 09:20 17s
Add fp8 test to jenkins CI
clang-format #802: Pull request #429 synchronize by afierka-intel
October 26, 2024 06:06 15s dev/afierka/add-ci-fp8-scenarios
October 26, 2024 06:06 15s
Lora layers
clang-format #801: Pull request #435 synchronize by rsshaik1
October 26, 2024 04:00 18s lora-layers
October 26, 2024 04:00 18s
Reduce block fragmentation
clang-format #799: Pull request #426 synchronize by yangw1234
October 25, 2024 15:24 17s yangw1234:reduce_fragmentation
October 25, 2024 15:24 17s
Lora layers
clang-format #798: Pull request #435 opened by rsshaik1
October 25, 2024 14:52 20s lora-layers
October 25, 2024 14:52 20s
Contiguous PA
clang-format #797: Pull request #433 synchronize by mfylcek
October 25, 2024 14:32 22s dev/mfylcek/contiguous_pa_main_24_10
October 25, 2024 14:32 22s
Create run-lm-eval-mmlu.sh
clang-format #796: Pull request #399 synchronize by michalkuligowski
October 25, 2024 13:46 17s michalkuligowski-mmlu-test
October 25, 2024 13:46 17s
Enable Dynamic MoE for Mixtral on 1.19.0 (#425)
clang-format #795: Commit 93609a2 pushed by tpawlows
October 25, 2024 13:30 18s habana_main
October 25, 2024 13:30 18s
Enable Dynamic MoE for Mixtral on 1.19.0
clang-format #794: Pull request #425 synchronize by tpawlows
October 25, 2024 13:25 25s dev/tpawlowski/dynamic_moe_to_1_19
October 25, 2024 13:25 25s
Contiguous PA
clang-format #793: Pull request #433 opened by mfylcek
October 25, 2024 12:54 15s dev/mfylcek/contiguous_pa_main_24_10
October 25, 2024 12:54 15s
Revert "Contiguous PA" (#432)
clang-format #792: Commit e3ae2eb pushed by madamczykhabana
October 25, 2024 12:49 19s habana_main
October 25, 2024 12:49 19s
HPU: offload logits processing to CPU
clang-format #791: Pull request #358 synchronize by madamczykhabana
October 25, 2024 12:47 14s dev/madamczyk/offload_logits
October 25, 2024 12:47 14s
Contiguous PA (#424)
clang-format #789: Commit 5b7f685 pushed by michalkuligowski
October 25, 2024 12:35 15s habana_main
October 25, 2024 12:35 15s
Add fp8 test to jenkins CI
clang-format #788: Pull request #429 synchronize by afierka-intel
October 25, 2024 12:13 20s dev/afierka/add-ci-fp8-scenarios
October 25, 2024 12:13 20s