Fix: selecting correct backend for MultiHeadAttention #1410
Annotations
2 errors and 1 warning
Analysing the code with ruff: vllm/attention/layer.py#L197
vllm/attention/layer.py:197:81: E501 Line too long (83 > 80)
|
Analysing the code with ruff
Process completed with exit code 1.
|
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636