Intern2 habana #489
Conversation
changing view to shape in split_qkv · internlm2 habana compatibility
@@ -144,20 +144,23 @@ def __init__(
         )

     def split_qkv(self, qkv: torch.Tensor):
-        seq_len = qkv.shape[0]
+        batch_size, seq_len, _ = qkv.shape
This will break all non-HPU accelerators. Can we make this more generic? E.g. pass *qkv.shape[:-1]
into reshape, so that 2D tensors will work as well?
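For reference, a minimal sketch of the kind of shape-agnostic split being suggested here (illustrative only; the names split_qkv_generic, num_kv_heads, kv_groups and head_dim are assumptions, not the actual vLLM attributes):

```python
import torch

def split_qkv_generic(qkv: torch.Tensor, num_kv_heads: int,
                      kv_groups: int, head_dim: int):
    """Split a packed QKV tensor without assuming a fixed number of
    leading dimensions, so both [tokens, hidden] and
    [batch, seq, hidden] inputs work."""
    # Keep every leading dimension untouched; only restructure the last one.
    qkv = qkv.reshape(*qkv.shape[:-1], num_kv_heads, kv_groups + 2, head_dim)
    q, k, v = torch.split(qkv, [kv_groups, 1, 1], dim=-2)
    # Flatten the per-head dimensions back into a single hidden dimension.
    q = q.reshape(*q.shape[:-3], num_kv_heads * kv_groups * head_dim)
    k = k.reshape(*k.shape[:-3], num_kv_heads * head_dim)
    v = v.reshape(*v.shape[:-3], num_kv_heads * head_dim)
    return q, k, v

# Both a flattened token tensor and a batched tensor go through the same path.
num_kv_heads, kv_groups, head_dim = 8, 4, 128
hidden = num_kv_heads * (kv_groups + 2) * head_dim
q2d, k2d, v2d = split_qkv_generic(torch.randn(32, hidden),
                                  num_kv_heads, kv_groups, head_dim)
q3d, k3d, v3d = split_qkv_generic(torch.randn(2, 16, hidden),
                                  num_kv_heads, kv_groups, head_dim)
```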
Sure, let me add that and test on non-HPU.
         if self.tp_size > 1:
             splitter = partial(split_tensor_along_last_dim,
                                num_partitions=self.tp_size)
             q = splitter(q)[self.tp_rank]
             k = splitter(k)[self.tp_rank]
             v = splitter(v)[self.tp_rank]
+
Yapf fails on this empty line, please remove
@@ -144,27 +144,33 @@ def __init__(
         )

     def split_qkv(self, qkv: torch.Tensor):
-        seq_len = qkv.shape[0]
+        # Unpack all dimensions except the last one
+        *batch_dims, last_dim = qkv.shape
last_dim seems to be unused, please replace it with _
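A quick illustration of the suggested unpacking idiom (standalone snippet, not the actual patch):

```python
import torch

qkv = torch.randn(2, 16, 4096)
# The last dimension is not needed by name, so discard it with _.
*batch_dims, _ = qkv.shape
print(batch_dims)  # [2, 16]
```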
The "Trigger Jenkins Tests" failure can be ignored now
Making sure the model runs on Habana devices. The original code did not run because of an error in split_qkv: the parameter unpacking assumed there was no batch dimension. Tested inference with these changes; InternLM2 works on Gaudi2 as expected.
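A minimal reproduction of the failure mode described above (sizes are illustrative; it assumes the HPU path hands split_qkv a batched [batch, seq, hidden] tensor while the original unpacking expected a flattened [tokens, hidden] one):

```python
import torch

num_heads, head_dim = 32, 128
qkv = torch.randn(2, 16, num_heads * head_dim)  # batched input

# Original-style unpacking: shape[0] is the batch size here, not the
# number of tokens, so a view built from it no longer matches the data.
seq_len = qkv.shape[0]
try:
    qkv.view(seq_len, num_heads, head_dim)
except RuntimeError as err:
    print(err)  # shape '[2, 32, 128]' is invalid for input of size 131072

# Shape-agnostic unpacking keeps all leading dimensions intact.
*batch_dims, _ = qkv.shape
print(qkv.reshape(*batch_dims, num_heads, head_dim).shape)  # [2, 16, 32, 128]
```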