Add lm_eval test for HPU #298

kzawora-intel · 2024-09-18T15:46:36Z

This currently tests only Llama3.1-8b-Instruct with lm_eval on GSM-8K and IFEval. Both tests pass, and coverage can be expanded if needed.

madamczykhabana · 2024-09-19T04:33:40Z

tests/hpu/test_hpu_lmeval.py

+                         [TaskConfigs.gsm8k_llama_cot, TaskConfigs.ifeval],
+                         ids=['gsm8k_llama_cot', 'ifeval'],


I think you can pass a function as 'ids'. Something like:

    def get_task_name(task_cfg):
        return task_cfg["task_name"]
    ...
    @pytest.mark.parametrize("task_cfg",
                             [TaskConfigs.gsm8k_llama_cot, TaskConfigs.ifeval],
                             ids=get_task_name,

kzawora-intel · 2024-09-20

Fixed, thanks!
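For readers following the thread, the suggestion above can be sketched as a self-contained test. This is only an illustration: the `TaskConfigs` class below is a hypothetical stand-in (the real configs are not shown in this conversation; the sketch assumes only that each one exposes a "task_name" key), and the test body is a placeholder rather than the actual lm_eval run.

```python
import pytest


# Hypothetical stand-ins for the PR's TaskConfigs entries. The only
# assumption taken from the review comment is the "task_name" key.
class TaskConfigs:
    gsm8k_llama_cot = {"task_name": "gsm8k_llama_cot"}
    ifeval = {"task_name": "ifeval"}


def get_task_name(task_cfg):
    # pytest calls this once per parametrized value to build the test ID.
    return task_cfg["task_name"]


@pytest.mark.parametrize(
    "task_cfg",
    [TaskConfigs.gsm8k_llama_cot, TaskConfigs.ifeval],
    ids=get_task_name,
)
def test_task_cfg_has_name(task_cfg):
    # Placeholder check; the real test would evaluate the task with lm_eval.
    assert task_cfg["task_name"]
```

Passing a callable keeps the IDs in sync with the configs, so adding a task to the list does not require updating a second, hand-written list of names.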

madamczykhabana · 2024-09-19T04:34:41Z

tests/hpu/test_hpu_lmeval.py

Perhaps we should move it to the extension repo?

kzawora-intel · 2024-09-20

Yeah, the PR has gotten a little bloated. I'll close it.

kzawora-intel added 4 commits September 18, 2024 18:45

Add lm_eval test for HPU

68d4178

oopsie i forgot the test

4119b1d

restore requirements-hpu.txt

4375fbb

now i can format the test!

10675d1

madamczykhabana reviewed Sep 19, 2024

add more tests

594061f

kzawora-intel closed this Sep 20, 2024

kzawora-intel added the habana label (Issues or PRs submitted by Habana Labs) Sep 20, 2024
