Make config.json consistent between shortfin and sharktank #487
base: main
Conversation
```diff
 return {
     "module_name": "module",
     "module_abi_version": 1,
     "max_seq_len": hp.context_length,
-    "attn_head_count": hp.attention_head_count,
+    # "attn_head_count": hp.attention_head_count,  # we don't need the attention head count, just the kv-cache attention head count for shortfin
```
Since all of the docs are updated and references to "attn_head_count" are removed, this is probably fine to delete. Otherwise, it'll sit here lingering for who knows how long
Looks good! We can probably remove the commented-out line for attn_head_count in export_paged_llm_v1.py. The added types and docstrings definitely clear things up. We should also remove the adaptation layer in build_tools/integration_tests/llm/conftest.py.
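For readers outside the thread: the conftest adaptation layer presumably rewrote legacy config keys into the shape shortfin expects, which is why a consistent config.json makes it dead code. A hypothetical sketch under that assumption (the function name, key names, and remapping logic are all illustrative, not taken from the repo):

```python
def adapt_legacy_config(config: dict) -> dict:
    """Hypothetical shim: translate legacy sharktank config keys into
    the shape shortfin expects. With config.json now consistent between
    the two projects, a pass-through like this can be deleted.
    """
    adapted = dict(config)
    if "attn_head_count" in adapted and "attn_head_count_kv" not in adapted:
        # Legacy exports carried the query head count; assume it matched
        # the kv-cache head count (true only for non-GQA models).
        adapted["attn_head_count_kv"] = adapted.pop("attn_head_count")
    return adapted
```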