[Question] Llama-v2-7B-Chat Qnn-Gpu model generation #138

sparkleholic · 2024-12-07T01:41:36Z

According to the release notes for Qualcomm AI Engine Direct SDK 2.29.0.241129, the Llama-v2-7B-Chat model is supported on the Qnn-Gpu backend as well as the Qnn-HTP backend.

As far as I know, a Llama-v2-7B-Chat model built via ai-hub supports Qnn-HTP.
Could you explain how to build a Llama-v2-7B-Chat model that supports the Qnn-Gpu backend?

mestrona-3 · 2024-12-13T22:32:13Z

Hi @sparkleholic, thanks for the question! We'd recommend asking questions via our Slack Community for faster response times :)

That being said, we (AI Hub team) haven't validated our Llama-v2 recipe on the QNN-Gpu backend.

mestrona-3 added the "question" label ("Please ask any questions on Slack. This issue will be closed once responded to.") on Dec 13, 2024
