Added QNN EP for ARM64 builds. #41
Conversation
Not ready for review, since the exported projects are not great yet... I'm not sure what to do there...
AIDevGallery/Samples/ModelsDefinitions/embeddings.modelgroup.json
@azchohfi, I made a few changes to address my initial comments, please take a look. In addition, the exported projects still need work to reference ORT.QNN instead of ORT.DML. Also, the execution provider in the exported project defaults to CPU even when NPU is selected in the app. Once this is added, it would be good to merge this in.
Not sure why the build is not showing up as executed here. It is passing.
Approved with one minor comment.
```csharp
else if (hardwareAccelerator == HardwareAccelerator.QNN)
{
    Dictionary<string, string> options = new()
    {
        { "backend_path", "QnnHtp.dll" },
        { "htp_performance_mode", "high_performance" },
        { "htp_graph_finalization_optimization_mode", "3" }
    };
    _sessionOptions.AppendExecutionProvider("QNN", options);
    _chunkSize = 8;
}
```
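For context on the review comment about the exported project defaulting to CPU, the accelerator-to-EP mapping could be sketched roughly as below. This is a minimal sketch, not the actual AI Dev Gallery export code: the `SessionFactory` class and the shape of the `HardwareAccelerator` enum are assumptions; the QNN options simply mirror the snippet above.

```csharp
using System.Collections.Generic;
using Microsoft.ML.OnnxRuntime;

// Hypothetical enum; assumed to match the accelerator choice made in the app.
public enum HardwareAccelerator { CPU, DML, QNN }

public static class SessionFactory
{
    public static SessionOptions Create(HardwareAccelerator hardwareAccelerator)
    {
        var sessionOptions = new SessionOptions();
        if (hardwareAccelerator == HardwareAccelerator.QNN)
        {
            // Same QNN options as the app-side snippet above.
            Dictionary<string, string> options = new()
            {
                { "backend_path", "QnnHtp.dll" },
                { "htp_performance_mode", "high_performance" },
                { "htp_graph_finalization_optimization_mode", "3" }
            };
            sessionOptions.AppendExecutionProvider("QNN", options);
        }
        // When no provider is appended, ONNX Runtime falls back to the
        // CPU EP — which is the unintended default the review flags.
        return sessionOptions;
    }
}
```

An exported project would then build its `InferenceSession` from the accelerator the user actually selected, rather than a hard-coded CPU default.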
Should be removed if we're not doing NPU for all-Mini