MVE: Move kernel sums from core loop for FC and SVDF #69

Merged (2 commits) on Sep 18, 2023

mansnils
Contributor

This improves read efficiency for fully connected (FC) by using 4 kernel streams instead of 3. The optimization requires a buffer to store the kernel sums. Please refer to the interface document for details. Also adds a new SVDF int8 unit test.

Co-authored-by: Felix Johnny Thomasmathibalan [email protected]

felix-johnny and others added 2 commits September 13, 2023 13:22
This improves FC read efficiency by using 4 kernel streams instead
of 3. The optimization requires a buffer to store the kernel sums.
Please refer to the interface document for details.
Also adds a new unit test to cover this for SVDF s8.
For MVE and FC, the intention is to calculate the kernel sums once
before inference, and this patch addresses that.
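
For readers unfamiliar with the idea, here is a minimal scalar sketch of why kernel sums can be moved out of the core loop. It relies on the identity `sum((in[k] + offset) * w[k]) == sum(in[k] * w[k]) + offset * sum(w[k])`, so the weight sum per output row can be computed once before inference. The function names, signatures and row-major weight layout are assumptions made for illustration only; they are not the library's API and this is not the MVE implementation from this patch.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, run once before inference:
 * stores the sum of each weight row in kernel_sum[row]. */
static void kernel_sums_s8(const int8_t *weights, size_t rows, size_t cols,
                           int32_t *kernel_sum)
{
    for (size_t r = 0; r < rows; r++) {
        int32_t sum = 0;
        for (size_t c = 0; c < cols; c++) {
            sum += weights[r * cols + c];
        }
        kernel_sum[r] = sum;
    }
}

/* Reference version: the input offset is applied inside the core loop. */
static void fc_reference_s8(const int8_t *input, const int8_t *weights,
                            int32_t input_offset, size_t rows, size_t cols,
                            int32_t *acc)
{
    for (size_t r = 0; r < rows; r++) {
        int32_t sum = 0;
        for (size_t c = 0; c < cols; c++) {
            sum += ((int32_t)input[c] + input_offset) * weights[r * cols + c];
        }
        acc[r] = sum;
    }
}

/* Sketch of the optimized form: the core loop is a plain dot product,
 * and the offset term is added afterwards from the precomputed sums. */
static void fc_with_kernel_sums_s8(const int8_t *input, const int8_t *weights,
                                   const int32_t *kernel_sum,
                                   int32_t input_offset, size_t rows,
                                   size_t cols, int32_t *acc)
{
    for (size_t r = 0; r < rows; r++) {
        int32_t dot = 0;
        for (size_t c = 0; c < cols; c++) {
            dot += (int32_t)input[c] * weights[r * cols + c];
        }
        acc[r] = dot + input_offset * kernel_sum[r];
    }
}
```

Both versions produce the same accumulators; the second only needs the extra `kernel_sum` buffer, which matches the buffer requirement mentioned in the description.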
@mansnils added the improvement (Performance or general improvement) label on Sep 14, 2023
@mansnils merged commit 58f1770 into ARM-software:main on Sep 18, 2023
1 check passed