Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: Fix bug for bypass umd caching w/ OV 2025 #501

Open
wants to merge 23 commits into
base: master
Choose a base branch
from

Conversation

MayureshV1
Copy link

Description
This PR fixes below filed issues -
https://jira.devtools.intel.com/browse/HAFP-2910

Fixes a bug which causes dual caching of model starting OV2025.0 when EPCtx is enabled.

jatinwadhwa921 and others added 23 commits October 23, 2024 06:44
Add check for NPU device type in allocator creation logic
Create multi infer requests in OVEP based on num_threads
Updated API Documentation with required comments
* Add support for dynmaic workload type

* Fix the iteration on gsl::span dynamic keys and values

* Fix lint issues
Fix inference with EP context embed mode 0
@MayureshV1 MayureshV1 added the bug Something isn't working label Nov 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants