Preetha/workload type rebased #446

preetha-intel · 2024-09-19T17:39:25Z

Description

Add support for workload type as session_option and runtime_option

* Implements blob compatibility check for NPU * OVEP catches the NPU driver exception and return failure status * NPU to CPU fallback is disabled when inferencing with blob * Update NPU device exception handling approach * Changes failure status code to exception (std::runtime_error) * Capture all NPU related errors * Throw minimal error message with error type and error code for Release builds * Fix lint issues * Address review comments * Address review comments --------- Co-authored-by: Srirammaswamy <srirammaswamy.s@intel.com>

…PU (#441) * Prototype shared memory allocator on Windows using OV-EP * Partially working allocator. Crashing on tensor destruction. Might have UMD exceptions. Needs further debug. Unknown if values are correct. * Hard code onnx perf to use RT NPU allocator for inputs * Fix allocation lookups coming from different level zero contexts * Page align OV allocation * Allocate input as WC * Only set tensors when they have changed. * Revert "Allocate input as WC" This reverts commit d43219f. * Hard code onnx perf to use RT NPU for outputs * Revert "Hard code onnx perf to use RT NPU for outputs" This reverts commit c1f3b3e. * Hard code onnx perf to use RT NPU for outputs fixed * Fix onnx_perf_test app crash on tensor destroy * refactor: remove redundant ort_shape_to_ovshape lambda function * alocate buffer in NPU visible region from perf test application * remove redundant code * add command line parameter in perf test for using remote tensors * remove redundant code * remove redundant statements * fix crash during inference * remove redundant code * enable backward compatibility of remote tensor feature * Revert "enable backward compatibility of remote tensor feature" This reverts commit 1791b90. * enable backward compatibility of remote tensor feature in OVEP --------- Co-authored-by: Javier E. Martinez <javier.e.martinez@intel.com> Co-authored-by: Eric Crawford <eric.r.crawford@intel.com>

…ter then 2024.3

Disable driver caching for NPU when epctx enabled for ov version greater then 2024.3

* fix debug build issue and lint issues * change naming for OVEP NPU specific macro * fix unit tests and lint issues

preetha-intel · 2024-10-08T05:58:00Z

Outdated.
Refer to microsoft#22282 for reimplementation

preetha-intel and others added 12 commits September 6, 2024 20:58

Disable driver caching for NPU when epctx enabled for ov version grea…

df7febe

…ter then 2024.3

Merge pull request #443 from intel/jatin/ov_2024_4_cache_fix

f24a235

Disable driver caching for NPU when epctx enabled for ov version greater then 2024.3

Merge branch 'microsoft:main' into ovep-develop-lnl-1.2

24968dc

Ovep release lnl 1.2.1 (#445)

8259b03

* fix debug build issue and lint issues * change naming for OVEP NPU specific macro * fix unit tests and lint issues

Merge branch 'microsoft:main' into ovep-develop-lnl-1.2

5a9c8af

Add support to set session.workload_type in OVEP

d69f6c9

Add runtime option for workload type in OVEP

c89cc2e

Set property for worklod type as anymap property

a49cbf6

Fix reference to global_context to be set during run_option

d31050b

Pass global_context by reference to backend_manager

541afee

preetha-intel requested a review from sfatimar September 19, 2024 17:52

preetha-intel added 3 commits September 20, 2024 10:43

Reset runtime_workload_type on run end

76ca087

Fix lint issues

7e9caab

Add check for workload_type

0f007a3

saurabhkale17 force-pushed the ovep-develop-lnl-1.2 branch from 4d64dc0 to f9b995c Compare September 25, 2024 13:20

Fix infer request bug

a8c4f0b

preetha-intel marked this pull request as draft October 1, 2024 13:51

preetha-intel closed this Oct 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preetha/workload type rebased #446

Preetha/workload type rebased #446

preetha-intel commented Sep 19, 2024

preetha-intel commented Oct 8, 2024

Preetha/workload type rebased #446

Preetha/workload type rebased #446

Conversation

preetha-intel commented Sep 19, 2024

Description

preetha-intel commented Oct 8, 2024