optimum openvino #467

michaelfeil · 2024-11-16T04:31:31Z

@tjtanaa FYI, continued by merging your branch into this and main.

greptile-apps

PR Summary

This PR adds OpenVINO backend support to optimize model inference on Intel hardware, with changes spanning Docker configuration, embedder implementation, and utility functions.

- Added OpenVINO execution provider in /libs/infinity_emb/infinity_emb/transformer/utils_optimum.py with bf16 precision support
- Implemented OpenVINO model file handling with new get_openvino_files() function
- Set default INFINITY_ENGINE="optimum" in CPU Docker configuration with OpenVINO extras
- Added CHECK_OPTIMUM_INTEL optional import for Intel optimization capabilities
- Duplicate optimization config code in optimize_model() needs to be cleaned up

5 file(s) reviewed, 9 comment(s)

greptile-apps · 2024-11-16T04:31:59Z

libs/infinity_emb/Docker.template.yaml

@@ -20,6 +20,8 @@ cpu:
  extra_env_variables: |
    # Sets default to onnx
    ENV INFINITY_ENGINE="optimum"
+    RUN poetry run python -m pip install --upgrade --upgrade-strategy eager "optimum[openvino]"


style: Installing optimum[openvino] in extra_env_variables section is unconventional. Should be moved to main_install or extra_installs_main.

greptile-apps · 2024-11-16T04:32:20Z

libs/infinity_emb/Dockerfile.cpu_auto

 RUN ./requirements_install_from_poetry.sh --no-root --without lint,test "https://download.pytorch.org/whl/cpu"
+RUN poetry run python -m pip install --upgrade --upgrade-strategy eager "optimum[openvino]"


logic: Redundant installation of optimum[openvino] - this package is already included via EXTRAS='all openvino' in the environment variables

greptile-apps · 2024-11-16T04:32:21Z

libs/infinity_emb/Dockerfile.cpu_auto

 RUN ./requirements_install_from_poetry.sh  --without lint,test "https://download.pytorch.org/whl/cpu"
+RUN poetry run python -m pip install --upgrade --upgrade-strategy eager "optimum[openvino]"


style: Installing optimum[openvino] multiple times in different build stages may cause version conflicts or increase build time unnecessarily

greptile-apps · 2024-11-16T04:32:55Z

libs/infinity_emb/infinity_emb/transformer/embedder/optimum.py

+            except Exception as e:  # show error then let the optimum intel compress on the fly
+                print(str(e))


logic: Silently printing errors and continuing is dangerous. Consider logging the error and/or raising a more specific exception if OpenVINO file loading fails.
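
A minimal sketch of what this suggestion could look like, assuming the lookup is wrapped in a small helper. The helper name `find_model_file` and the `lookup` callable are hypothetical and not part of the PR; only the standard `logging` module is used. Returning `None` on failure (rather than an empty string) leaves the fallback decision to the caller.

```python
import logging
from typing import Callable, Optional

logger = logging.getLogger(__name__)


def find_model_file(lookup: Callable[[], str]) -> Optional[str]:
    """Return the file name produced by `lookup`, or None if the lookup fails.

    `lookup` is a hypothetical stand-in for whatever call locates the
    OpenVINO model file in the repository.
    """
    try:
        return lookup()
    except Exception as exc:
        # Log with the traceback instead of print(), so the failure shows up
        # in the application's logs; the caller decides how to fall back.
        logger.warning(
            "OpenVINO file lookup failed, falling back to on-the-fly export: %s",
            exc,
            exc_info=True,
        )
        return None
```

Catching a narrower exception type than `Exception` would be stricter still, but which exceptions the real lookup can raise is not visible from this diff.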

greptile-apps · 2024-11-16T04:32:56Z

libs/infinity_emb/infinity_emb/transformer/embedder/optimum.py

-        )
+        if provider == "OpenVINOExecutionProvider":
+            CHECK_OPTIMUM_INTEL.mark_required()
+            filename = ""


style: Empty filename could cause issues if exception occurs. Initialize with None instead to make the failure case more explicit.

greptile-apps · 2024-11-16T04:33:21Z

libs/infinity_emb/infinity_emb/transformer/utils_optimum.py

+        else:  # Optimum onnx cpu path
+            optimizer = ORTOptimizer.from_pretrained(unoptimized_model)
+
+            is_gpu = "cpu" not in execution_provider.lower()


logic: Duplicate optimization config block. Remove lines 231-239 as they are identical to 222-230.

greptile-apps · 2024-11-16T04:33:22Z

libs/infinity_emb/infinity_emb/transformer/utils_optimum.py

+    openvino_files = [p for p in repo_files if p.match(pattern)]
+
+    if len(openvino_files) > 1:
+        logger.info(f"Found {len(openvino_files)} onnx files: {openvino_files}")


syntax: Log message incorrectly refers to 'onnx files' when listing OpenVINO files

Suggested change

-        logger.info(f"Found {len(openvino_files)} onnx files: {openvino_files}")
+        logger.info(f"Found {len(openvino_files)} OpenVINO files: {openvino_files}")

greptile-apps · 2024-11-16T04:33:22Z

libs/infinity_emb/infinity_emb/transformer/utils_optimum.py

+        if files_optimized:
+            file_optimized = files_optimized[-1]
+        if file_name:
+            file_optimized = file_name


logic: Overwriting file_optimized with file_name could bypass optimization caching if file_name is set
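
One way to make the precedence explicit is sketched below. It is an assumption about the desired behavior, not the PR's logic: here a cached optimized file wins over a caller-supplied `file_name`, which is the opposite of the order in the diff. The helper name `pick_model_file` is hypothetical.

```python
from typing import Optional, Sequence


def pick_model_file(
    files_optimized: Sequence[str], file_name: Optional[str]
) -> Optional[str]:
    """Choose which model file to load.

    Assumed precedence: the most recent cached optimized file first,
    then the explicitly requested file name, else None.
    """
    if files_optimized:
        # Reuse the cached optimized artifact so optimization is not bypassed.
        return files_optimized[-1]
    # An empty string is treated the same as "not provided".
    return file_name or None
```

If an explicit `file_name` is meant to override the cache, swapping the two checks gives the behavior in the diff; either way, a single function states the rule in one place.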

greptile-apps · 2024-11-16T04:33:23Z

libs/infinity_emb/infinity_emb/transformer/utils_optimum.py

+                ov_config={
+                    "INFERENCE_PRECISION_HINT": "bf16"
+                },  # fp16 for now as it has better precision than bf16


style: Using bf16 precision by default may reduce accuracy compared to fp16/fp32 on some hardware
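
A sketch of how the precision hint could be made configurable instead of hard-coded. The environment variable name `INFINITY_OPENVINO_PRECISION` and the helper `build_ov_config` are hypothetical; the allowed values `f32`, `f16`, and `bf16` are assumed from OpenVINO's documented inference precision hint and should be checked against the installed OpenVINO version.

```python
import os

# Assumed set of accepted values for INFERENCE_PRECISION_HINT.
ALLOWED_PRECISIONS = {"f32", "f16", "bf16"}


def build_ov_config(default: str = "f32") -> dict:
    """Build an ov_config dict, reading the precision from the environment.

    INFINITY_OPENVINO_PRECISION is a hypothetical variable name used only
    for this illustration.
    """
    precision = os.environ.get("INFINITY_OPENVINO_PRECISION", default).lower()
    if precision not in ALLOWED_PRECISIONS:
        raise ValueError(
            f"Unsupported precision {precision!r}; expected one of {sorted(ALLOWED_PRECISIONS)}"
        )
    return {"INFERENCE_PRECISION_HINT": precision}
```

Defaulting to full precision and letting users opt in to `bf16` keeps accuracy predictable on hardware without native bf16 support.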

codecov-commenter · 2024-11-16T04:47:28Z

⚠️ Please install the Codecov GitHub app to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 44.00000% with 42 lines in your changes missing coverage. Please review.

Project coverage is 78.74%. Comparing base (8ac0b3c) to head (5961be9).

Files with missing lines	Patch %	Lines
...nity_emb/infinity_emb/transformer/utils_optimum.py	44.23%	29 Missing ⚠️
...y_emb/infinity_emb/transformer/embedder/optimum.py	38.09%	13 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #467      +/-   ##
==========================================
- Coverage   79.51%   78.74%   -0.77%     
==========================================
  Files          41       41              
  Lines        3417     3468      +51     
==========================================
+ Hits         2717     2731      +14     
- Misses        700      737      +37
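
The percentages in the table can be reproduced from the Hits and Lines rows. The sketch below assumes the report truncates to two decimals rather than rounding, an assumption chosen because it matches the figures shown; the function name `coverage_bp` is hypothetical.

```python
def coverage_bp(hits: int, lines: int) -> int:
    """Coverage in basis points (hundredths of a percent), truncated.

    Integer arithmetic avoids floating-point rounding surprises.
    """
    return hits * 10_000 // lines


base = coverage_bp(2717, 3417)  # main
head = coverage_bp(2731, 3468)  # this PR
delta = head - base

print(f"{base / 100:.2f}% -> {head / 100:.2f}% ({delta / 100:+.2f})")
```

The 51 added lines contribute only 14 additional hits, which is why project coverage drops even though the absolute number of covered lines goes up.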


michaelfeil added 3 commits November 15, 2024 20:22

try merge 1

07eb38a

merge openvino

91948cd

update template

d588ffc

greptile-apps bot reviewed Nov 16, 2024

add optimum

4718e69

michaelfeil added 2 commits November 15, 2024 22:20

latest push

5ae8c43

fmt

5961be9
