Bump onnxruntime.version from 1.19.2 to 1.20.0 #680

dependabot · 2024-11-04T04:47:43Z

Bumps onnxruntime.version from 1.19.2 to 1.20.0.
Updates com.microsoft.onnxruntime:onnxruntime from 1.19.2 to 1.20.0

Release notes

Sourced from com.microsoft.onnxruntime:onnxruntime's releases.

ONNX Runtime v1.20.0

Release Manager: @apsonawane

Announcements

All ONNX Runtime Training packages have been deprecated. ORT 1.19.2 was the last release for which onnxruntime-training (PyPI), onnxruntime-training-cpu (PyPI), Microsoft.ML.OnnxRuntime.Training (NuGet), onnxruntime-training-c (CocoaPods), onnxruntime-training-objc (CocoaPods), and onnxruntime-training-android (Maven Central) were published.

ONNX Runtime packages will stop supporting Python 3.8 and Python 3.9. This decision aligns with NumPy Python version support. To continue using ORT with Python 3.8 and Python 3.9, you can use ORT 1.19.2 and earlier.

ONNX Runtime 1.20 CUDA packages will include new dependencies that were not required in 1.19 packages. The following dependencies are new: libcudnn_adv.so.9, libcudnn_cnn.so.9, libcudnn_engines_precompiled.so.9, libcudnn_engines_runtime_compiled.so.9, libcudnn_graph.so.9, libcudnn_heuristic.so.9, libcudnn_ops.so.9, libnvrtc.so.12, and libz.so.1.
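Before moving the GPU artifact to 1.20.0, it may help to confirm that the libraries listed above are present on the target machine. The following is a minimal sketch, not part of ONNX Runtime: the class name `CudaDependencyCheck` and the method `findMissing` are hypothetical, and the check only looks in directories you pass in (or in `LD_LIBRARY_PATH`), so it does not cover everything the dynamic linker searches, such as the ldconfig cache.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

/** Sketch: report which newly required shared libraries are absent from a set of directories. */
final class CudaDependencyCheck {

    // File names copied from the ONNX Runtime 1.20 announcement.
    static final List<String> NEW_DEPENDENCIES = List.of(
            "libcudnn_adv.so.9",
            "libcudnn_cnn.so.9",
            "libcudnn_engines_precompiled.so.9",
            "libcudnn_engines_runtime_compiled.so.9",
            "libcudnn_graph.so.9",
            "libcudnn_heuristic.so.9",
            "libcudnn_ops.so.9",
            "libnvrtc.so.12",
            "libz.so.1");

    /** Returns the names from NEW_DEPENDENCIES that exist in none of the given directories. */
    static List<String> findMissing(List<Path> searchDirs) {
        List<String> missing = new ArrayList<>();
        for (String name : NEW_DEPENDENCIES) {
            boolean found = false;
            for (Path dir : searchDirs) {
                if (Files.exists(dir.resolve(name))) {
                    found = true;
                    break;
                }
            }
            if (!found) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Assumption: directories come from the command line; if none are given,
        // fall back to the entries of LD_LIBRARY_PATH (which may be unset).
        List<Path> dirs = new ArrayList<>();
        if (args.length > 0) {
            for (String arg : args) {
                dirs.add(Path.of(arg));
            }
        } else {
            String ldPath = System.getenv("LD_LIBRARY_PATH");
            if (ldPath != null) {
                for (String entry : ldPath.split(":")) {
                    if (!entry.isEmpty()) {
                        dirs.add(Path.of(entry));
                    }
                }
            }
        }
        List<String> missing = findMissing(dirs);
        if (missing.isEmpty()) {
            System.out.println("All listed libraries were found in the searched directories.");
        } else {
            System.out.println("Not found in the searched directories: " + missing);
        }
    }
}
```

A "not found" result here means only that the file is not in the searched directories; the library may still be resolvable through other linker paths.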

Build System & Packages

Python 3.13 support is included in PyPI packages.

ONNX 1.17 support will be delayed until a future release, but the ONNX version used by ONNX Runtime has been patched to include a shape inference change to the Einsum op.

DLLs in the Maven build are now digitally signed (fix for issue reported here).

(Experimental) vcpkg support added for the CPU EP. The DML EP does not yet support vcpkg, and other EPs have not been tested.

Core

MultiLoRA support.

Reduced memory utilization.

Fixed alignment that was causing mmap to fail for external weights.

Eliminated double allocations when deserializing external weights.

Added ability to serialize pre-packed weights so that they don’t cause an increase in memory utilization when the model is loaded.

Support for bfloat16 and float8 data types in the Python I/O binding API.

Performance

INT4 quantized embedding support on CPU and CUDA EPs.

Miscellaneous performance improvements and bug fixes.

EPs

CPU

FP16 support for MatMulNbits, Clip, and LayerNormalization ops.

CUDA

Added support for cuDNN Flash Attention and Lean Attention in the MultiHeadAttention op.

TensorRT

TensorRT 10.4 and 10.5 support.

QNN

QNN HTP support for weight sharing across multiple ORT inference sessions. (See ORT QNN EP documentation for more information.)

Support for QNN SDK 2.27.

OpenVINO

Added support up to OpenVINO 2024.4.1.

Compile-time memory optimizations.

Enhancement of ORT EPContext Session option for optimized first inference latency.

Added remote tensors to ensure direct memory access for inferencing on NPU.

DirectML

DirectML 1.15.2 support.

... (truncated)

Commits

c4fb724 ORT 1.20.0 release preparation: Cherry pick round 2 (#22643)
2d00351 ORT 1.20.0 Release: Cherry pick round 1 (#22526)
f9e623e Update CMake to 3.31.0rc1 (#22433)
691de83 Enable BrowserStack tests (#22457)
bf60442 [ROCm] Update ROCm Nuget pipeline to ROCm 6.2 (#22461)
2b8fc55 Enable RunMatMulTest all test cases support FP16 (#22440)
af00a20 Change ORT nightly python packages' name (#22450)
a5e85a9 Fix training artifacts for 2GB+ models and MSELoss (#22414)
6407d81 Disable BrowserStack testing stage (#22438)
4c47bca [MIGraphX EP] Add additional operators (#22446)
Additional commits viewable in compare view

Updates com.microsoft.onnxruntime:onnxruntime_gpu from 1.19.2 to 1.20.0

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot merge will merge this PR after your CI passes on it
@dependabot squash and merge will squash and merge this PR after your CI passes on it
@dependabot cancel merge will cancel a previously requested merge and block automerging
@dependabot reopen will reopen this PR if it is closed
@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps `onnxruntime.version` from 1.19.2 to 1.20.0.

Updates `com.microsoft.onnxruntime:onnxruntime` from 1.19.2 to 1.20.0
- [Release notes](https://github.com/microsoft/onnxruntime/releases)
- [Changelog](https://github.com/microsoft/onnxruntime/blob/main/docs/ReleaseManagement.md)
- [Commits](microsoft/onnxruntime@v1.19.2...v1.20.0)

Updates `com.microsoft.onnxruntime:onnxruntime_gpu` from 1.19.2 to 1.20.0
- [Release notes](https://github.com/microsoft/onnxruntime/releases)
- [Changelog](https://github.com/microsoft/onnxruntime/blob/main/docs/ReleaseManagement.md)
- [Commits](microsoft/onnxruntime@v1.19.2...v1.20.0)

---
updated-dependencies:
- dependency-name: com.microsoft.onnxruntime:onnxruntime
  dependency-type: direct:production
  update-type: version-update:semver-minor
- dependency-name: com.microsoft.onnxruntime:onnxruntime_gpu
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
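The commit metadata above labels both updates as `version-update:semver-minor`. As a rough illustration of what that label means for 1.19.2 to 1.20.0, here is a minimal sketch that compares two MAJOR.MINOR.PATCH strings. It is not Dependabot's implementation; the class `UpdateType`, the method `classify`, and the returned strings are hypothetical names chosen to mirror the metadata.

```java
/** Sketch: classify the difference between two MAJOR.MINOR.PATCH version strings. */
final class UpdateType {

    /** Returns "semver-major", "semver-minor", "semver-patch", or "none". */
    static String classify(String from, String to) {
        int[] a = parse(from);
        int[] b = parse(to);
        if (a[0] != b[0]) {
            return "semver-major";
        }
        if (a[1] != b[1]) {
            return "semver-minor";
        }
        if (a[2] != b[2]) {
            return "semver-patch";
        }
        return "none";
    }

    // Assumption: versions are plain numeric triples with no qualifiers such as "-rc1".
    private static int[] parse(String version) {
        String[] parts = version.split("\\.");
        if (parts.length != 3) {
            throw new IllegalArgumentException("expected MAJOR.MINOR.PATCH: " + version);
        }
        return new int[] {
            Integer.parseInt(parts[0]),
            Integer.parseInt(parts[1]),
            Integer.parseInt(parts[2])
        };
    }

    public static void main(String[] args) {
        // The versions from this PR differ in the minor component.
        System.out.println(classify("1.19.2", "1.20.0")); // prints "semver-minor"
    }
}
```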

dependabot bot added the dependencies (Pull requests that update a dependency file) and java (Pull requests that update Java code) labels Nov 4, 2024

kinow approved these changes Nov 4, 2024
