Sync release with main for RHOAI 2.12 #110
Merged
dtrifiro merged 393 commits into release from sync-release-with-main on Jul 26, 2024
+42,884 −11,710
Commits
This pull request is big! We're only showing the most recent 250 commits.
Commits on Jul 5, 2024
[VLM] Improve consistency between feature size calculation and dummy data for profiling (vllm-project#6146)
Commits on Jul 8, 2024
[Kernel] Correctly invoke prefill & decode kernels for cross-attention (towards eventual encoder/decoder model support) (vllm-project#4888)
Commits on Jul 9, 2024
[hardware][cuda] use device id under CUDA_VISIBLE_DEVICES for get_device_capability (vllm-project#6216)
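The fix above concerns the distinction between logical and physical GPU ids: when CUDA_VISIBLE_DEVICES is set, the device indices a process sees are remapped onto the listed physical devices, so capability queries must use the logical index. A minimal sketch of that remapping, assuming integer device ids only (the helper name is hypothetical, not vLLM's actual code):

```python
import os

def logical_to_physical(logical_id: int) -> int:
    """Map a logical CUDA device id to its physical id, honoring
    CUDA_VISIBLE_DEVICES. Hypothetical helper for illustration;
    handles only comma-separated integer ids, not GPU UUIDs."""
    visible = os.environ.get("CUDA_VISIBLE_DEVICES")
    if visible is None or visible.strip() == "":
        # No restriction: logical ids equal physical ids.
        return logical_id
    physical_ids = [int(x) for x in visible.split(",") if x.strip()]
    return physical_ids[logical_id]

# With CUDA_VISIBLE_DEVICES="2,3", logical device 0 is physical GPU 2.
os.environ["CUDA_VISIBLE_DEVICES"] = "2,3"
print(logical_to_physical(0))  # → 2
print(logical_to_physical(1))  # → 3
```

Querying capabilities by physical id under such a mask is the kind of mismatch the referenced fix addresses: the process can only see (and must index by) the logical ids.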
Commits on Jul 10, 2024
[Speculative Decoding] Enabling bonus token in speculative decoding for KV cache based models (vllm-project#5765)
Commits on Jul 15, 2024
[Bugfix] Benchmark serving script used global parameter 'args' in function 'sample_random_requests' (vllm-project#6428)
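The `sample_random_requests` fix above is a common Python pitfall: a function silently reading a module-level `args` object instead of an explicit parameter, so callers passing different settings are ignored. A generic sketch of the failure mode and the fix, with illustrative names rather than the benchmark script's actual code:

```python
import random

# Buggy pattern: the function reaches for a module-level `args`,
# so it breaks (NameError) or misbehaves when imported elsewhere.
def sample_random_requests_buggy(num):
    return [random.randint(1, args.max_len) for _ in range(num)]  # global!

# Fixed pattern: the dependency becomes an explicit parameter.
def sample_random_requests(num, max_len):
    return [random.randint(1, max_len) for _ in range(num)]

random.seed(0)
reqs = sample_random_requests(5, max_len=10)
print(reqs)  # five values, each between 1 and 10
```

Making the dependency explicit also makes the function importable and testable in isolation, which is typically the motivation for this kind of bugfix.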
[Bugfix] use float32 precision in samplers/test_logprobs.py for comparing with HF (vllm-project#6409)
Commits on Jul 18, 2024
[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash (vllm-project#6501)
Commits on Jul 20, 2024
[Bugfix][CI/Build][Hardware][AMD] Fix AMD tests, add HF cache, update CK FA, add partially supported model notes (vllm-project#6543)
Commits on Jul 21, 2024
[Spec Decode] Disable Log Prob serialization to CPU for spec decoding for both draft and target models. (vllm-project#6485)