Add ChatQnA megaservice E2E (frontend) metric based autoscaling support #866
Description
An additional ChatQnA values file can be used to apply frontend metric based scaling to all HPA-controlled components. It should be applied on top of the base HPA values file.
Custom metrics are provided for all components that have HPA enabled, even if they have been configured to use frontEndMetrics. That way, users can easily switch their scaling between frontend and backend metrics by re-installing the Helm chart (because the Prometheus-adapter custom metrics configMap does not change, its manual install step can be skipped).
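As a rough sketch of what applying one values file "on top of" another looks like with Helm: the file names, release name, and chart path below are hypothetical placeholders, not the names used in this PR. Only the layering of `-f` options is the point.

```shell
# Hypothetical names: substitute the actual files and chart path from your checkout.
BASE_HPA_VALUES=hpa-values.yaml            # base HPA values file (assumed name)
FRONTEND_VALUES=frontend-hpa-values.yaml   # frontend-metric overlay file (assumed name)

# Helm merges -f files left to right, so the file listed last takes precedence.
# Listing the overlay after the base file applies it "on top".
helm install chatqna ./chatqna -f "$BASE_HPA_VALUES" -f "$FRONTEND_VALUES"
```

Under the same assumptions, re-installing with only the base file would presumably switch scaling back to the backend metrics.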
Issues
n/a
Type of change
Dependencies
Manual testing with this change revealed an issue with the E2E metric used for scaling, which needs to be fixed first: opea-project/GenAIComps#1121
Tests
Manually tested that HPA scaling works based on the frontend metric.