-
Notifications
You must be signed in to change notification settings - Fork 606
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support file paths for ONNX predictor #1711
Conversation
# Conflicts: # docs/workloads/realtime/configuration.md
…paths # Conflicts: # pkg/types/spec/errors.go # pkg/types/spec/validations.go
…paths # Conflicts: # cli/local/model_cache.go # pkg/types/spec/validations.go # pkg/workloads/cortex/lib/model/type.py
pkg/operator/operator/k8s.go
Outdated
if api.Predictor.Models.Path != nil && strings.HasSuffix(*api.Predictor.Models.Path, ".onnx") { | ||
downloadContainerArs = append(downloadContainerArs, downloadContainerArg{ | ||
From: *api.Predictor.Models.Path, | ||
To: path.Join(_modelDir, consts.SingleModelName, "1"), |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: I think if we add ItemName: "the onnx model"
to the struct, it will print a nice message in the downloader container
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thought of not doing it because adding this log would have been misleading since we advertise the predictor.models
predictor as having the LR active. That being said, I actually think this is a good idea because it highlights to the user that a model gets downloaded at deploy time (suggestive that it's a special case and LR is not active) and it's also potentially easier for us to debug if the user shares their logs.
# Conflicts: # cli/local/docker_spec.go # cli/local/model_cache.go # docs/workloads/batch/configuration.md # docs/workloads/realtime/configuration.md # docs/workloads/realtime/models.md # docs/workloads/realtime/predictors.md # pkg/cortex/serve/cortex_internal/lib/model/type.py # pkg/operator/operator/k8s.go # pkg/types/spec/errors.go # pkg/types/spec/validations.go # pkg/types/userconfig/api.go
Closes #1686.
Blocked on #1702.
checklist:
make test
andmake lint