-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add PVC as storage option for LM-Eval results #325
Comments
Here is my thought on this feature, suggestions/comments are welcome
For the argument's sake, the new option is named type Outputs struct {
// +optional
PersistentVolumeClaim *corev1.PersistentVolumeClaim `json:"pvc,omitempty"`
} Only support PVC for now. One thing worth mentioning is If users want to create a PVC by themselves and then use it in an LM-Eval job, they can do that already. Here is an example and the name of the PVC that has been created is apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
name: evaljob-sample
spec:
model: hf
modelArgs:
- name: pretrained
value: google/flan-t5-base
taskList:
taskRecipes:
- card:
name: "cards.wnli"
template: "templates.classification.multi_class.relation.default"
logSamples: true
pod:
volumes:
- name: pvc
persistentVolumeClaim:
claimName: mypvc
container:
volumeMounts:
- mountPath: "/opt/app-root/src/output"
name: PVC |
If we want to make the case of using an existing PVC easier, the type Outputs struct {
// Use an existing PVC to store the outputs
// +optional
PersistentVolumeClaimName *string `json:"pvcName,omitempty"`
// Create a PVC and use it to store the outputs
// +optional
PersistentVolumeClaim *corev1.PersistentVolumeClaim `json:"pvc,omitempty"`
} When the apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
name: evaljob-sample
spec:
model: hf
modelArgs:
- name: pretrained
value: google/flan-t5-base
taskList:
taskRecipes:
- card:
name: "cards.wnli"
template: "templates.classification.multi_class.relation.default"
logSamples: true
outputs:
pvcName: mypvc |
Hi @yhwang, thanks for the suggestions. I think that's a good idea. IMO the two use cases (for the PVC scope, at the moment) are:
I think that adding the ability to specify a full PVC spec under the LMEvalJob CR might be avoided with your |
Your implementation looks great to me. Left a minor comment and I think it's good to go! |
For large, full-results (not just summaries) storing the evaluation results in the CR's status is not practical.
An option should be available to persist full evaluation results to PVC.
The text was updated successfully, but these errors were encountered: