Add PVC as storage option for LM-Eval results #325

Open
ruivieira opened this issue Oct 17, 2024 · 2 comments
@ruivieira (Member)

For large, full results (not just summaries), storing the evaluation results in the CR's status is not practical.
An option should be available to persist full evaluation results to a PVC.

@ruivieira added the kind/enhancement and lm-eval labels Oct 17, 2024
@ruivieira ruivieira added this to the LM-Eval milestone Oct 17, 2024
@yhwang (Collaborator) commented Oct 22, 2024

Here are my thoughts on this feature; suggestions/comments are welcome:

  • provide an option to save the outputs, including stdout, stderr, results, and the sampling log, to external storage
  • for external storage, we can support different types, e.g. PVC, COS, etc. Let's add PVC support first.

For the sake of argument, the new option is named Outputs, and users can specify a full PersistentVolumeClaim data struct. The LM-Eval controller would create the PVC, mount it at /opt/app-root/src/output, and store all the outputs in the PVC. The data struct for Outputs could be:

type Outputs struct {
	// +optional
	PersistentVolumeClaim *corev1.PersistentVolumeClaim `json:"pvc,omitempty"`
}

Only PVC would be supported for now.
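
For illustration, an LMEvalJob using this new option might look like the example below. The exact serialized shape of the embedded corev1.PersistentVolumeClaim (the metadata/spec split, and the placeholder name, size, and access mode) is an assumption to show the idea, not a settled design:

apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
  name: evaljob-sample
spec:
  model: hf
  modelArgs:
  - name: pretrained
    value: google/flan-t5-base
  taskList:
    taskRecipes:
    - card:
        name: "cards.wnli"
      template: "templates.classification.multi_class.relation.default"
  logSamples: true
  outputs:
    pvc:
      metadata:
        name: evaljob-outputs   # placeholder name for the PVC the controller would create
      spec:
        accessModes:
        - ReadWriteOnce
        resources:
          requests:
            storage: 5Gi        # placeholder size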

One thing worth mentioning: if users want to create a PVC themselves and then use it in an LM-Eval job, they can already do that. Here is an example, where the name of the pre-created PVC is mypvc:

apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
  name: evaljob-sample
spec:
  model: hf
  modelArgs:
  - name: pretrained
    value: google/flan-t5-base
  taskList:
    taskRecipes:
    - card:
        name: "cards.wnli"
      template: "templates.classification.multi_class.relation.default"
  logSamples: true
  pod:
    volumes:
      - name: pvc
        persistentVolumeClaim:
          claimName: mypvc
    container:
      volumeMounts:
      - mountPath: "/opt/app-root/src/output"
        name: pvc

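For completeness, mypvc in the example above is just a regular Kubernetes PVC created beforehand; something like the following would work (size and access mode are placeholders):

apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: mypvc
spec:
  accessModes:
  - ReadWriteOnce
  resources:
    requests:
      storage: 5Gi
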
@yhwang (Collaborator) commented Oct 23, 2024

If we want to make the case of using an existing PVC easier, the Outputs data struct could be:

type Outputs struct {
	// Use an existing PVC to store the outputs
	// +optional
	PersistentVolumeClaimName *string `json:"pvcName,omitempty"`
	// Create a PVC and use it to store the outputs
	// +optional
	PersistentVolumeClaim *corev1.PersistentVolumeClaim `json:"pvc,omitempty"`
}

When pvcName is specified, the LM-Eval controller will create the corresponding volumes and volumeMounts for the job pod and store the outputs in the PVC. Basically, it turns the LMEvalJob example in the previous comment into this:

apiVersion: trustyai.opendatahub.io/v1alpha1
kind: LMEvalJob
metadata:
  name: evaljob-sample
spec:
  model: hf
  modelArgs:
  - name: pretrained
    value: google/flan-t5-base
  taskList:
    taskRecipes:
    - card:
        name: "cards.wnli"
      template: "templates.classification.multi_class.relation.default"
  logSamples: true
  outputs:
    pvcName: mypvc

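To make the controller side concrete, here is a minimal Go sketch of how pvcName could be turned into the volume/volumeMount pair from the earlier example. The helper name buildOutputStorage and the hard-coded volume name are assumptions for illustration, not actual TrustyAI controller code:

package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
)

// outputMountPath is where LM-Eval writes its outputs inside the job container.
const outputMountPath = "/opt/app-root/src/output"

// buildOutputStorage is a hypothetical helper: given outputs.pvcName, it
// returns the volume and volumeMount the controller would attach to the
// job pod.
func buildOutputStorage(pvcName string) (corev1.Volume, corev1.VolumeMount) {
	volume := corev1.Volume{
		Name: "outputs", // assumed volume name; the mount below must match it
		VolumeSource: corev1.VolumeSource{
			PersistentVolumeClaim: &corev1.PersistentVolumeClaimVolumeSource{
				ClaimName: pvcName,
			},
		},
	}
	mount := corev1.VolumeMount{
		Name:      volume.Name,
		MountPath: outputMountPath,
	}
	return volume, mount
}

func main() {
	v, m := buildOutputStorage("mypvc")
	fmt.Printf("volume %q -> claim %q, mounted at %q\n",
		v.Name, v.PersistentVolumeClaim.ClaimName, m.MountPath)
}
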
@yhwang yhwang self-assigned this Oct 23, 2024