Can I mount a volume in a Katib Experiment?

Source: Stack Overflow

Date: 2024-02-15 17:45:19

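Question

I am using the .yaml file below to create a Katib experiment in Kubeflow. However, I get the error

Failed to reconcile: cannot restore struct from: string

Is there a solution for this? Most Katib experiment examples do not use volumes, but I am trying to mount a volume after downloading data from S3.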

apiVersion: "kubeflow.org/v1alpha3"
kind: Experiment
metadata:
  namespace: apple
  labels:
    controller-tools.k8s.io: "1.0"
  name: transformer-experiment
spec:
  objective:
    type: maximize
    goal: 0.8
    objectiveMetricName: Train-accuracy
    additionalMetricNames:
      - Train-loss
  algorithm:
    algorithmName: random
  parallelTrialCount: 3
  maxTrialCount: 12
  maxFailedTrialCount: 3
  metricsCollectorSpec:
    collector:
      kind: StdOut
  parameters:
    - name: --lr
      parameterType: double
      feasibleSpace:
        min: "0.01"
        max: "0.03"
    - name: --dropout_rate
      parameterType: double
      feasibleSpace:
        min: "0.005"
        max: "0.020"
    - name: --layer_count
      parameterType: int
      feasibleSpace:
        min: "2"
        max: "5"
    - name: --d_model_count
      parameterType: categorical
      feasibleSpace:
        list:
        - "64"
        - "128"
        - "256"
  trialTemplate:
    goTemplate:
        rawTemplate: |-
          apiVersion: batch/v1
          kind: Job
          metadata:
            name: {{.Trial}}
            namespace: {{.NameSpace}}
          spec:
            template:
              spec:
                volumes:
                - name: train-data
                  emptyDir: {}
                containers:
                - name: data-download
                  image: amazon/aws-cli
                  command:
                  - "aws s3 sync s3://kubeflow/kubeflowdata.tar.gz /train-data"
                  volumeMounts:
                  - name: train-data
                    mountPath: /train-data
                - name: {{.Trial}}
                  image: 
                  command:
                  - "cd /train-data"
                  - "ls"
                  - "python"
                  - "/opt/ml/src/main.py"
                  - "--train_batch=64"
                  - "--test_batch=64"
                  - "--num_workers=4"
                  volumeMounts:
                  - name: train-data
                    mountPath: /train-data
                  {{- with .HyperParameters}}
                  {{- range .}}
                  - "{{.Name}}={{.Value}}"
                  {{- end}}
                  {{- end}}
                restartPolicy: Never

Answer

As in the answer linked here, the following worked for me:

apiVersion: batch/v1
kind: Job
spec:
  template:
    spec:
      containers:
        - name: training-container
          image: docker.io/romeokienzler/claimed-train-mobilenet_v2:0.4
          command:
            - "ipython"
            - "/train-mobilenet_v2.ipynb"
            - "optimizer=${trialParameters.optimizer}"
          volumeMounts:
            - mountPath: /data/
              name: data-volume
      restartPolicy: Never
      volumes:
        - name: data-volume
          persistentVolumeClaim:
            claimName: data-pvc
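
One likely cause of the error in the question, offered as an assumption rather than a confirmed diagnosis: the range block over .HyperParameters comes right after volumeMounts, so each rendered "name=value" string is appended to the volumeMounts list, where Kubernetes expects objects rather than strings. Below is a minimal sketch of the training container with volumeMounts moved ahead of command, so that the hyperparameters land in the command list instead. It keeps the goTemplate fields from the question and leaves image blank as the question does. It omits the data-download container and the cd and ls entries, because each command item is passed as a separate argument and is not run through a shell.

```yaml
# Fragment of the Job spec inside rawTemplate (Go template, not plain YAML).
# Only the ordering differs from the question; this is a sketch, not a tested manifest.
spec:
  template:
    spec:
      volumes:
      - name: train-data
        emptyDir: {}
      containers:
      - name: {{.Trial}}
        image:            # left blank, as in the question
        # volumeMounts comes first so the templated items below cannot join it
        volumeMounts:
        - name: train-data
          mountPath: /train-data
        command:
        - "python"
        - "/opt/ml/src/main.py"
        - "--train_batch=64"
        - "--test_batch=64"
        - "--num_workers=4"
        # each hyperparameter is rendered as one more command entry
        {{- with .HyperParameters}}
        {{- range .}}
        - "{{.Name}}={{.Value}}"
        {{- end}}
        {{- end}}
      restartPolicy: Never
```

With this ordering, the rendered entries such as "--lr=0.01" extend the command list, and volumeMounts contains only name/mountPath objects.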
