MPI Operator

Last updated: 2022-10-12 11:37:14

    Overview

    Developed by the Kubeflow community, MPI-Operator is an add-on that deploys and runs data-parallel distributed training jobs, such as those built with Horovod, in a Kubernetes cluster.

    After deployment, you can create, view, and delete MPI jobs.
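As an illustration of what an MPI job can look like, here is a minimal sketch of an MPIJob manifest. It is not taken from this page: the `kubeflow.org/v1` API version is an assumption that depends on which MPI-Operator version is deployed, and the job name, image, and training command are hypothetical placeholders.

```yaml
# Minimal MPIJob sketch. Assumes the deployed operator serves kubeflow.org/v1;
# older or newer operator releases may use a different apiVersion.
apiVersion: kubeflow.org/v1
kind: MPIJob
metadata:
  name: horovod-example                # hypothetical job name
spec:
  slotsPerWorker: 1                    # MPI slots (processes) per worker pod
  mpiReplicaSpecs:
    Launcher:
      replicas: 1                      # a single launcher pod runs mpirun
      template:
        spec:
          containers:
            - name: launcher
              image: example.com/horovod-train:latest   # placeholder image
              # Hypothetical command: 2 processes, one per worker below
              command: ["mpirun", "-np", "2", "python", "/train.py"]
    Worker:
      replicas: 2                      # worker pods that run the training processes
      template:
        spec:
          containers:
            - name: worker
              image: example.com/horovod-train:latest   # placeholder image
```

Under these assumptions, a manifest saved as a file would typically be created with `kubectl apply -f <file>`, viewed with `kubectl get mpijobs`, and deleted with `kubectl delete mpijob horovod-example`.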

    Prerequisites

    A Kubernetes cluster running v1.16 or later

    Deployment

    MPI-Operator is deployed with Helm, and all of its configuration items are defined in values.yaml.

    The fields you are most likely to customize are listed below:

    Parameter           Description                                                     Default Value
    image.repository    The repository where the MPI-Operator image resides            ccr.ccs.tencentyun.com/kubeflow-oteam/mpi-operator
    image.tag           MPI-Operator image version                                      "latest"
    namespace.create    Whether to create a separate namespace for MPI-Operator         true
    namespace.name      The namespace where MPI-Operator is to be deployed              "mpi-operator"
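
The dotted parameter names above would normally correspond to nested keys in values.yaml. The fragment below is a sketch of that mapping using only the four documented fields and their default values; it assumes the chart follows the usual Helm nesting convention, and any other keys the chart may define are not shown.

```yaml
# Sketch of a values.yaml override built from the documented defaults.
image:
  repository: ccr.ccs.tencentyun.com/kubeflow-oteam/mpi-operator  # image.repository
  tag: "latest"                                                   # image.tag
namespace:
  create: true            # namespace.create: create a separate namespace
  name: "mpi-operator"    # namespace.name: where MPI-Operator is deployed
```

A file like this is usually passed to Helm with the `-f` option at install time. Only fields that differ from the defaults need to be included.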