TFJob provides a Kubernetes custom resource that makes it easy to run distributed or non-distributed TensorFlow jobs on Kubernetes.
- For a complete reference of the custom resource definitions, please refer to the API Definition.
- For details on its design, please refer to the v1alpha2 design doc.
- For details on its observability, please refer to the monitoring design doc.
- Version >= 1.16 of Kubernetes
kubectl apply -f ./deploy/v1/tf-operator.yaml
Please refer to the quick-start-v1.md and Kubeflow user guide for more information.
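As a rough illustration of what such a custom resource can look like, here is a minimal sketch of a TFJob manifest with two workers. The job name, image, and command are hypothetical placeholders and are not taken from this repository; the API documentation remains the authoritative source for the schema.

```yaml
# Sketch only: the name, image, and command below are hypothetical placeholders.
apiVersion: kubeflow.org/v1
kind: TFJob
metadata:
  name: example-tfjob                # hypothetical job name
spec:
  tfReplicaSpecs:
    Worker:
      replicas: 2                    # number of worker pods to run
      restartPolicy: OnFailure
      template:
        spec:
          containers:
            - name: tensorflow       # container name the operator expects
              image: example.com/my-training-image:latest   # placeholder image
              command: ["python", "/opt/train.py"]          # placeholder entrypoint
```

Assuming the manifest is saved as `tfjob.yaml` (a hypothetical filename), it could be submitted with `kubectl apply -f tfjob.yaml` and inspected with `kubectl get tfjobs`.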
Please refer to the API documentation.
You can:
- Join our Slack channel.
- Check out who is using this operator.
This operator is part of Kubeflow, so please see the README in kubeflow/kubeflow to get in touch with the community.
Please refer to the developer_guide.
Please refer to the CHANGELOG.
The following table lists the most recent versions of the operator.
Operator Version | API Version | Kubernetes Version |
---|---|---|
latest (master HEAD) | v1 | 1.16+ |
v1.0.x | v1 | 1.16+ |