
Commit

ROCm and AMD ROCm used consistently
tara-det-ai committed Sep 10, 2024
1 parent bf83fb5 commit 7b4b376
Showing 9 changed files with 20 additions and 20 deletions.
4 changes: 2 additions & 2 deletions docs/model-dev-guide/api-guides/apis-howto/_index.rst
@@ -56,8 +56,8 @@ Prefer to use an Example Model?
If you'd like to build off of an existing model that already runs on Determined, visit our
:ref:`example-solutions` to see if the model you'd like to train is already available.

-ROCm Support
-============
+AMD ROCm Support
+================

For AMD ROCm support, visit :ref:`rocm-support`

2 changes: 1 addition & 1 deletion docs/model-dev-guide/create-experiment.rst
@@ -26,7 +26,7 @@ Launcher options include:
- A command with arguments, run in the container

If you're using AMD ROCm GPUs, make sure to specify ``slot_type: rocm`` in your experiment
-configuration. For more information on ROCm support, see :ref:`AMD ROCm Support <rocm-support>`.
+configuration. For more information on AMD ROCm support, see :ref:`AMD ROCm Support <rocm-support>`.

For distributed training, separate the launcher that starts distributed workers from your training
script, which typically runs each worker. The distributed training launcher must:
@@ -29,5 +29,5 @@ preparation needed.
If you need to add additional customization to the training environment, review the
:ref:`custom-env` page.

-For details on using ROCm-enabled images, including our ROCm 6.1 images with DeepSpeed support for
-MI300x users, see our :ref:`AMD ROCm Support documentation <rocm-support>`.
+For details on using AMD ROCm-enabled images, including our ROCm 6.1 images with DeepSpeed support
+for MI300x users, see our :ref:`AMD ROCm Support documentation <rocm-support>`.
2 changes: 1 addition & 1 deletion docs/reference/deploy/agent-config-reference.rst
@@ -81,7 +81,7 @@ option was called ``gpu``.

``cpu``: Map all the CPUs to a slot, even when GPUs are present.

-``rocm``: The agent will map each detected ROCm AMD GPU to a slot.
+``rocm``: The agent will map each detected AMD ROCm GPU to a slot.

****************
``http_proxy``
2 changes: 1 addition & 1 deletion docs/setup-cluster/_index.rst
@@ -97,4 +97,4 @@ Enable Determined to submit jobs to a Slurm cluster.
Deploy on Kubernetes <k8s/_index>
Deploy on Slurm/PBS <slurm/_index>
Cluster Configuration <cluster-configuration>
-ROCm Support <rocm-support>
+AMD ROCm Support <rocm-support>
2 changes: 1 addition & 1 deletion docs/setup-cluster/k8s/helm-commands.rst
@@ -40,7 +40,7 @@ It is recommended to have just one instance of Determined per Kubernetes cluster
****************************

For specific configuration details related to AMD ROCm GPUs, including how to set up resource pools
-and configure experiments, see our :ref:`guide on Configuring Kubernetes for ROCm GPUs
+and configure experiments, see our :ref:`guide on Configuring Kubernetes for AMD ROCm GPUs
<rocm-config-k8s>`.

**************************************
2 changes: 1 addition & 1 deletion docs/setup-cluster/k8s/k8s-dev-guide.rst
@@ -16,7 +16,7 @@ Kubernetes manually, or you can use a managed Kubernetes service such as :ref:`G
.. note::

For information on using AMD ROCm GPUs with Determined on Kubernetes, please refer to our
-:ref:`ROCm Support Guide <rocm-support>`.
+:ref:`AMD ROCm Support Guide <rocm-support>`.

**********************************
Set up a Development Environment
18 changes: 9 additions & 9 deletions docs/setup-cluster/rocm-support.rst
@@ -14,7 +14,7 @@

.. note::

-ROCm support in Determined is experimental. Features and configurations may change in future
+AMD ROCm support in Determined is experimental. Features and configurations may change in future
releases. We recommend testing thoroughly in a non-production environment before deploying to
production.

@@ -35,11 +35,11 @@ For more detailed information about configuration, visit the :ref:`helm-config-r

.. _rocm-config-k8s:

-**************************************
- Configuring Kubernetes for ROCm GPUs
-**************************************
+******************************************
+ Configuring Kubernetes for AMD ROCm GPUs
+******************************************

-To use ROCm GPUs in your Kubernetes deployment:
+To use AMD ROCm GPUs in your Kubernetes deployment:

#. Ensure your Kubernetes cluster has nodes with ROCm-capable GPUs and the necessary drivers
installed.
@@ -59,11 +59,11 @@ To use ROCm GPUs in your Kubernetes deployment:
#. When submitting experiments or launching tasks, specify ``slot_type: rocm`` in your experiment
configuration.

-**********************************
- Using ROCm Images in Experiments
-**********************************
+**************************************
+ Using AMD ROCm Images in Experiments
+**************************************

-To use ROCm images in your experiments, specify the image in your experiment configuration:
+To use AMD ROCm images in your experiments, specify the image in your experiment configuration:

.. code:: yaml
4 changes: 2 additions & 2 deletions docs/setup-cluster/slurm/slurm-known-issues.rst
@@ -392,10 +392,10 @@ Some constraints are due to differences in behavior between Docker and Singulari
<pbs-config-requirements>`.

***********************
- AMD/ROCm Known Issues
+ AMD ROCm Known Issues
***********************

-For AMD/ROCm support and known issues, visit :ref:`AMD ROCm GPUs <rocm-support>`.
+For AMD ROCm support and known issues, visit :ref:`AMD ROCm GPUs <rocm-support>`.

***************************************
Determined AI Experiment Requirements
