You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/start/multinode.rst
+118Lines changed: 118 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -71,6 +71,124 @@ Slurm
71
71
-----
72
72
TBD
73
73
74
+
dstack
75
+
------
76
+
`dstackai/dstack <https://github.com/dstackai/dstack>`_ is an open-source container orchestrator that simplifies distributed training across cloud providers and on-premises environments
77
+
without the need to use K8S or Slurm.
78
+
79
+
Prerequisite
80
+
~~~~~~~~~~~~
81
+
Once dstack is `installed <https://dstack.ai/docs/installation>`_, initialize the directory as a repo with ``dstack init``.
82
+
83
+
.. code-block:: bash
84
+
85
+
mkdir myproject &&cd myproject
86
+
dstack init
87
+
88
+
**Create a fleet**
89
+
90
+
Before submitting distributed training jobs, create a `dstack` `fleet <https://dstack.ai/docs/concepts/fleets>`_.
91
+
92
+
Run a Ray cluster task
93
+
~~~~~~~~~~~~~~~~~~~~~~
94
+
95
+
Once the fleet is created, define a Ray cluster task, e.g. in ``ray-cluster.dstack.yml``:
0 commit comments