dstackai
diff --git a/‎docs/docs/concepts/backends.md‎
Lines changed: 3 additions & 3 deletions b/‎docs/docs/concepts/backends.md‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎docs/docs/guides/protips.md‎
Lines changed: 4 additions & 4 deletions b/‎docs/docs/guides/protips.md‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎docs/docs/reference/cli/dstack/offer.md‎
Lines changed: 4 additions & 4 deletions b/‎docs/docs/reference/cli/dstack/offer.md‎
Lines changed: 4 additions & 4 deletions
diff --git a/‎docs/docs/reference/dstack.yml/service.md‎
Lines changed: 11 additions & 1 deletion b/‎docs/docs/reference/dstack.yml/service.md‎
Lines changed: 11 additions & 1 deletion
diff --git a/‎docs/docs/reference/server/config.yml.md‎
Lines changed: 0 additions & 2 deletions b/‎docs/docs/reference/server/config.yml.md‎
Lines changed: 0 additions & 2 deletions
diff --git a/‎docs/examples.md‎
Lines changed: 10 additions & 0 deletions b/‎docs/examples.md‎
Lines changed: 10 additions & 0 deletions
diff --git a/‎docs/examples/clusters/nebius/index.md‎ b/‎docs/examples/clusters/nebius/index.md‎
diff --git a/‎examples/clusters/crusoe/README.md‎
Lines changed: 25 additions & 23 deletions b/‎examples/clusters/crusoe/README.md‎
Lines changed: 25 additions & 23 deletions
diff --git a/‎examples/clusters/lambda/README.md‎
Lines changed: 17 additions & 17 deletions b/‎examples/clusters/lambda/README.md‎
Lines changed: 17 additions & 17 deletions
@@ -853,7 +853,7 @@ Then, go ahead and configure the backend:
 projects:
   - name: main
     backends:
-      - type: datacrunch
+      - type: verda
         creds:
           type: api_key
           client_id: xfaHBqYEsArqhKWX-e52x3HH7w8T
@@ -1049,13 +1049,13 @@ projects:
       verbs: ["get", "create"]
     - apiGroups: [""]
       resources: ["pods"]
-      verbs: ["get", "create", "delete"]
+      verbs: ["get", "create", "delete", "list"]
     - apiGroups: [""]
       resources: ["services"]
       verbs: ["get", "create", "delete"]
     - apiGroups: [""]
       resources: ["nodes"]
-      verbs: ["list"]
+      verbs: ["list", "get"]
     ```
 
     Ensure you've created a ClusterRoleBinding to grant the role to the user or the service account you're using.
 
@@ -439,10 +439,10 @@ Getting offers...
 ---> 100%
 
  #   BACKEND     REGION     INSTANCE TYPE          RESOURCES                                     SPOT  PRICE   
- 1   datacrunch  FIN-01     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
- 2   datacrunch  FIN-02     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
- 3   datacrunch  FIN-02     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
- 4   datacrunch  ICE-01     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 1   verda       FIN-01     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 2   verda       FIN-02     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 3   verda       FIN-02     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 4   verda       ICE-01     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
  5   runpod      US-KS-2    NVIDIA H100 PCIe       16xCPU, 251GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.39   
  6   runpod      CA         NVIDIA H100 80GB HBM3  24xCPU, 251GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.69   
  7   nebius      eu-north1  gpu-h100-sxm           16xCPU, 200GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.95   
 
@@ -58,10 +58,10 @@ Getting offers...
 ---> 100%
 
  #   BACKEND     REGION     INSTANCE TYPE          RESOURCES                                     SPOT  PRICE   
- 1   datacrunch  FIN-01     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
- 2   datacrunch  FIN-02     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
- 3   datacrunch  FIN-02     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
- 4   datacrunch  ICE-01     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 1   verda       FIN-01     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 2   verda       FIN-02     1H100.80S.30V          30xCPU, 120GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 3   verda       FIN-02     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
+ 4   verda       ICE-01     1H100.80S.32V          32xCPU, 185GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.19   
  5   runpod      US-KS-2    NVIDIA H100 PCIe       16xCPU, 251GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.39   
  6   runpod      CA         NVIDIA H100 80GB HBM3  24xCPU, 251GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.69   
  7   nebius      eu-north1  gpu-h100-sxm           16xCPU, 200GB, 1xH100 (80GB), 100.0GB (disk)  no    $2.95   
 
@@ -63,7 +63,7 @@ The `service` configuration type allows running [services](../../concepts/servic
         1. Doesn't work if your `chat_template` uses `bos_token`. As a workaround, replace `bos_token` inside `chat_template` with the token content itself.
         2. Doesn't work if `eos_token` is defined in the model repository as a dictionary. As a workaround, set `eos_token` manually, as shown in the example above (see Chat template).
 
-        If you encounter any other issues, please make sure to file a
+        If you encounter any ofther issues, please make sure to file a
         [GitHub issue](https://github.com/dstackai/dstack/issues/new/choose).
 
 ### `scaling`
@@ -127,6 +127,16 @@ The `service` configuration type allows running [services](../../concepts/servic
         required: true
 
 
+### `replicas`
+
+#### `replicas[n]`
+
+#SCHEMA# dstack._internal.core.models.configurations.ReplicaGroup
+    overrides:
+      show_root_heading: false
+      type:
+        required: true
+
 ### `retry`
 
 #SCHEMA# dstack._internal.core.models.profiles.ProfileRetry
 
@@ -14,8 +14,6 @@ to configure [backends](../../concepts/backends.md) and other [server-level sett
 #SCHEMA# dstack._internal.server.services.config.ProjectConfig
     overrides:
         show_root_heading: false
-        backends:
-            type: 'Union[AWSBackendConfigWithCreds, AzureBackendConfigWithCreds, GCPBackendConfigWithCreds, HotAisleBackendConfigWithCreds, LambdaBackendConfigWithCreds, NebiusBackendConfigWithCreds, RunpodBackendConfigWithCreds, VastAIBackendConfigWithCreds, KubernetesConfig]'
 
 #### `projects[n].backends` { #backends data-toc-label="backends" }
 
 
@@ -122,6 +122,16 @@ hide:
             Set up Crusoe clusters with optimized networking
         </p>
     </a>
+    <a href="/examples/clusters/nebius"
+       class="feature-cell sky">
+        <h3>
+            Nebius
+        </h3>
+
+        <p>
+            Set up Nebius clusters with optimized networking
+        </p>
+    </a>
     <a href="/examples/clusters/nccl-rccl-tests"
        class="feature-cell sky">
         <h3>
 
@@ -1,24 +1,25 @@
 ---
 title: Crusoe
-description: Setting up Crusoe clusters using Managed Kubernetes or VMs with InfiniBand support
+description: Using Crusoe clusters with InfiniBand support via Kubernetes or VMs
 ---
 
 # Crusoe
 
-Crusoe offers two ways to use clusters with fast interconnect:
+`dstack` allows using Crusoe clusters with fast interconnect via two ways:
 
-* [Crusoe Managed Kubernetes](#kubernetes) – Lets you interact with clusters through the Kubernetes API and includes support for NVIDIA and AMD GPU operators and related tools.
-* [Virtual Machines (VMs)](#vms) – Gives you direct access to clusters in the form of virtual machines with NVIDIA and AMD GPUs.
+* [Kubernetes](#kubernetes) – If you create a Kubernetes cluster on Crusoe and configure a `kubernetes` backend and create a backend fleet in `dstack`, `dstack` lets you fully use this cluster through `dstack`.
+* [VMs](#vms) – If you create a VM cluster on Crusoe and create an SSH fleet in `dstack`, `dstack` lets you fully use this cluster through `dstack`.
+  
+## Kubernetes
 
-Both options use the same underlying networking infrastructure. This example walks you through how to set up Crusoe clusters to use with `dstack`.
+### Create a cluster
 
-## Crusoe Managed Kubernetes { #kubernetes }
+1. Go `Networking` → `Firewall Rules`, click `Create Firewall Rule`, and allow ingress traffic on port `30022`. This port will be used by the `dstack` server to access the jump host.
+2. Go to `Orchestration` and click `Create Cluster`. Make sure to enable the `NVIDIA GPU Operator` add-on.
+3. Go the the cluster, and click `Create Node Pool`. Select the right type of the instance, and  `Desired Number of Nodes`.
+4. Wait until nodes are provisioned.
 
-!!! info "Prerequsisites"
-    1. Go `Networking` → `Firewall Rules`, click `Create Firewall Rule`, and allow ingress traffic on port `30022`. This port will be used by the `dstack` server to access the jump host.
-    2. Go to `Orchestration` and click `Create Cluster`. Make sure to enable the `NVIDIA GPU Operator` add-on.
-    3. Go the the cluster, and click `Create Node Pool`. Select the right type of the instance. If you intend to auto-scale the cluster, make sure to set `Desired Number of Nodes` at least to `1`, since `dstack` doesn't currently support clusters that scale down to `0` nodes.
-    4. Wait until at least one node is running.
+> Even if you enable `autoscaling`, `dstack` can use only the nodes that are already provisioned.
 
 ### Configure the backend
 
@@ -56,7 +57,7 @@ backends: [kubernetes]
 
 resources:
   # Specify requirements to filter nodes
-  gpu: 1..8
+  gpu: 8
 ```
 
 </div>
@@ -75,12 +76,13 @@ Once the fleet is created, you can run [dev environments](https://dstack.ai/docs
 
 ## VMs
 
-Another way to work with Crusoe clusters is through VMs. While `dstack` typically supports VM-based compute providers via [dedicated backends](https://dstack.ai/docs/concepts/backends#vm-based) that automate provisioning, Crusoe does not yet have [such a backend](https://github.com/dstackai/dstack/issues/3378). As a result, to use a VM-based Crusoe cluster with `dstack`, you should use [SSH fleets](https://dstack.ai/docs/concepts/fleets).
+Another way to work with Crusoe clusters is through VMs. While `dstack` typically supports VM-based compute providers via [dedicated backends](https://dstack.ai/docs/concepts/backends#vm-based) that automate provisioning, Crusoe does not yet have [such a backend](https://github.com/dstackai/dstack/issues/3378). As a result, to use a VM-based Crusoe cluster with `dstack`, you should use [SSH fleets](https://dstack.ai/docs/concepts/fleets#ssh-fleets).
 
-!!! info "Prerequsisites"
-    1. Go to `Compute`, then `Instances`, and click `Create Instance`. Make sure to select the right instance type and VM image (that [support interconnect](https://docs.crusoecloud.com/networking/infiniband/managing-infiniband-networks/index.html)). Make sure to create as many instances as needed.
+### Create instances
 
-### Create a fleet
+1. Go to `Compute`, then `Instances`, and click `Create Instance`. Make sure to select the right instance type and VM image (that [support interconnect](https://docs.crusoecloud.com/networking/infiniband/managing-infiniband-networks/index.html)). Make sure to create as many instances as needed.
+
+### Create a `dstack` fleet
 
 Follow the standard instructions for setting up an [SSH fleet](https://dstack.ai/docs/concepts/fleets/#ssh-fleets):
 
@@ -115,9 +117,9 @@ $ dstack apply -f crusoe-fleet.dstack.yml
 
 Once the fleet is created, you can run [dev environments](https://dstack.ai/docs/concepts/dev-environments), [tasks](https://dstack.ai/docs/concepts/tasks), and [services](https://dstack.ai/docs/concepts/services).
 
-## Run NCCL tests
+## NCCL tests
 
-Use a [distributed task](https://dstack.ai/docs/concepts/tasks#distributed-task) that runs NCCL tests to validate cluster network bandwidth.
+Use a [distributed task](https://dstack.ai/docs/concepts/tasks#distributed-tasks) that runs NCCL tests to validate cluster network bandwidth.
 
 === "Crusoe Managed Kubernetes"
 
@@ -253,9 +255,9 @@ Provisioning...
 
 nccl-tests provisioning completed (running)
 
-#                                                              out-of-place                       in-place
-#       size         count      type   redop    root     time   algbw   busbw  #wrong     time   algbw   busbw  #wrong
-#        (B)    (elements)                               (us)  (GB/s)  (GB/s)             (us)  (GB/s)  (GB/s)
+out-of-place                       in-place
+        size         count      type   redop    root     time   algbw   busbw  #wrong     time   algbw   busbw  #wrong
+         (B)    (elements)                               (us)  (GB/s)  (GB/s)             (us)  (GB/s)  (GB/s)
            8             2     float     sum      -1    27.70    0.00    0.00       0    29.82    0.00    0.00       0
           16             4     float     sum      -1    28.78    0.00    0.00       0    28.99    0.00    0.00       0
           32             8     float     sum      -1    28.49    0.00    0.00       0    28.16    0.00    0.00       0
@@ -285,8 +287,8 @@ nccl-tests provisioning completed (running)
    536870912     134217728     float     sum      -1  5300.49  101.29  189.91       0  5314.91  101.01  189.40       0
   1073741824     268435456     float     sum      -1  10472.2  102.53  192.25       0  10485.6  102.40  192.00       0
   2147483648     536870912     float     sum      -1  20749.1  103.50  194.06       0  20745.7  103.51  194.09       0
-# Out of bounds values : 0 OK
-# Avg bus bandwidth    : 53.7387
+  Out of bounds values : 0 OK
+  Avg bus bandwidth    : 53.7387
 ```
 
 </div>
 
@@ -5,18 +5,17 @@ description: Setting up Lambda clusters using Kubernetes or 1-Click Clusters wit
 
 # Lambda
 
-[Lambda](https://lambda.ai/) offers two ways to use clusters with a fast interconnect:
+`dstack` allows using Lambda clusters with fast interconnect via two ways:
 
-* [Kubernetes](#kubernetes) – Lets you interact with clusters through the Kubernetes API and includes support for NVIDIA GPU operators and related tools.
-* [1-Click Clusters (1CC)](#1-click-clusters) – Gives you direct access to clusters in the form of bare-metal nodes.
-
-Both options use the same underlying networking infrastructure. This example walks you through how to set up Lambda clusters to use with `dstack`.
+* [Kubernetes](#kubernetes) – If you create a Kubernetes cluster on Lambda and configure a `kubernetes` backend and create a backend fleet in `dstack`, `dstack` lets you fully use this cluster through `dstack`.
+* [VMs](#vms) – If you create a 1CC cluster on Lambda and create an SSH fleet in `dstack`, `dstack` lets you fully use this cluster through `dstack`.
 
 ## Kubernetes
 
-!!! info "Prerequsisites"
-    1. Follow the instructions in [Lambda's guide](https://docs.lambda.ai/public-cloud/1-click-clusters/managed-kubernetes/#accessing-mk8s) on accessing MK8s.
-    2. Go to `Firewall` → `Edit rules`, click `Add rule`, and allow ingress traffic on port `30022`. This port will be used by the `dstack` server to access the jump host.
+### Prerequsisites
+
+1. Follow the instructions in [Lambda's guide](https://docs.lambda.ai/public-cloud/1-click-clusters/managed-kubernetes/#accessing-mk8s) on accessing MK8s.
+2. Go to `Firewall` → `Edit rules`, click `Add rule`, and allow ingress traffic on port `30022`. This port will be used by the `dstack` server to access the jump host.
 
 ### Configure the backend
 
@@ -75,8 +74,9 @@ Once the fleet is created, you can run [dev environments](https://dstack.ai/docs
 
 Another way to work with Lambda clusters is through [1CC](https://lambda.ai/1-click-clusters). While `dstack` supports automated cluster provisioning via [VM-based backends](https://dstack.ai/docs/concepts/backends#vm-based), there is currently no programmatic way to provision Lambda 1CCs. As a result, to use a 1CC cluster with `dstack`, you must use [SSH fleets](https://dstack.ai/docs/concepts/fleets).
 
-!!! info "Prerequsisites"
-    1.  Follow the instructions in [Lambda's guide](https://docs.lambda.ai/public-cloud/1-click-clusters/) on working with 1-Click Clusters
+### Prerequsisites
+
+1.  Follow the instructions in [Lambda's guide](https://docs.lambda.ai/public-cloud/1-click-clusters/) on working with 1-Click Clusters
 
 ### Create a fleet
 
@@ -171,11 +171,11 @@ $ dstack apply -f lambda-nccl-tests.dstack.yml
 Provisioning...
 ---> 100%
 
-# nccl-tests version 2.17.6 nccl-headers=22602 nccl-library=22602
-# Collective test starting: all_reduce_perf
-#
-#       size         count      type   redop    root     time   algbw   busbw  #wrong     time   algbw   busbw  #wrong
-#        (B)    (elements)                               (us)  (GB/s)  (GB/s)             (us)  (GB/s)  (GB/s)
+  nccl-tests version 2.17.6 nccl-headers=22602 nccl-library=22602
+  Collective test starting: all_reduce_perf
+
+        size         count      type   redop    root     time   algbw   busbw  #wrong     time   algbw   busbw  #wrong
+         (B)    (elements)                               (us)  (GB/s)  (GB/s)             (us)  (GB/s)  (GB/s)
            8             2     float     sum      -1    36.50    0.00    0.00       0    36.16    0.00    0.00       0
           16             4     float     sum      -1    35.55    0.00    0.00       0    35.49    0.00    0.00       0
           32             8     float     sum      -1    35.49    0.00    0.00       0    36.28    0.00    0.00       0
@@ -205,8 +205,8 @@ Provisioning...
    536870912     134217728     float     sum      -1  1625.63  330.25  619.23       0  1687.31  318.18  596.59       0
   1073741824     268435456     float     sum      -1  2972.25  361.26  677.35       0  2971.33  361.37  677.56       0
   2147483648     536870912     float     sum      -1  5784.75  371.23  696.06       0  5728.40  374.88  702.91       0
-# Out of bounds values : 0 OK
-# Avg bus bandwidth    : 137.179
+  Out of bounds values : 0 OK
+  Avg bus bandwidth    : 137.179
 ```
 
 </div>