<h2 id="sglang-router-integration-and-disaggregated-inference-roadmap"><a class="toclink" href="../sglang-router/">SGLang router integration and disaggregated inference roadmap</a></h2>
<p><a href="https://github.com/dstackai/dstack/">dstack</a> provides a streamlined way to handle GPU provisioning and workload orchestration across GPU clouds, Kubernetes clusters, and on-prem environments. Built for interoperability, dstack bridges diverse hardware and open-source tooling.</p>
<p>As disaggregated, low-latency inference emerges, we aim to ensure this new stack runs natively on <code>dstack</code>. To move this forward, we’re introducing native integration between dstack and <a href="https://docs.sglang.ai/advanced_features/router.html">SGLang’s Model Gateway</a> (formerly known as the SGLang Router).</p>
<h2 id="using-tpus-for-fine-tuning-and-deploying-llms"><a class="toclink" href="../../../tpu-on-gcp/">Using TPUs for fine-tuning and deploying LLMs</a></h2>
<p>If you’re using or planning to use TPUs with Google Cloud, you can now do so via <code>dstack</code>. Just specify the TPU version and the number of cores (separated by a dash) in the <code>gpu</code> property under <code>resources</code>.</p>
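<p>As a minimal sketch of the syntax described above, a <code>dstack</code> task configuration might look like the following (the task name, image commands, and TPU version shown are illustrative, not taken from the original post):</p>
<pre><code class="language-yaml"># Hypothetical dstack task configuration illustrating the TPU syntax:
# the TPU version and the number of cores are joined by a dash in the
# gpu property under resources.
type: task
name: tpu-finetune        # illustrative name

commands:
  - python train.py       # illustrative command

resources:
  gpu: v5litepod-8        # TPU version "v5litepod", 8 cores
</code></pre>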
<p>Read below to find out how to use TPUs with <code>dstack</code> for fine-tuning and deploying LLMs, leveraging open-source tools like Hugging Face’s