Two-track guide (Pods and Serverless) with three-tab code examples showing RunPod API, Vast CLI, and Vast REST API side-by-side. Covers instance creation, Docker config, networking, storage, logs, lifecycle management, PyWorker migration, and a full API/CLI reference table.
The serverless migration section previously led with a PyWorker vs RunPod handler code comparison, making it look like writing custom PyWorker code was required. Now it leads with pre-built templates (vLLM, TGI, ComfyUI), adds a "Calling Your Endpoint" section with SDK client code from the official quickstart, and links to PyWorker docs for advanced users only.
```bash Vast API
# Create an endpoint
curl -X POST "https://console.vast.ai/api/v0/endptjobs/" \
  -H "Authorization: Bearer $VAST_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "endpoint_name": "my-llm-endpoint",
    "max_workers": 5,
    "cold_workers": 1,
    "target_util": 0.9
  }'

# Create a workergroup
curl -X POST "https://console.vast.ai/api/v0/workergroups/" \
  -H "Authorization: Bearer $VAST_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "endpoint_name": "my-llm-endpoint",
    "template_hash": "<TEMPLATE_HASH>",
    "gpu_ram": 24
  }'
```
</CodeGroup>
> **Review comment:** We should not be including the API in guides.
## Migrating from Serverless
RunPod **Serverless** lets you deploy a handler function that scales to zero: you send a request, RunPod spins up a worker, runs your handler, and tears it down. You pay per second of compute, not for idle GPUs.

> **Review comment:** Why are we advertising why RunPod is better?
Vast **Serverless** serves the same purpose (autoscaling inference without managing instances), but the architecture is different. Instead of wrapping a handler function, you pick a pre-built template (vLLM, TGI, ComfyUI) and Vast runs it behind a managed proxy that handles routing, queueing, and autoscaling. For most migrations, no custom code is needed.

> **Review comment:** The serverless templates are just that: templates. By no means is the Vast Serverless platform fundamentally built around them; they mostly serve as examples. We expect people to implement their own templates and API wrappers and to configure their own endpoints. Yes, you can use the templates out of the box, but that is more akin to using RunPod's pre-built templates. You are absolutely implementing handlers for Vast Serverless, just in a different way than on RunPod.
- Replace cloud.vast.ai/search/ → cloud.vast.ai/create/ (3 occurrences)
- Replace "25–50% discount/savings" → "up to 50%" to match official docs

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
> @wbrennan899 @LucasArmandVast I will get more eyeballs on this
1. **Your existing Runpod images will likely work as-is.** Most Runpod-compatible Docker images run on Vast with minimal or no modification.
2. **Often cheaper for the same GPU.** Marketplace competition drives prices down. You'll frequently find the same hardware at lower rates than fixed-tier providers.
3. **You pick the individual machine, not just the GPU type.** Every offer shows reliability score, network speed, CPU, location, and other critical specs. Two A100s at the same price can be very different machines. Vast gives you the data to choose the right one.
4. **Bandwidth is metered.** Runpod includes free bandwidth; on Vast, egress is charged per GB at a rate shown on each offer (typically much lower than AWS).
5. **Set your disk size right at launch.** Resizing requires recreating the container. Storage is cheap, so err on the side of more space.
> **Review comment:** Vast charges for both inbound and outbound data transfers. Users will often pull large models, so we should be very transparent here.
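The reviewer's point about metered transfer in both directions is easy to make concrete. A back-of-envelope sketch; the per-GB rate below is a hypothetical placeholder, since the real rate is shown on each offer:

```python
# Rough transfer-cost estimate for a migration. The rate is a placeholder;
# per-GB pricing varies by offer, so check the rate on each Vast offer card.
def transfer_cost(gb_in: float, gb_out: float, rate_per_gb: float) -> float:
    """Vast meters both inbound (e.g. pulling model weights) and outbound data."""
    return (gb_in + gb_out) * rate_per_gb

# Example: pull a 140 GB model once, serve 20 GB of responses,
# at a hypothetical $0.02/GB.
cost = transfer_cost(140, 20, 0.02)
print(f"${cost:.2f}")  # -> $3.20
```

The inbound term matters most at launch: a single large model pull can dominate the first day's transfer bill.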
| Pod | Instance | Docker container with exclusive GPU access |
| Serverless Endpoint -> Worker | Serverless Endpoint -> Workergroup -> Worker | Vast has managed autoscaling inference; see [Migrating from Serverless](#migrating-from-serverless) |
| Template | Template / Docker image | Specify a Docker image and configuration at launch |
| Network Volume | (Local) Volume | Vast volumes are currently local to one machine, not network-portable; see [Storage](#storage) |
> **Review comment:** Our volumes are more closely related to RunPod's volume disk. The benefit is attachment to any one of many GPUs in a single node.
| Hub | [Model Library](/documentation/serverless/getting-started-with-serverless) + [Template Library](/documentation/templates/introduction) | Vast has official templates for specific models in addition to base templates for lower-level control |
> **Review comment:** "Official templates for many popular inference engines and applications, along with specific model configs through the Model Library," or similar?
If you have a working Runpod template, you likely already have a Docker image that works on Vast. Most Runpod-compatible images run as-is: just specify the image in the `--image` flag.

To minimize cold start times:

- Use **Vast base images**, which are pre-cached on many hosts
> **Review comment:** Needs further explanation and a link to the base-image GitHub/Docker Hub repos; the docs there should help users.
Both platforms provide proxy access to services. On Runpod, proxy URLs are static: `https://<POD_ID>-<PORT>.proxy.runpod.net`. On Vast, there are two proxy mechanisms:

- **HTTP/HTTPS proxy**: instances using [Vast base images](https://github.com/vast-ai/base-image/) get auto-generated Cloudflare tunnel URLs (`https://four-random-words.trycloudflare.com`) per open port via the [Instance Portal](/documentation/instances/connect/instance-portal).
> **Review comment:** Cloudflare tunnels are best-effort and may not always be available. Users can configure their instance to use the built-in Jupyter cert to ensure TLS.
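The difference between the two URL schemes discussed above is that the Runpod one can be derived client-side, while Vast tunnel hostnames are randomly generated and must be read from the Instance Portal. A tiny illustrative helper; the pod ID is a made-up placeholder:

```python
# Builds a Runpod-style static proxy URL from its two inputs.
# Vast tunnel URLs have no equivalent constructor: the
# "four-random-words" hostname is assigned server-side per port.
def runpod_proxy_url(pod_id: str, port: int) -> str:
    return f"https://{pod_id}-{port}.proxy.runpod.net"

print(runpod_proxy_url("abc123xyz", 8888))
# -> https://abc123xyz-8888.proxy.runpod.net
```

Migration scripts that template Runpod URLs this way need to switch to reading the tunnel URL from the instance's metadata or portal instead.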
Runpod **Serverless** lets you deploy a handler function that scales to zero: you send a request, Runpod spins up a worker, runs your handler, and tears it down. You pay per second of compute, not for idle GPUs.

Vast **Serverless** delivers autoscaling inference at marketplace rates: no usage tiers, no hidden surcharges, just per-second billing across 68+ GPU types globally. Rather than wrapping a handler function, you select a pre-built template (vLLM, TGI, ComfyUI) and Vast handles routing, queueing, and autoscaling automatically.

**Pricing:** Runpod charges a premium for Serverless GPU time on top of the base instance cost. On Vast, Serverless workers run on the same marketplace instances you'd rent directly: you pay the same rate, just with autoscaling on top.
> **Review comment:** Just want to point out that having "—" everywhere makes it very obvious that this text was LLM-generated. Even if it wasn't, people will assume it was. That's not necessarily a problem, but it feels a bit unprofessional to me. Just my opinion, though.
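The pricing contrast in the hunk above comes down to whether a premium multiplier is applied to GPU time. A back-of-envelope sketch; the hourly rate and premium are hypothetical placeholders, not real prices from either platform:

```python
# Compares per-second GPU billing with and without a serverless premium.
# All numbers are illustrative placeholders.
def job_cost(seconds: float, hourly_rate: float, premium: float = 0.0) -> float:
    """Cost of `seconds` of GPU time at `hourly_rate` $/hr, with an
    optional multiplicative serverless premium (e.g. 0.30 = +30%)."""
    return seconds / 3600 * hourly_rate * (1 + premium)

base = job_cost(900, 1.20)             # 15 min at a $1.20/hr marketplace rate
marked_up = job_cost(900, 1.20, 0.30)  # same GPU time with a 30% premium
print(f"{base:.3f} vs {marked_up:.3f}")  # -> 0.300 vs 0.390
```

At scale the multiplier is the whole story: the same burst of inference traffic costs `1 + premium` times as much wherever a serverless surcharge applies.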
## Summary

Adds a comprehensive migration guide for users moving GPU workloads from RunPod to Vast.ai. The guide covers the full surface area: account setup, instance management (Pods → Instances), serverless migration, networking, and API/CLI reference.

## What's included

- `docs.json` under Examples → Migrations

## Revisions (post-review)

- Removed `Vast API` curl blocks from tutorial sections; users are nudged toward the `vastai` CLI and Python SDK instead.
- CodeGroups with only one remaining block were unwrapped.
- A note at the API reference table directs API users to the reference docs.