Skip to content

[Blog] Model inference with Prefill-Decode disaggregation #6480

[Blog] Model inference with Prefill-Decode disaggregation

[Blog] Model inference with Prefill-Decode disaggregation #6480

Triggered via pull request February 19, 2026 15:31
Status Success
Total duration 8m 43s
Artifacts 7

build.yml

on: pull_request
compute-version
5s
compute-version
build-docs  /  build-docs
1m 12s
build-docs / build-docs
build-artifacts  /  code-lint
1m 30s
build-artifacts / code-lint
build-artifacts  /  frontend-build
9s
build-artifacts / frontend-build
Matrix: build-artifacts / runner-test
build-artifacts  /  gateway-build
15s
build-artifacts / gateway-build
build-artifacts  /  generate-json-schema
21s
build-artifacts / generate-json-schema
Matrix: build-artifacts / python-test
build-artifacts  /  python-build
24s
build-artifacts / python-build
Matrix: build-artifacts / runner-compile
upload-pre-pypi-artifacts  /  upload-pre-pypi-artifacts
18s
upload-pre-pypi-artifacts / upload-pre-pypi-artifacts
upload-post-pypi-artifacts  /  upload-post-pypi-artifacts
19s
upload-post-pypi-artifacts / upload-post-pypi-artifacts
Fit to window
Zoom out
Zoom in

Artifacts

Produced during runtime
Name Size Digest
dstack-gateway Expired
3.16 KB
sha256:d6e0b11383e724caf24ed9d97e18c5d54c467e903cd3283acc79f741811b0761
dstack-runner-linux-amd64 Expired
18.6 MB
sha256:c2a98d42a932252ea5e72b38191532cbb1557d754836d60bc5e026c867605a13
dstack-runner-linux-arm64 Expired
17.1 MB
sha256:cc5ed12b524a3caab63230217736ad2b0e81d77647c1726c5b1d083163be2bcf
frontend-build Expired
14.7 MB
sha256:67d3d67a6b39ec9736b3a02caa15aede5b5b7523b136dc115d2d35c4c1b0ddc9
json-schemas Expired
14.1 KB
sha256:418fda56cb2bed1f6bda421cfbf92dbaa74b7e4f4feb4042e41a2522283639b2
python-build Expired
60.4 MB
sha256:1997b31a45f8dbd4e2f8b954e7d1eeebf72b677ba1b11798811931639a36f9aa
site Expired
48.6 MB
sha256:7b619c1255fd9ca082b079aaa0796bcc4a9ba9bfee897472f1195ed26105a96c