0.19.18
Server
Optimized resources processing
This release includes major improvements that allow the dstack server to process more resources in less time. It also lets you scale the processing rate of a single server replica to take advantage of large Postgres instances by setting the DSTACK_SERVER_BACKGROUND_PROCESSING_FACTOR environment variable.
The result is:
- Faster processing rates: provisioning 100 runs on SQLite with default settings went from ~5m to ~2m.
- Better scaling: provisioning an additional 100 runs is even quicker thanks to a warm cache. Previously, it was slower than the first 100 runs.
- Ability to process more runs per server replica: provisioning 300 runs on Postgres with DSTACK_SERVER_BACKGROUND_PROCESSING_FACTOR=4 takes ~4m.
For more details on scaling background processing rates, see the Server deployment guide.
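As a minimal sketch, the variable can be exported in the shell that launches the server. The value 4 is simply the one used in the Postgres benchmark above, not a recommendation; an appropriate factor for a given database is an assumption to check against the Server deployment guide.

```shell
# Scale background processing for a single server replica.
# 4 is the value from the benchmark above; tune it for your own Postgres instance.
export DSTACK_SERVER_BACKGROUND_PROCESSING_FACTOR=4

# Then start the server from the same shell so it inherits the variable:
#   dstack server
```

When the server runs in a container or under a service manager, the same variable would instead be set through that environment's own configuration.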
Backends
Private GCP gateways
It's now possible to create GCP gateways without public IPs:
type: gateway
name: example
domain: gateway.example.com
backend: gcp
region: europe-west9
public_ip: false
certificate: null

Note that configuring HTTPS certificates for private GCP gateways is not yet supported, so you need to specify certificate: null.
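One way to try this out is to save the configuration to a file and apply it with the dstack CLI. The file name gateway.dstack.yml is a hypothetical choice for this sketch, and the domain and region are the placeholder values from the example above.

```shell
# Write the private gateway configuration to a file (the file name is hypothetical).
cat > gateway.dstack.yml <<'EOF'
type: gateway
name: example
domain: gateway.example.com
backend: gcp
region: europe-west9
public_ip: false
certificate: null
EOF

# Then create the gateway from that file:
#   dstack apply -f gateway.dstack.yml
```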
What's Changed
- Ignore SSH keys when calculating fleet conf diff by @un-def in #2869
- [Blog] Refactoring by @peterschmidt85 in #2873
- Implemented frontend precommit linting by @olgenn in #2868
- Support processing more resources per replica by @r4victor in #2871
- Use uvloop by default by @r4victor in #2874
- Add server profiling by @r4victor in #2875
- Fix NVIDIA container toolkit bug in all backends by @jvstme in #2877
- Private GCP gateways by @jvstme in #2881
- Switch to e2-medium for GCP gateways by @jvstme in #2886
Full Changelog: 0.19.17...0.19.18