This document describes how to customize the deployment of the Video-Agents-Foundry-Solution. Set these values before running azd up.
- Customizing AKS Node Pools
- Customizing AI Foundry
- Customizing Resource Names
- Enabling and Disabling Resources
The solution deploys an AKS cluster with four node pools. You can customize the VM sizes for each pool:
During deployment, the preprovision hook provides an interactive menu to select VM sizes with live quota checking. Alternatively, set environment variables before running azd up:
azd env set SYSTEM_VM_SIZE Standard_D4a_v4
azd env set WORKLOAD_VM_SIZE Standard_D32a_v4
azd env set DEEPSTREAM_GPU_VM_SIZE Standard_NC24ads_A100_v4
azd env set INFERENCE_GPU_VM_SIZE Standard_NC24ads_A100_v4Set the Kubernetes version:
azd env set KUBERNETES_VERSION 1.32Set the maximum node count for the DeepStream GPU node pool:
azd env set DEEPSTREAM_GPU_MAX_NODE_COUNT 1By default, the solution provisions an Azure AI Foundry project with a model deployment. You can customize or disable this.
If you don't need the AI Foundry component:
azd env set CREATE_FOUNDRY_PROJECT falseChange the model name (default: gpt-5.2):
azd env set AI_MODEL_NAME gpt-5.2Change the model version (default: 2025-12-11):
azd env set AI_MODEL_VERSION 2025-12-11Change the model deployment capacity:
azd env set AI_MODEL_CAPACITY 1By default, this template uses a naming convention with unique strings to prevent naming collisions. The following environment variables control resource naming via the main.bicepparam file:
| Variable | Description |
|---|---|
AZURE_ENV_NAME |
Environment name prefix (default: azd-foundry-solution) |
AZURE_RESOURCE_GROUP |
Resource group name |
AZURE_LOCATION |
Azure region (default: eastus2) |
To override, run azd env set <key> <value> before running azd up.
By default, the deployment creates role assignments for the current user. To disable:
azd env set CREATE_ROLE_FOR_USER falseTo skip provisioning the AI Foundry project, hub, and model deployment:
azd env set CREATE_FOUNDRY_PROJECT falseAll configurable parameters with their defaults:
| Variable | Default | Description |
|---|---|---|
AZURE_ENV_NAME |
azd-foundry-solution |
Environment name |
AZURE_RESOURCE_GROUP |
(empty - auto-generated) | Resource group name |
AZURE_LOCATION |
eastus2 |
Azure region |
KUBERNETES_VERSION |
1.32 |
AKS Kubernetes version |
SYSTEM_VM_SIZE |
(selected interactively) | System node pool VM |
WORKLOAD_VM_SIZE |
(selected interactively) | Workload node pool VM |
DEEPSTREAM_GPU_VM_SIZE |
(selected interactively) | DeepStream GPU node pool VM |
INFERENCE_GPU_VM_SIZE |
(selected interactively) | Inference GPU node pool VM |
DEEPSTREAM_GPU_MAX_NODE_COUNT |
1 |
Max nodes in DeepStream pool |
CREATE_FOUNDRY_PROJECT |
true |
Deploy AI Foundry |
AI_MODEL_NAME |
gpt-5.2 |
AI model name |
AI_MODEL_VERSION |
2025-12-11 |
AI model version |
AI_MODEL_CAPACITY |
1 |
Model deployment capacity |
CREATE_ROLE_FOR_USER |
true |
Create RBAC role assignments |
MEDIA_STREAMER_ENABLED |
true |
Enable media streamer (RTSP camera + sample agent job in post-provision) |
STORAGE_SKU_NAME |
Standard_LRS |
Storage account SKU (Standard_LRS, Standard_ZRS, Standard_GRS). ZRS availability is checked during preprovision. |