Welcome to the Video-Agents-Foundry-Solution, designed to help businesses leverage AI agents for automating complex video analysts tasks. This solution provides end-to-end framework for deploying AI-powered video analysis at the edge using Azure Video Indexer enabled by Azure Arc, combined with intelligent AI agents for automated decision-making and real-time video insights. The Video-Agents-Foundry-Solution allows users to process and analyze live and recorded video streams at the edge, extract AI insights, including speech transcription, OCR, face detection, object tracking, and action recognition and leverage AI agents to automate complex video analysis workflows with low latency and full data sovereignty.
Note: In order to successfully deploy the VI Extension, you must first submit your subscription for approval and be approved. Please use this application link. this form.
Deploying this solution can take approximately 45-60 minutes end-to-end (15-25 minutes for
azd upto complete, plus 20-35 minutes for GPU drivers and VI extension pods to fully initialize). During deployment, the Azure Portal may show some resources as "Failed" or "Unavailable". This is expected behavior as some resources may take longer to provision or initialize. Please allow up to an hour for all resources to be fully deployed and operational. If you continue to see issues after this time, please refer to the troubleshooting guide in the documentation.
Comprehensive Guides
For a detailed, step-by-step guide on creating an AKS cluster with GPU support and deploying the Video Indexer Arc extension, see:
- AKS-CLUSTER-SETUP.md - Complete setup guide with output examples
- create-aks-cluster.sh - Automated deployment script
The guide includes GPU quota checking, troubleshooting tips, and support for live video processing.
Note: With any AI solutions you create using these templates, you are responsible for assessing all associated risks and for complying with all applicable laws and safety standards. Learn more in the transparency documents for Agent Service and Agent Framework.
The solution leverages Azure Arc Service to extend Azure Video Indexer capabilities to edge environments running on Kubernetes clusters. By deploying the Video Indexer Arc extension on an Arc-enabled AKS cluster with GPU support (NVIDIA), the solution enables real-time video processing close to the data source — reducing latency and ensuring data sovereignty. AI agents built on Azure OpenAI orchestrate the video analysis pipeline, automating tasks such as content moderation, safety monitoring, and business intelligence extraction from video feeds. The agentic architecture allows multiple specialized agents to collaborate on complex video analysis tasks, with each agent handling a specific domain such as speech analysis, visual recognition, or compliance checking.
This solution optionally creates a Microsoft Foundry project and Foundry Tools (enabled by default; set CREATE_FOUNDRY_PROJECT=false to skip). More details about the resources can be found in the resources documentation.
![]() |
|---|
This template creates everything you need to get started with Microsoft Foundry (when CREATE_FOUNDRY_PROJECT=true, the default):
| Resource | Description |
|---|---|
| Azure AI Project | Optional - Provides a collaborative workspace for AI development with access to models, data, and compute resources |
| Azure OpenAI Service | Optional - Powers the AI model. Default model deployed is gpt-5.2, but any Azure AI model that supports tool calling can be specified per the documentation |
| Storage Account | Provides blob storage for application data and file uploads |
| Application Insights | Optional - Provides application performance monitoring, logging, and telemetry for debugging and optimization |
| Log Analytics Workspace | Optional - Collects and analyzes telemetry data for monitoring and troubleshooting |
Note: You should validate model availability in your region before deployment. The default model deployed with this solution is gpt-5.2, but you can customize the model and deployment configurations as needed. For more information on available models and their capabilities, please refer to the Azure OpenAI Service documentation: Foundry Models
- Azure Video Indexer enabled by Arc — Ignite 2025 Preview Blog
- Azure Video Indexer Documentation
- Azure Arc Documentation
- AKS Cluster Setup for Video Indexer Arc Extension
- Azure Video Indexer Samples on GitHub
For solutions to common deployment, container app, and agent issues, see the Troubleshooting Guide.
Learn more about the key features this solution enables
- Edge-Deployed Video Analysis — Run Azure Video Indexer on Arc-enabled Kubernetes clusters at the edge, bringing AI-powered video analysis closer to data sources for low-latency processing.
- Real-Time & Batch Video Processing — Support for both live video pipeline ingestion (RTSP streams) and batch processing of recorded video files.
- Customizable AI Model Deployment: The solution allows users to configure and deploy AI models, such as gpt-5.2, with options to adjust model capacity, deployment configurations, and knowledge retrieval methods.
- Multimodal AI Insights — Extract rich insights from video content including speech transcription, OCR, face detection, object tracking, scene detection, and action recognition.
- AI Agent–Based Automation — Leverage intelligent agents powered by Azure OpenAI to automate complex video analysis workflows, enabling automated decision-making and action triggering.
- Cloud-Managed Edge Deployments — Manage and monitor edge deployments centrally through Azure Arc while processing video data locally.
- Data Sovereignty & Compliance — Process sensitive video data on-premises or at the edge, ensuring data residency and compliance requirements are met.
- Extensible Agentic Architecture — Build custom agent teams for domain-specific video analysis scenarios such as retail analytics, manufacturing quality control, or safety monitoring.
Follow the quick deploy steps on the deployment guide to deploy this solution to your own Azure subscription.
Note: This solution accelerator requires Azure Developer CLI (azd) version 1.18.0 or higher. Please ensure you have the latest version installed before proceeding with deployment. Download azd here.
Click here to launch the deployment guide
⚠️ Important: Check Azure OpenAI Quota Availability
To ensure sufficient quota is available in your subscription, please follow quota check instructions guide before you deploy the solution.
- Azure Subscription — An active Azure subscription. Create one for free.
- VI Arc Extension Approval — Submit your subscription for approval via this form before deploying.
- GPU Quota — Sufficient GPU quota for
Standard_NC4as_T4_v3VMs in your target region (minimum 4 vCPUs for the NCasv3 family). - Azure CLI — Version 2.x or higher. Install Azure CLI.
- kubectl — Kubernetes command-line tool. Install kubectl.
- Helm 3 — Kubernetes package manager. Install Helm.
- Azure Developer CLI (azd) — Version 1.18.0 or higher. Install azd.
Pricing varies per region and usage, so it isn't possible to predict exact costs for your usage. The majority of the Azure resources used in this infrastructure are on usage-based pricing tiers. However, Azure Container Registry has a fixed cost per registry per day.
Use the Azure pricing calculator to calculate the cost of this solution in your subscription. Review a sample pricing sheet for the architecture.
| Product | Description | Cost |
|---|---|---|
| Azure OpenAI Service | Powers the AI agents for task automation | Pricing |
| Azure Container Apps | Hosts the web application frontend | Pricing |
| Azure Cosmos DB | Stores metadata and processing results | Pricing |
| Azure Container Registry | Stores container images for deployment | Pricing |
⚠️ Important: To avoid unnecessary costs, remember to take down your app if it's no longer in use, either by deleting the resource group in the Portal or runningazd down.
![]() |
|---|
Learn more about what value this solution provides
- Real-Time Decision Making — Enable immediate, automated responses to events captured in video feeds, reducing the time from observation to action from hours or days to seconds.
- Reduced Operational Costs — Automate video monitoring and analysis tasks that previously required dedicated staff for manual review, significantly lowering labor costs.
- Edge Data Sovereignty — Process sensitive video data locally at the edge, ensuring compliance with data residency regulations and reducing bandwidth costs from streaming video to the cloud.
- Scalable Intelligence — Deploy consistent AI-powered video analysis across multiple locations (stores, factories, facilities) using a centralized cloud management model via Azure Arc.
- Actionable Insights — Transform raw video feeds into structured, actionable business intelligence through multimodal AI analysis and agent-driven automation.
Learn more about what use cases this solution provides
| Use Case | Persona | Challenges | Summary/Approach |
|---|---|---|---|
| Retail Analytics | Store Manager | Manual monitoring of customer behavior across multiple camera feeds is time-consuming and error-prone | AI agents analyze video feeds in real time for foot traffic patterns, shelf interaction, customer demographics, and queue lengths to optimize store layout and staffing |
| Manufacturing Quality Control | Quality Engineer | Visual inspection of production lines requires constant human attention and is subject to fatigue-related errors | Automated defect detection and quality monitoring via edge-deployed video analysis with AI agents that flag anomalies and trigger alerts in real time |
| Safety & Compliance Monitoring | Safety Officer | Reviewing security footage after incidents is reactive rather than preventive | Real-time safety monitoring with AI agents that detect PPE violations, unauthorized access, hazardous conditions, and automatically generate compliance reports |
| Smart City & Traffic Management | Urban Planner | Collecting and analyzing traffic data across a city requires expensive infrastructure and manual data processing | Edge-deployed video analysis at intersections provides real-time traffic flow data, incident detection, and pedestrian safety monitoring powered by AI agents |
This template also uses Managed Identity for local development and deployment.
To ensure continued best practices in your own repository, we recommend that anyone creating solutions based on our templates ensure that the GitHub secret scanning setting is enabled.
You may want to consider additional security measures, such as:
- Enabling Microsoft Defender for Cloud to secure your Azure resources.
- Protecting the Azure Container Apps instance with a firewall and/or Virtual Network.
Important Security Notice
This template, the application code and configuration it contains, has been built to showcase Microsoft Azure specific services and tools. We strongly advise our customers not to make this code part of their production environments without implementing or enabling additional security features.
For a more comprehensive list of best practices and security recommendations for Intelligent Applications, visit our official documentation.
Have questions, find a bug, or want to request a feature? Submit a new issue on this repo and we'll connect.
Please refer to Transparency FAQ for responsible AI transparency details of this solution accelerator.
This release is an artificial intelligence (AI) system that generates text based on user input. The text generated by this system may include ungrounded content, meaning that it is not verified by any reliable source or based on any factual data. The data included in this release is synthetic, meaning that it is artificially created by the system and may contain factual errors or inconsistencies. Users of this release are responsible for determining the accuracy, validity, and suitability of any content generated by the system for their intended purposes. Users should not rely on the system output as a source of truth or as a substitute for human judgment or expertise.
This release only supports English language input and output. Users should not attempt to use the system with any other language or format. The system output may not be compatible with any translation tools or services, and may lose its meaning or coherence if translated.
This release does not reflect the opinions, views, or values of Microsoft Corporation or any of its affiliates, subsidiaries, or partners. The system output is solely based on the system's own logic and algorithms, and does not represent any endorsement, recommendation, or advice from Microsoft or any other entity. Microsoft disclaims any liability or responsibility for any damages, losses, or harms arising from the use of this release or its output by any user or third party.
This release does not provide any financial advice, legal advice and is not designed to replace the role of qualified client advisors in appropriately advising clients. Users should not use the system output for any financial decisions, legal guidance or transactions, and should consult with a professional financial advisor and or legal advisor as appropriate before taking any action based on the system output. Microsoft is not a financial institution or a fiduciary, and does not offer any financial products or services through this release or its output.
This release is intended as a proof of concept only, and is not a finished or polished product. It is not intended for commercial use or distribution, and is subject to change or discontinuation without notice. Any planned deployment of this release or its output should include comprehensive testing and evaluation to ensure it is fit for purpose and meets the user's requirements and expectations. Microsoft does not guarantee the quality, performance, reliability, or availability of this release or its output, and does not provide any warranty or support for it.
This Software requires the use of third-party components which are governed by separate proprietary or open-source licenses as identified below, and you must comply with the terms of each applicable license in order to use the Software. You acknowledge and agree that this license does not grant you a license or other right to use any such third-party proprietary or open-source components.
To the extent that the Software includes components or code used in or derived from Microsoft products or services, including without limitation Microsoft Azure Services (collectively, "Microsoft Products and Services"), you must also comply with the Product Terms applicable to such Microsoft Products and Services. You acknowledge and agree that the license governing the Software does not grant you a license or other right to use Microsoft Products and Services. Nothing in the license or this ReadMe file will serve to supersede, amend, terminate or modify any terms in the Product Terms for any Microsoft Products and Services.
You must also comply with all domestic and international export laws and regulations that apply to the Software, which include restrictions on destinations, end users, and end use. For further information on export restrictions, visit https://aka.ms/exporting.
You acknowledge that the Software and Microsoft Products and Services (1) are not designed, intended or made available as a medical device(s), and (2) are not designed or intended to be a substitute for professional medical advice, diagnosis, treatment, or judgment and should not be used to replace or as a substitute for professional medical advice, diagnosis, treatment, or judgment. Customer is solely responsible for displaying and/or obtaining appropriate consents, warnings, disclaimers, and acknowledgements to end users of Customer's implementation of the Online Services.
You acknowledge the Software is not subject to SOC 1 and SOC 2 compliance audits. No Microsoft technology, nor any of its component technologies, including the Software, is intended or made available as a substitute for the professional advice, opinion, or judgment of a certified financial services professional. Do not use the Software to replace, substitute, or provide professional financial advice or judgment.
BY ACCESSING OR USING THE SOFTWARE, YOU ACKNOWLEDGE THAT THE SOFTWARE IS NOT DESIGNED OR INTENDED TO SUPPORT ANY USE IN WHICH A SERVICE INTERRUPTION, DEFECT, ERROR, OR OTHER FAILURE OF THE SOFTWARE COULD RESULT IN THE DEATH OR SERIOUS BODILY INJURY OF ANY PERSON OR IN PHYSICAL OR ENVIRONMENTAL DAMAGE (COLLECTIVELY, "HIGH-RISK USE"), AND THAT YOU WILL ENSURE THAT, IN THE EVENT OF ANY INTERRUPTION, DEFECT, ERROR, OR OTHER FAILURE OF THE SOFTWARE, THE SAFETY OF PEOPLE, PROPERTY, AND THE ENVIRONMENT ARE NOT REDUCED BELOW A LEVEL THAT IS REASONABLY, APPROPRIATE, AND LEGAL, WHETHER IN GENERAL OR IN A SPECIFIC INDUSTRY. BY ACCESSING THE SOFTWARE, YOU FURTHER ACKNOWLEDGE THAT YOUR HIGH-RISK USE OF THE SOFTWARE IS AT YOUR OWN RISK.

