Name	Name	Last commit message	Last commit date
Latest commit History 266 Commits
.azdo/pipelines	.azdo/pipelines
.vscode	.vscode
assets	assets
bicep/infra	bicep/infra
guides	guides
scripts	scripts
shared	shared
src	src
.env.template	.env.template
.gitignore	.gitignore
LICENSE	LICENSE
README.md	README.md
azure.yaml	azure.yaml

🏰 Citadel Governance Hub

Enterprise AI Landing Zone
A comprehensive solution accelerator for governing, observing, and accelerating AI deployments at scale with unified security, compliance, and intelligent orchestration.

🎯 Transform Your AI Strategy

62% of practitioners cite security concerns as the top blocker to wider AI adoption
71% of enterprises struggle to track AI usage, enforce quotas, and report costs
47% of organizations require explicit guardrails before deploying AI agents
70% of customers need an AI registry for agents and tools to scale AI adoption

AI Citadel Governance Hub turns these challenges into platform strengths — governed access, transparent consumption, defensible guardrails, and a shared catalog of reusable AI capabilities.

🚀 What is AI Citadel Governance Hub?

Citadel Governance Hub is an enterprise-grade AI landing zone that establishes a centralized, governable, and observable control plane for all AI service consumption across multiple teams, use cases, and environments.

Instead of fragmented, unmonitored, master-key model access, Citadel Governance Hub provides a unified AI gateway pattern built on Azure API Management (APIM), adding:

✅ Intelligent routing and load balancing
✅ Security enforcement and compliance guardrails
✅ Usage analytics and cost attribution
✅ AI registry for agents, tools, and services
✅ Automated onboarding and governance workflows

This elevates AI consumption from ad-hoc experimentation to scalable, auditable, and cost-attributable platform capability.

🏛️ Three Pillars of AI Citadel

AI Citadel Governance Hub is built on three foundational pillars that address enterprise AI adoption challenges end-to-end:

1️⃣ Governance & Security Pillar – Trustworthy AI Operations at Scale

🔐 Why Governance Matters: Without centralized AI governance, organizations face unpredictable costs, reliability issues, security risks, developer friction, and compliance nightmares. AI Citadel fixes this by building guardrails into every AI call.

Key Capabilities:

Capability	Description
🚪 Unified AI Gateway	Central entry point (APIM) for all AI requests with consistent policy enforcement
🗝️ Managed Credentials	Gateway-keys pattern replaces master API keys with scoped, revocable tokens and support for JWT tokens
🛡️ Policy Enforcement	Granular access control, rate/token limiting, token quotas, and traffic mediation
🌐 Multi-Cloud Support	Govern Azure OpenAI, open-source models, third-party models under one umbrella
🛡️ AI Content Safety	Built-in Azure AI Content Safety with prompt shields, harmful content detection, and protected content checks
📊 Cost Governance	Centralized logging, usage tracking, and cost attribution by team/application
📘 AI Registry	Unified catalog for LLMs, AI tools (through Model Context Protocol (MCP)), and agents
🔒 Data Security	Built-in support for PII detection in addition to Microsoft Purview integration for sensitivity labels and data governance

2️⃣ Observability & Compliance Pillar – End-to-End Monitoring, Evaluation & Trust

📊 Full Visibility = Trust & Confidence: AI Citadel provides holistic observability through a dual-layer approach ensuring teams can debug issues, assure quality, and govern compliance in real-time.

🏗️ Platform-Level Observability

As part of AI Citadel Governance Hub, a centralized monitoring without requiring agent code changes provides:

Feature	Description
📊 Central Application Performance Monitoring	Azure Monitor and Application Insights for infrastructure metrics and system health
📈 Usage Tracking	Token consumption, request volumes, cost allocation by team/use case/agent
🔍 Centralized AI Evaluation	Automated quality evaluations (groundedness, relevance, coherence, safety) without code changes
🚨 Enterprise Alerts	Configurable alerts with automated remediation and compliance reporting

Platform observability is enabled out-of-the-box for all AI workloads routing through the AI Citadel Governance Hub.

3️⃣ AI Development Velocity Pillar – Accelerating Innovation with Templates & Tools

🚀 Build Fast, Build Right: AI Citadel Governance Hub support integrating existing agents and tools in addition to support integrating new agents which enable teams to experiment and innovate quickly without sacrificing governance or quality.

AI Citadel Governance Hub provides automatable agent onboarding configurations through Citadel Access & Publish Contracts along with reusable blueprints and templates for common AI patterns.

Key Capabilities:

Capability	Description
🚀 Citadel Access Contract	Govern the required access to LLMs and centrally managed tools and agents
🤖 Citadel Publish Contract	Provide the ability to publish agents and tools on AI Citadel Governance Hub
📘 Citadel AI Registry	Central catalog for discovering, managing, and reusing AI assets across the enterprise
🔄 DevOps Integration	Automate and source control both access and public AI Citadel Contracts

🎯 Key Use Cases

Citadel Governance Hub enables secure, scalable AI deployment across diverse enterprise scenarios:

💼 Enterprise AI Governance

Centralized access control for all AI services across departments
Cost attribution and chargeback to business units
Compliance reporting and audit trails
Shadow AI prevention and policy enforcement

🤖 Multi-Agent Systems

Discover and reuse agents through the AI Registry
Govern agent-to-agent communication
Monitor multi-agent workflows end-to-end
Enforce safety guardrails across agent interactions

🌐 Multi-Cloud AI Strategy

Unified governance across Azure OpenAI, AWS Bedrock, and open-source models
Consistent security policies regardless of backend
Seamless migration and failover between providers
Cost optimization through intelligent routing

🔒 Regulated Industries

Financial services compliance (SOC 2, PCI DSS)
Healthcare data protection (HIPAA)
Government security requirements (FedRAMP)
PII detection and anonymization

📊 AI Operations at Scale

Support thousands of concurrent AI applications
Near real-time usage monitoring and alerts
Capacity planning and quota management
Performance optimization and troubleshooting

🏗️ Architecture Overview

AI Citadel Governance Hub follows a hub-spoke architecture that integrates seamlessly with your existing enterpriseAzure Landing Zone:

Networking approach

Part of the hub network

In this approach, the Citadel Governance Hub is deployed within the existing hub virtual network (VNet) of your Azure Landing Zone.

This allows for direct communication between the unified AI gateway and connected agentic spokes, leveraging existing security and networking configurations.

%%{init: {
  'theme': 'base',
  'themeVariables': {
    'primaryColor': '#0078d4',
    'primaryTextColor': '#fff',
    'primaryBorderColor': '#0078d4',
    'lineColor': '#8a8886',
    'secondaryColor': '#50e6ff',
    'tertiaryColor': '#f3f2f1',
    'noteBkgColor': '#fef9e7',
    'noteTextColor': '#323130',
    'noteBorderColor': '#d4a300',
    'actorBkg': '#0078d4',
    'actorBorder': '#005a9e',
    'actorTextColor': '#fff',
    'actorLineColor': '#8a8886',
    'signalColor': '#323130',
    'signalTextColor': '#323130',
    'labelBoxBkgColor': '#e1dfdd',
    'labelBoxBorderColor': '#605e5c',
    'labelTextColor': '#323130',
    'loopTextColor': '#323130',
    'activationBorderColor': '#0078d4',
    'activationBkgColor': '#deecf9',
    'sequenceNumberColor': '#fff'
  }
}}%%
sequenceDiagram
    autonumber
    
    participant Agent as 🤖 Spoke Agent<br/>(Spoke Network)
    participant Gateway as 🚪 AI Gateway<br/>(Hub Network)
    participant Backends as ⚡ AI Backends<br/>(Hub/Spoke Network*)

    Note over Agent,Backends: 🔷 Hub-Based Governance Architecture
    
    rect rgba(0, 120, 212, 0.1)
        Note right of Agent: Step 1: Request Initiation
        Agent->>+Gateway: AI Request with Auth Token
    end
    
    rect rgba(80, 230, 255, 0.1)
        Note over Gateway: Step 2: Governance & Security Enforcement
        Gateway->>+Backends: Routed Request to LLM/Agent/Tool
    end
    
    rect rgba(16, 124, 16, 0.1)
        Note over Backends: Step 3: AI Processing & Response
        Backends-->>-Gateway: AI Response + Telemetry
    end
    
    rect rgba(147, 51, 234, 0.1)
        Note over Gateway: Step 4: Response Validation & Logging
        Gateway-->>-Agent: Governed AI Response
    end
    
    Note over Agent,Backends: ✅ Complete observability with zero agent-side instrumentation

Note: When AI Backends reside in a different spoke networks, their traffic should be forced through the hub firewall to maintain integrity of the network traffic flow.*

Traffic Flow

Routed requests originate from spoke-hosted agents.
Traffic is directly forwarded to AI Gateway for governance, security, and observability enforcement.
Traffic intelligently routed out to managed LLMs, tools, or downstream agents (gateway-spoke-network).

Part of spoke network

In this approach, the Citadel Governance Hub is deployed within a dedicated spoke VNet that connects to the hub VNet via VNet peering.

Agentic workloads in other spokes are routed first to the hub network firewall through direct peering, then forwarded to the Citadel Governance Hub gateway network.

This provides an additional layer of isolation for AI workloads while still enabling secure communication with other enterprise resources in the hub.

%%{init: {
  'theme': 'base',
  'themeVariables': {
    'primaryColor': '#0078d4',
    'primaryTextColor': '#fff',
    'primaryBorderColor': '#0078d4',
    'lineColor': '#8a8886',
    'secondaryColor': '#d13438',
    'tertiaryColor': '#f3f2f1',
    'noteBkgColor': '#fef9e7',
    'noteTextColor': '#323130',
    'noteBorderColor': '#d4a300',
    'actorBkg': '#0078d4',
    'actorBorder': '#005a9e',
    'actorTextColor': '#fff',
    'actorLineColor': '#8a8886',
    'signalColor': '#323130',
    'signalTextColor': '#323130',
    'labelBoxBkgColor': '#e1dfdd',
    'labelBoxBorderColor': '#605e5c',
    'labelTextColor': '#323130',
    'loopTextColor': '#323130',
    'activationBorderColor': '#0078d4',
    'activationBkgColor': '#deecf9',
    'sequenceNumberColor': '#fff'
  }
}}%%
sequenceDiagram
    autonumber
    
    participant Agent as 🤖 Spoke Agent<br/>(Agent Spoke Network)
    participant Firewall as 🛡️ Hub Firewall<br/>(Hub Network)
    participant Gateway as 🚪 AI Gateway<br/>(Gateway Spoke Network)
    participant Backends as ⚡ AI Backends<br/>(Spoke Network*)

    Note over Agent,Backends: 🔷 Spoke-Based Governance with Firewall Isolation
    
    rect rgba(0, 120, 212, 0.1)
        Note right of Agent: Step 1: Request Initiation
        Agent->>+Firewall: AI Request via Peering
    end
    
    rect rgba(209, 52, 56, 0.1)
        Note over Firewall: Step 2: Network Security Inspection
        Firewall->>+Gateway: Forward Approved Traffic
    end
    
    rect rgba(80, 230, 255, 0.1)
        Note over Gateway: Step 3: AI Governance Layer
        Gateway->>+Backends: Governed Request to LLM/Agent/Tool
    end
    
    rect rgba(16, 124, 16, 0.1)
        Note over Backends: Step 4: AI Processing
        Backends-->>-Gateway: AI Response + Telemetry
    end
    
    rect rgba(147, 51, 234, 0.1)
        Note over Gateway: Step 5: Response Validation
        Gateway-->>-Firewall: Validated Response
    end
    
    rect rgba(209, 52, 56, 0.1)
        Note over Firewall: Step 6: Egress Inspection
        Firewall-->>-Agent: Secured AI Response
    end
    
    Note over Agent,Backends: 🔒 Defense-in-depth with dual security layers<br/>✅ Network isolation + AI governance

Note: * When AI Backends reside in a different spoke networks, their traffic should be forced through the hub firewall to maintain integrity of the network traffic flow.*

Traffic isolation flow

Routed requests originate from spoke-hosted agents (agent-spoke-network).
Traffic first routed to hub network firewall for inspection (hub-network).
Hub Firewall forwards to AI Gateway for governance, security, and observability enforcement (gateway-spoke-network).
Traffic intelligently routed out to managed LLMs, tools, or downstream agents (through the hub firewall or directly).
AI Backend responses may still be routed through the hub firewall for final inspection before reaching spoke agents, depending on governance policy.

🎯 Citadel Governance Hub - Central Control Plane

The central governance layer with unified AI Gateway that all AI workloads route through.

Core Components

Component	Purpose	Enterprise Features
🚪 API Management	Unified AI gateway	LLM governance, AI resiliency, AI registry gateway
📘 API Center	Universal AI Registry	Discovery of available AI tools, agents and AI services
🔍 Microsoft Foundry	Platform Observability	Platform AI Evaluations & Compliance reports
📊 Log Analytics	Logs, metrics & audits	Scalable enterprise telemetry ingestion and storage
📊 Application Insights	Platform monitoring	Performance dashboards, automated alerts
📨 Event Hub	Usage data streaming	Real-time usage streaming, custom logging
🗄️ Cosmos DB	Usage analytics	Long-term storage of usage, automatic scaling
⚡ Logic App	Event processing	Workflow-based processing of usage/logs & AI Eval
🔐 Managed Identity	Zero-credential auth	Secure service-to-service communication
🔗 Virtual Network	Private connectivity	BYO-VNET support, private endpoints

Security & Compliance

AI Gateway security & compliance enforcements components:

Component	Purpose	Enterprise Features
🛡️ Content Safety	LLM protection	Prompt Shield and Content Safety protections
💳 Language Service	PII detection	Natural language and RegEx based PII entity detection with anonymization support

Supported by subscription wide security services:

Component	Purpose	Enterprise Features
Defender for Cloud	Threat protection	AI workload security posture management
Purview	Data governance	Sensitivity labeling, data classification
Entra ID	Identity & access management	Zero Trust architecture, conditional access

AI Services

Optionally you can deploy one or more generative AI services in the hub:

Component	Purpose	Enterprise Features
Microsoft Foundry	LLM model hosting	Access to rich foundational model catalog with variety of deployment options

Optional Components

Pluggable components to enhance AI Citadel Governance capabilities:

Component	Purpose
Azure Managed Redis	Semantic caching layer for high-throughput AI workloads

🌐 Citadel Compliant Agents - Existing and new agents on-boarding

To govern AI agents through AI Citadel Governance Hub, agents must communicate with AI backends (central LLMs, tools and agents) through the Citadel's unified AI gateway.

Existing agents

Guidance to bring existing agents is through updating endpoint and credentials to access central LLMs, tools and agents through the unified gateway.

Recommendation is to use Azure Key Vault to store these information due to its sensitivity when the agent is running on Azure.

Leverage Citadel Access Contracts to declare the required access to LLMs, tools and agents through the gateway along with precise governance policies.

New agents

Building new agents is accelerated through the Citadel Agent Spoke landing zone guidance, which provides isolated, secure environments designed specifically for AI agent development and deployment. Each spoke serves a single business unit or major use case, ensuring clear boundaries, simplified management, and integration with the Citadel Governance Hub for centralized governance.

Deployment Approach:

One spoke per business unit or use case - Dedicated environments for insurance claims processing, customer support automation, or other agentic scenarios
Flexible runtime options - Choose between AI Foundry Agents (fully managed runtime) or Azure Container Apps (bring-your-own-agent)
Pre-configured infrastructure - Automated deployment via Bicep or Terraform with all networking, security, and monitoring built-in
Hub integration - Seamless connection to Citadel Governance Hub through Citadel Access & Publish Contracts

Core Infrastructure Components:

Component	Purpose
🤖 Azure AI Foundry	Managed agent runtime with rich SDK, prompt flow orchestration, and native AI Evaluations
📦 Azure Container Apps	Serverless container hosting for custom-built agents with auto-scaling and simplified deployment
🔍 Azure AI Search	Vector and hybrid search for RAG patterns and document indexing
🗄️ Azure Cosmos DB	Distributed NoSQL database for agent state, threads, and multi-agent coordination
💾 Azure Storage	Blob storage for AI Foundry datasets, agent assets, and shared files
🔐 Azure Key Vault	Secure secrets, keys, and certificates with automated rotation
📊 Application Insights	Detailed monitoring, diagnostics, and alerts integrated with platform-level observability
🔒 Virtual Network	Private connectivity with subnets for compute, agents, data, and management

Deployment Patterns:

Greenfield (Standalone with New Resources) - Creates all infrastructure from scratch with new VNet and Log Analytics workspace
Brownfield (Standalone with Existing Resources) - Integrates with existing enterprise landing zones, reusing VNets and centralized monitoring

Note: Citadel Agent Spoke deployment supports the AI development velocity pillar and is designed to work in conjunction with Citadel Governance Hub. Multiple spokes can connect to a single hub for unified governance and observability.

🔄 AI Citadel Contracts - Connect agents to governance hub

Citadel Governance Hub seamlessly integrates with Citadel compliant Agents environments through automated governance alignment:

📝 AI Access Contract

Declares the governed dependencies an agent needs—LLMs, AI services, tools, and reusable agents—along with precise access policies:

Model selection and capacity allocation
Regional preferences and compliance requirements
Safety and security guardrails
Usage quotas and cost limits

📤 AI Publish Contract

Describes the tools and agents a spoke exposes back to the hub:

Publishing rules and governance gates
Ownership metadata and documentation
Security posture and compliance status
Discovery and cataloging in the AI Registry

Benefits:

✅ Audit-ready traceability through infrastructure-as-code
✅ Faster release cycles with automated approvals
✅ Reduced manual effort in governance onboarding
✅ Continuous policy compliance verification

🔗 Learn More: Citadel Access Contracts Guide

📋 Prerequisites

Azure Requirements:

Azure CLI and Azure Developer CLI installed and signed in
A resource group in your target subscription
Owner or Contributor + User Access Administrator permissions on the subscription
All required subscription resource providers registered.

Development Tools: Although it is recommended to have the below tools installed on a local machine or through DevOps agents to conduct the provisioning, you still can leverage Azure Cloud Shell (mounted to storage account) as an alternative which has all the tools pre-installed.

🚀 Quick Deploy

Deploy your Citadel Governance Hub in minutes with Azure Developer CLI:

# Authenticate and setup environment
azd auth login
azd env new citadel-governance-hub-nonprod

# Deploy Citadel Governance Hub
azd up

💡 Tip: Use Azure Cloud Shell to avoid local setup. Review main.bicep configuration before deployment.

✅ Post-Deployment Validation

Once deployed, access your Citadel AI Gateway through Azure API Management:

Key Endpoints:

AI Gateway: https://<your-apim>.azure-api.net
AI Registry: Azure API Center portal
Monitoring Dashboard: Application Insights
Usage Analytics: Power BI Dashboard connected to Cosmos DB (optional)

📚 Comprehensive Documentation

Master Citadel implementation and operations with our detailed guides:

🏗️ Architecture & Deployment

Guide	Description
🆕 Quick Deployment Guide	Fast deployment for non-production environments
🆕 Full Deployment Guide	Comprehensive guide for dev, staging, and production
Parameters Deployment Guide	Comprehensive Bicep parameter file usage
Enterprise Provisioning	Branch-based deployment, CI/CD automation

🔧 AI Service Integration

Guide	Description
LLM-Backend-Onboarding	Add Azure OpenAI instances and models (including Realtime API)

🛡️ Security & Compliance

Guide	Description
PII Detection & Masking	Automated sensitive data protection
Entra ID Authentication	JWT validation and Zero Trust implementation
Citadel Access Contracts	AI Access & Publish Contract specifications

📊 Observability & Analytics

Guide	Description
Power BI Dashboard	Usage analytics and cost allocation dashboards
Throttling Events Monitoring	Real-time 429 error tracking with alerts
Dynamic Throttling Assignment	Intelligent load balancing for PTU models

⚙️ Advanced Capabilities

Guide	Description

🌟 What Makes Citadel Different?

Traditional Approach	Citadel Governance Hub
❌ Direct API key access per team	✅ Centralized gateway with managed credentials
❌ Fragmented monitoring per service	✅ Unified observability across all AI workloads
❌ Manual cost tracking and allocation	✅ Automated usage tracking and charge-back
❌ Inconsistent security policies	✅ Enforced guardrails on every AI call
❌ Shadow AI and governance gaps	✅ Complete visibility and control
❌ Slow onboarding and provisioning	✅ Automated templates and reusable blueprints

🏆 Benefits by Stakeholder

👨‍💼 For CIOs & Business Leaders

Accelerate AI ROI - Deploy AI solutions 10x faster with pre-built templates
Reduce Risk - Enforce compliance and security policies automatically
Control Costs - Precise cost attribution and quota management
Demonstrate Governance - Audit-ready compliance and transparency

👨‍💻 For Developers & Data Scientists

Focus on Innovation - Governance handled by the platform
Self-Service Access - Discover and consume AI services through the registry
Rich Tooling - Support for Copilot Studio, Semantic Kernel, LangChain, AutoGen
Fast Iteration - CI/CD integration with automated testing

🛡️ For Security & Compliance Teams

Zero Trust Architecture - Private endpoints and managed identities throughout
Content Safety - Automatic prompt and response filtering
PII Protection - Detect and redact sensitive data automatically
Audit Trails - Complete logging and trace capabilities

📊 For Operations Teams

Single Pane of Glass - Unified monitoring across all AI workloads
Proactive Alerting - Detect and remediate issues before impact
Performance Insights - Detailed traces and analytics
Capacity Planning - Usage trends and forecasting

🗺️ Roadmap & Evolution

Citadel Governance Hub is continuously evolving as part of the Foundry Citadel Platform vision:

✅ Current Release

Unified AI Gateway with intelligent routing
Platform observability
Universal LLM, Azure OpenAI, Azure OpenAI Realtime, AI Search, Document Intelligence integration
PII detection and content safety
Usage analytics and cost management

🚧 Coming Soon

AI Evaluation pipeline at the gateway level
Add support for A2A and agents publishing
Defender enablement

🔮 Future Vision

Autonomous agent governance and orchestration through DevOps end-to-end approach

🤝 Contributing

We welcome contributions from the community! Whether it's:

🐛 Bug reports and fixes
📖 Documentation improvements
💡 Feature requests and enhancements

Please see our Contributing Guide for details.

📞 Support & Community

🐛 Issues: GitHub Issues
💬 Discussions: GitHub Discussions

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Citadel Governance Hub - Your organization's fortress in the new world of AI

Providing protection, structure, and strength as you scale new heights with enterprise AI

🚀 Deploy Now | 📚 Documentation | 🤝 Contribute

License

Azure-Samples/ai-hub-gateway-solution-accelerator

Folders and files

Latest commit

History

Repository files navigation