Azure Well-Architected Framework¶

Scope¶

Covers the five pillars of the Azure Well-Architected Framework (Reliability, Security, Cost Optimization, Operational Excellence, Performance Efficiency) with Azure-specific checklists and guidance for architecture reviews. Does not cover AWS (see frameworks/aws-well-architected.md) or GCP (see frameworks/gcp-architecture-framework.md) equivalents.

The Azure Well-Architected Framework provides architectural guidance for building high-quality solutions on Microsoft Azure. It is organized into five pillars that serve as the foundation for workload quality across the cloud.

Pillar 1: Reliability¶

Focuses on ensuring a workload performs its intended function correctly and consistently. This includes the ability to recover from failures, meet availability targets, and handle demand changes.

Design Principles¶

Design for business requirements
Design for resilience
Design for recovery
Design for operations
Keep it simple

Checklist¶

Why This Matters¶

Reliability is the most critical pillar because no other quality matters if the system is unavailable. Azure provides a wide array of redundancy and resilience features, but they must be deliberately designed into the architecture. Many outages stem not from Azure platform failures but from application-level design gaps: missing health checks, inadequate retry logic, or untested failover procedures. A reliable system is one that has been designed for failure and tested under failure conditions.

Pillar 2: Security¶

Focuses on protecting the workload from threats, including data, identity, network, and application-level security concerns.

Design Principles¶

Plan resources and how to harden them
Automate and use least privilege
Classify and encrypt data
Monitor and validate continuously
Evolve with the threat landscape

Checklist¶

Why This Matters¶

Azure operates under a shared responsibility model: Microsoft secures the platform, but customers must secure their workloads, data, and identities. The expanding threat landscape means security must be automated and continuous, not a one-time gate. Microsoft Entra ID and Azure Policy provide powerful tools, but misconfiguration remains the leading cause of cloud security incidents. Defense in depth -- layering identity, network, application, and data controls -- is essential.

Pillar 3: Cost Optimization¶

Focuses on reducing unnecessary expenditure and improving operational efficiency while maintaining full workload capability.

Design Principles¶

Develop cost-management discipline
Design with a cost-efficiency mindset
Design for usage optimization
Design for rate optimization
Monitor and optimize over time

Checklist¶

Why This Matters¶

Cloud spending grows organically and can quickly exceed expectations. Azure provides granular billing data and optimization tools, but without organizational discipline, waste accumulates. Common sources of waste include oversized VMs, orphaned disks, always-on dev/test environments, and suboptimal storage tiering. Cost optimization is a continuous practice, not a one-time cleanup. The most effective approach combines technical right-sizing with financial governance and team accountability.

Pillar 4: Operational Excellence¶

Focuses on the processes and practices that keep a workload running in production, including deployment, monitoring, and incident management.

Design Principles¶

Embrace DevOps culture
Establish development standards
Evolve operations with observability
Deploy with confidence
Automate for efficiency

Checklist¶

Why This Matters¶

Operational excellence bridges the gap between development and production. Without it, deployments are risky, incidents drag on, and teams cannot improve. Azure provides extensive monitoring and automation tools, but tools alone are insufficient. The organizational practices -- blameless postmortems, infrastructure as code discipline, deployment standards -- determine whether operations are predictable and improving or chaotic and reactive.

Pillar 5: Performance Efficiency¶

Focuses on the ability of a workload to scale and meet the demands placed on it by users in an efficient manner.

Design Principles¶

Negotiate realistic performance targets
Design to meet capacity requirements
Achieve and sustain performance
Improve efficiency through optimization
Test and monitor proactively

Checklist¶

Why This Matters¶

Performance directly impacts user experience and business outcomes. Slow applications lose users; systems that cannot scale under load lose revenue. Azure provides elastic scaling capabilities, but the architecture must be designed to take advantage of them. Common mistakes include monolithic designs that cannot scale horizontally, synchronous calls that create bottlenecks, and reliance on vertical scaling alone. Performance testing before production is essential -- assumptions about performance are frequently wrong.

How to Use in Architecture Reviews¶

When to Apply¶

New Azure workload design: Evaluate all five pillars before committing to an architecture. Use the Azure Well-Architected Review assessment tool for structured guidance.
Workload modernization: When migrating or modernizing existing applications on Azure, focus on pillars that represent the biggest gaps (often Reliability and Performance Efficiency for legacy workloads).
Architecture Decision Records: Reference specific pillar guidance when documenting trade-off decisions.
Periodic health checks: Conduct quarterly reviews using Azure Advisor and the Well-Architected assessment to track improvement.
Post-incident reviews: Map incidents to pillar gaps to identify systemic improvements.

How to Apply During a Design Session¶

Prioritize pillars for the workload: A customer-facing e-commerce application may weight Reliability and Performance Efficiency highest; an internal analytics platform may prioritize Cost Optimization. All pillars matter, but relative priority guides trade-off decisions.
Walk through each pillar checklist: Use the items as discussion prompts. Mark items as addressed, not applicable, or requiring follow-up.
Identify Azure service choices: For each component, evaluate whether the selected Azure service aligns with the pillar requirements. Consider PaaS and serverless options before IaaS.
Document trade-offs: When optimizing one pillar conflicts with another (e.g., multi-region for reliability increases cost), document the decision and rationale as an ADR.
Run the Azure Well-Architected Review: Use the official Azure Well-Architected Review assessment to get tailored recommendations.
Create an action plan: Prioritize identified gaps by business impact and create work items with owners and timelines.
Integrate with Azure Advisor: Enable Advisor recommendations aligned with each pillar for continuous, automated feedback.

Common Decisions (ADR Triggers)¶

Pillar prioritization — which pillars to focus on first based on workload maturity and business requirements
Reliability architecture — single-region vs multi-region, availability zone usage, health modeling approach
Security posture — Microsoft Defender tier selection, Sentinel vs third-party SIEM, identity architecture
Cost optimization — Azure Reservations vs Savings Plans, advisor recommendations, spot VM usage
Operational model — IaC tooling (Bicep vs Terraform), Azure Monitor vs third-party observability
Performance baseline — Azure Load Testing adoption, autoscale configuration, CDN strategy
Well-Architected Review cadence — Azure Advisor integration, assessment frequency, remediation tracking

Azure Well-Architected Framework¶

Scope¶

Pillar 1: Reliability¶

Design Principles¶

Checklist¶

Why This Matters¶

Pillar 2: Security¶

Design Principles¶

Checklist¶

Why This Matters¶

Pillar 3: Cost Optimization¶

Design Principles¶

Checklist¶

Why This Matters¶

Pillar 4: Operational Excellence¶

Design Principles¶

Checklist¶

Why This Matters¶

Pillar 5: Performance Efficiency¶

Design Principles¶

Checklist¶

Why This Matters¶

How to Use in Architecture Reviews¶

When to Apply¶

How to Apply During a Design Session¶

Common Decisions (ADR Triggers)¶

Reference Links¶

See Also¶