AWS Well-Architected Framework¶

Scope¶

Covers the six pillars of the AWS Well-Architected Framework (Operational Excellence, Security, Reliability, Performance Efficiency, Cost Optimization, Sustainability) with AWS-specific checklists and guidance for architecture reviews. Does not cover Azure (see frameworks/azure-well-architected.md) or GCP (see frameworks/gcp-architecture-framework.md) equivalents.

The AWS Well-Architected Framework helps cloud architects build secure, high-performing, resilient, and efficient infrastructure for their applications and workloads. It is organized into six pillars, each representing a foundational area of cloud architecture excellence.

Pillar 1: Operational Excellence¶

Focuses on running and monitoring systems to deliver business value and continually improving supporting processes and procedures.

Design Principles¶

Perform operations as code
Make frequent, small, reversible changes
Refine operations procedures frequently
Anticipate failure
Learn from all operational failures
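
As a rough illustration of "perform operations as code" and "make frequent, small, reversible changes", the sketch below wraps a change in a function that applies it, runs a health check, and rolls back if the check fails. The function name `apply_reversible_change`, the in-memory `config` dictionary, the `max_connections` setting, and the limit of 1000 are all hypothetical stand-ins for whatever deployment tooling and guardrails a team actually uses.

```python
from typing import Callable


def apply_reversible_change(
    apply: Callable[[], None],
    rollback: Callable[[], None],
    health_check: Callable[[], bool],
) -> bool:
    """Apply a change, verify it, and undo it if verification fails.

    Returns True when the change is kept, False when it was rolled back.
    """
    apply()
    try:
        healthy = health_check()
    except Exception:
        # Treat a crashing health check as a failed one (anticipate failure).
        healthy = False
    if not healthy:
        rollback()
    return healthy


# Hypothetical in-memory configuration standing in for a real system.
config = {"max_connections": 100}
previous = dict(config)  # snapshot taken before the change

kept = apply_reversible_change(
    apply=lambda: config.update(max_connections=5000),
    rollback=lambda: config.update(previous),
    # Assumed guardrail: anything above 1000 is considered unsafe here.
    health_check=lambda: config["max_connections"] <= 1000,
)
print("change kept:", kept, "| config:", config)
```

Because the change is small and its rollback is defined alongside it, a failed check leaves the system in its previous state instead of requiring manual repair.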

Checklist¶

Why This Matters¶

Operational excellence ensures your workloads can be managed predictably over time. Without it, teams spend excessive time firefighting, changes carry high risk, and the organization cannot learn from failures. AWS services like CloudWatch, Systems Manager, and EventBridge support operational excellence, but the organizational practices matter more than the tools.

Pillar 2: Security¶

Focuses on protecting information, systems, and assets while delivering business value through risk assessments and mitigation strategies.

Design Principles¶

Implement a strong identity foundation
Enable traceability
Apply security at all layers
Automate security best practices
Protect data in transit and at rest
Keep people away from data
Prepare for security events

Checklist¶

Why This Matters¶

Security is not optional or bolt-on; it must be foundational. A single misconfigured S3 bucket or overly permissive IAM role can expose an entire organization. The shared responsibility model means AWS secures the cloud, but you must secure what you build in it. Automated, layered security controls reduce human error and ensure consistent protection.
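
To make the "overly permissive IAM role" risk concrete, here is a minimal sketch that scans an IAM policy document for `Allow` statements combining a wildcard action with a wildcard resource. It assumes the standard IAM JSON policy layout (`Statement`, `Effect`, `Action`, `Resource`); the helper names and `EXAMPLE_POLICY` are hypothetical, and the check is deliberately narrow (it ignores `NotAction`, conditions, and resource-based policies), so it is an illustration rather than a substitute for purpose-built policy analysis.

```python
def _as_list(value):
    """IAM allows either a single string or a list for Action and Resource."""
    if value is None:
        return []
    return [value] if isinstance(value, str) else list(value)


def find_overly_permissive(policy: dict) -> list:
    """Return Allow statements that pair a wildcard action with Resource '*'."""
    statements = policy.get("Statement", [])
    if isinstance(statements, dict):
        # A policy may contain one statement object instead of a list.
        statements = [statements]

    flagged = []
    for statement in statements:
        if statement.get("Effect") != "Allow":
            continue
        actions = _as_list(statement.get("Action"))
        resources = _as_list(statement.get("Resource"))
        wildcard_action = any(a == "*" or a.endswith(":*") for a in actions)
        wildcard_resource = "*" in resources
        if wildcard_action and wildcard_resource:
            flagged.append(statement)
    return flagged


# Hypothetical policy used only for this example.
EXAMPLE_POLICY = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Sid": "ReadReports",
            "Effect": "Allow",
            "Action": ["s3:GetObject"],
            "Resource": "arn:aws:s3:::example-reports/*",
        },
        {"Sid": "TooBroad", "Effect": "Allow", "Action": "s3:*", "Resource": "*"},
    ],
}

for statement in find_overly_permissive(EXAMPLE_POLICY):
    print("Review statement:", statement["Sid"])
```

Running a check like this automatically, for example on every policy change, is one way to apply the "automate security best practices" principle.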

Pillar 3: Reliability¶

Focuses on the ability of a workload to perform its intended function correctly and consistently when expected.

Design Principles¶

Automatically recover from failure
Test recovery procedures
Scale horizontally to increase aggregate workload availability
Stop guessing capacity
Manage change in automation

Checklist¶

Why This Matters¶

Users expect systems to work. Reliability underpins all other pillars because a secure but unavailable system, or a cost-optimized but unreliable one, delivers no value. AWS provides the building blocks (multi-AZ, Auto Scaling, Route 53 health checks), but reliability requires deliberate architectural decisions around failure isolation, testing, and recovery automation.
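
The effect of redundancy on availability can be estimated with simple arithmetic. The sketch below assumes that failures are independent (real Availability Zone failures are not perfectly so) and uses an illustrative 99% figure that is not an AWS number; the function names are hypothetical.

```python
def redundant_availability(single: float, copies: int) -> float:
    """Availability of N redundant copies when any one copy is sufficient."""
    return 1 - (1 - single) ** copies


def chained_availability(*components: float) -> float:
    """Availability of components that must all work for a request to succeed."""
    total = 1.0
    for availability in components:
        total *= availability
    return total


def downtime_minutes_per_year(availability: float) -> float:
    """Expected unavailable minutes in a 365-day year."""
    return (1 - availability) * 365 * 24 * 60


single_az = 0.99  # assumed figure, for illustration only
two_az = redundant_availability(single_az, copies=2)

print(f"one AZ : {single_az:.4%}, about {downtime_minutes_per_year(single_az):.0f} min/year down")
print(f"two AZs: {two_az:.4%}, about {downtime_minutes_per_year(two_az):.0f} min/year down")
```

Under these assumptions, adding a second independent copy cuts expected downtime by roughly two orders of magnitude, while chaining hard dependencies in series lowers availability. This is why failure isolation matters as much as redundancy.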

Pillar 4: Performance Efficiency¶

Focuses on the efficient use of computing resources to meet requirements, and maintaining that efficiency as demand changes and technologies evolve.

Design Principles¶

Democratize advanced technologies
Go global in minutes
Use serverless architectures
Experiment more often
Consider mechanical sympathy

Checklist¶

Why This Matters¶

Performance efficiency is not just about speed; it is about using the right resources in the right configuration for your specific workload. Over-provisioning wastes money, while under-provisioning degrades the user experience. AWS continuously releases new instance types, managed services, and serverless options, so an architecture that was optimal a year ago may no longer be the best choice.
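
One way to spot over- and under-provisioning is to classify a resource by a high percentile of its observed utilization. The sketch below uses a nearest-rank 95th percentile over CPU samples; the 40% and 80% thresholds, the function names, and the sample data are assumptions chosen for illustration, not AWS guidance.

```python
import math


def percentile(samples, pct: int) -> float:
    """Nearest-rank percentile: smallest value with at least pct% of samples at or below it."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def classify_sizing(cpu_samples, low: float = 40.0, high: float = 80.0) -> str:
    """Label a resource from its p95 CPU utilization (thresholds are assumptions)."""
    p95 = percentile(cpu_samples, 95)
    if p95 < low:
        return "over-provisioned"
    if p95 > high:
        return "under-provisioned"
    return "right-sized"


# Hypothetical CPU utilization samples, in percent.
fleet = {
    "batch-worker": [5, 8, 10, 12, 9, 7, 11, 6, 10, 8],
    "api-server": [45, 50, 55, 60, 52, 58, 49, 61, 57, 53],
    "db-primary": [70, 85, 90, 95, 88, 92, 75, 91, 89, 93],
}

for name, samples in fleet.items():
    print(f"{name}: {classify_sizing(samples)}")
```

A percentile is used instead of the mean so that short bursts, which are what users actually feel, are not averaged away.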

Pillar 5: Cost Optimization¶

Focuses on avoiding unnecessary costs, understanding where money is spent, and selecting the most appropriate resource types in the right quantities.

Design Principles¶

Implement cloud financial management
Adopt a consumption model
Measure overall efficiency
Stop spending money on undifferentiated heavy lifting
Analyze and attribute expenditure

Checklist¶

Why This Matters¶

Cloud spending can grow unchecked without deliberate management. Cost optimization is not about cutting corners; it is about ensuring every dollar spent delivers business value. The pay-as-you-go model is only cost-effective if you match consumption to actual need. Organizations that treat cost optimization as an ongoing practice rather than a one-time exercise consistently achieve better outcomes.
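
"Analyze and attribute expenditure" usually starts with grouping spend by an ownership tag and surfacing whatever is untagged. The sketch below does this over a small in-memory list; the line-item layout, the `team` tag key, and all amounts are hypothetical and do not mirror any particular AWS billing export format.

```python
from collections import defaultdict
from decimal import Decimal

# Hypothetical monthly line items; real billing data has a different shape.
LINE_ITEMS = [
    {"service": "AmazonEC2", "cost": Decimal("412.50"), "tags": {"team": "checkout"}},
    {"service": "AmazonS3", "cost": Decimal("38.20"), "tags": {"team": "search"}},
    {"service": "AmazonRDS", "cost": Decimal("120.00"), "tags": {"team": "checkout"}},
    {"service": "AmazonEC2", "cost": Decimal("95.30"), "tags": {}},
]


def attribute_costs(items, tag_key: str, untagged_label: str = "(untagged)") -> dict:
    """Sum cost per value of tag_key, collecting untagged spend under one label."""
    totals = defaultdict(Decimal)
    for item in items:
        owner = item["tags"].get(tag_key, untagged_label)
        totals[owner] += item["cost"]
    return dict(totals)


totals = attribute_costs(LINE_ITEMS, tag_key="team")
grand_total = sum(totals.values(), Decimal("0"))

# Print owners from largest to smallest spend, with their share of the total.
for owner, cost in sorted(totals.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{owner:<12} {cost:>8} ({cost / grand_total:.1%})")
```

The untagged bucket is often the most useful line in such a report: spend that nobody owns is spend that nobody is optimizing.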

Pillar 6: Sustainability¶

Focuses on minimizing the environmental impacts of running cloud workloads.

Design Principles¶

Understand your impact
Establish sustainability goals
Maximize utilization
Anticipate and adopt new, more efficient offerings
Use managed services
Reduce the downstream impact of your cloud workloads

Checklist¶

Why This Matters¶

Cloud computing is not inherently green; it simply shifts the environmental impact. AWS infrastructure is more energy-efficient than most on-premises data centers, but architects still bear responsibility for how efficiently they use those resources. Sustainability and cost optimization are often aligned: reducing waste lowers both bills and environmental impact.

How to Use in Architecture Reviews¶

When to Apply¶

New workload design: Walk through all six pillars before finalizing architecture. Identify gaps and trade-offs explicitly.
Periodic reviews: Schedule quarterly or semi-annual reviews of production workloads against the framework.
Post-incident: After a significant incident, use the relevant pillar (usually Reliability or Security) to identify systemic gaps.
Before migration: When moving workloads to AWS, use the framework to design the target architecture rather than defaulting to a lift-and-shift.
Cost reviews: When cloud spend becomes a concern, focus on Cost Optimization and Performance Efficiency pillars together.

How to Apply During a Design Session¶

Start with business context: Identify which pillars matter most for this workload. A financial trading system prioritizes Performance Efficiency and Reliability; a dev/test environment prioritizes Cost Optimization.
Walk through each pillar systematically: Use the checklists above as conversation starters. Not every item applies to every workload.
Document trade-offs explicitly: Record decisions where you intentionally deprioritized one pillar in favor of another (e.g., accepted higher cost for lower latency). Capture these as Architecture Decision Records (ADRs).
Identify top risks: For each pillar, identify the 2-3 highest-risk items and create action items to address them.
Use the AWS Well-Architected Tool: Run a formal review in the AWS console to generate a structured report and track improvements over time.
Assign owners: Each identified risk or improvement should have a clear owner and timeline.
Revisit regularly: Architecture is not a one-time activity. As workloads evolve, re-evaluate against the framework.
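
The "identify top risks" and "assign owners" steps above can be tracked with a very small risk register. The sketch below is one possible shape: the `Risk` fields, the 1 to 5 severity scale, and the sample entries (names, dates, descriptions) are hypothetical and only illustrate selecting the top items per pillar and flagging those without an owner or a due date.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Risk:
    pillar: str
    description: str
    severity: int  # assumed scale: 1 (low) to 5 (high)
    owner: Optional[str] = None
    due: Optional[date] = None


def top_risks(risks, per_pillar: int = 3) -> dict:
    """Keep the highest-severity risks for each pillar, up to per_pillar items."""
    shortlisted = {}
    for risk in sorted(risks, key=lambda r: r.severity, reverse=True):
        bucket = shortlisted.setdefault(risk.pillar, [])
        if len(bucket) < per_pillar:
            bucket.append(risk)
    return shortlisted


def missing_ownership(risks) -> list:
    """Risks that still lack an owner or a timeline."""
    return [r for r in risks if r.owner is None or r.due is None]


# Hypothetical findings from a design session.
findings = [
    Risk("Reliability", "No tested restore procedure", 5, owner="dana", due=date(2025, 3, 1)),
    Risk("Reliability", "Single-AZ database", 4),
    Risk("Reliability", "No load shedding", 2),
    Risk("Reliability", "Manual failover runbook", 3),
    Risk("Security", "Wildcard IAM policy on deploy role", 5, owner="lee"),
]

shortlist = top_risks(findings, per_pillar=3)
for pillar, risks in shortlist.items():
    print(pillar, "->", [r.description for r in risks])

unassigned = missing_ownership([r for risks in shortlist.values() for r in risks])
print("needs owner or due date:", [r.description for r in unassigned])
```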

Common Decisions (ADR Triggers)¶

Pillar prioritization: which pillars to focus on first, based on workload maturity and business needs
Cost optimization strategy: Reserved Instances vs. Savings Plans vs. Spot Instances, and right-sizing cadence
Resilience architecture: single-region multi-AZ vs. multi-region, with RPO/RTO targets per workload tier
Security posture: AWS-native vs. third-party security tools, centralized vs. distributed security model
Operational model: IaC tooling (CloudFormation vs. CDK vs. Terraform), observability stack, deployment strategy
Performance optimization: caching strategy, database selection, compute right-sizing methodology
Sustainability approach: instance family selection for energy efficiency, right-sizing for carbon reduction
Well-Architected Review cadence: how often to conduct reviews and how to prioritize remediation
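
When one of these triggers fires, the resulting decision can be captured in a lightweight record. The sketch below follows the commonly used title, status, context, decision, and consequences layout and adds two fields for the pillars that were favored and deprioritized; the class name, field names, numbering scheme, and sample content are hypothetical, not a prescribed AWS format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class DecisionRecord:
    number: int
    title: str
    status: str
    context: str
    decision: str
    consequences: str
    favored_pillars: List[str] = field(default_factory=list)
    deprioritized_pillars: List[str] = field(default_factory=list)

    def render(self) -> str:
        """Render the record as plain text suitable for a docs repository."""
        return "\n".join(
            [
                f"ADR-{self.number:04d}: {self.title}",
                f"Status: {self.status}",
                f"Favored pillars: {', '.join(self.favored_pillars) or 'none'}",
                f"Deprioritized pillars: {', '.join(self.deprioritized_pillars) or 'none'}",
                "",
                f"Context: {self.context}",
                f"Decision: {self.decision}",
                f"Consequences: {self.consequences}",
            ]
        )


# Hypothetical record for a latency-versus-cost trade-off.
adr = DecisionRecord(
    number=7,
    title="Accept higher cost for lower latency",
    status="Accepted",
    context="Checkout latency exceeds the agreed target during peak traffic.",
    decision="Add an in-memory cache in front of the product database.",
    consequences="Monthly spend increases; cache invalidation must be monitored.",
    favored_pillars=["Performance Efficiency"],
    deprioritized_pillars=["Cost Optimization"],
)
print(adr.render())
```

Naming the deprioritized pillar explicitly makes the trade-off easy to find when the workload is next reviewed.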
