GCP Storage¶

Scope¶

Cloud Storage (classes, lifecycle, retention, dual/multi-region), Persistent Disk (pd-balanced, pd-ssd, pd-extreme), Filestore (NFS tiers), Cloud Storage FUSE, Storage Transfer Service, and Transfer Appliance.

Checklist¶

Why This Matters¶

GCP Cloud Storage is the foundational object store with a uniquely tiered pricing model: storage class transitions are free when moving to colder tiers, but retrieval costs increase significantly (Archive retrieval is $0.05/GB vs $0 for Standard). Lifecycle policies are essential for cost control but must be designed carefully because class transitions are one-way (Standard to Nearline, never Nearline to Standard automatically). Persistent Disk performance scales linearly with disk size for pd-balanced and pd-ssd, meaning undersized disks deliver poor IOPS -- a common surprise for teams migrating from AWS EBS where performance tiers are more explicit. Filestore pricing is based on provisioned capacity, and performance scales with tier and size, making right-sizing critical.

Common Decisions (ADR Triggers)¶

Storage class strategy -- single class vs autoclass (automatic class management) vs manual lifecycle rules, cost modeling for access patterns
Replication model -- single-region (cheapest) vs dual-region (specific region pair, turbo replication option) vs multi-region (US/EU/ASIA)
Block storage type -- pd-balanced (default) vs pd-ssd vs pd-extreme (provisioned IOPS), regional persistent disk for HA
File storage tier -- Filestore Basic vs Zonal/Regional vs Enterprise (snapshots, replication), Filestore vs Cloud Storage FUSE vs Parallelstore for HPC
Access control model -- uniform bucket-level access (IAM only) vs fine-grained ACLs, signed URLs for external sharing
Data protection -- object versioning vs bucket retention policies vs Object Lock, soft delete (default 7-day recovery)
Data transfer method -- gsutil/gcloud CLI vs Storage Transfer Service vs Transfer Appliance (offline, 100TB+), interconnect transfer
Cost optimization -- autoclass vs manual lifecycle rules, requestor-pays for shared datasets, committed use discounts on Persistent Disk

Reference Architectures¶

Google Cloud Architecture Center: Storage -- reference architectures for object storage, file systems, and data lake patterns
Google Cloud Architecture Framework: Cost optimization - Storage -- best practices for storage class selection, lifecycle management, and cost control
Google Cloud: Best practices for Cloud Storage -- reference patterns for naming, access control, retry handling, and performance optimization
Google Cloud: Persistent Disk deep dive -- reference for disk performance characteristics, IOPS scaling by size, and throughput limits per machine type
Google Cloud: Data transfer options -- reference architecture for selecting the right transfer mechanism based on data volume, network bandwidth, and timeline