Redis

Scope

This file covers Redis (and Valkey) architecture decisions: clustering topology (Redis Cluster vs Sentinel), persistence modes (RDB snapshots vs AOF), data structure selection and memory optimization, caching patterns vs primary datastore use cases, memory management and eviction policies, Redis Stack (Search, JSON, Time Series, Graph, Probabilistic), pub/sub and Streams for messaging, Lua scripting and Functions, and managed service options (Amazon ElastiCache/MemoryDB, Azure Cache for Redis, Google Memorystore). For general caching strategy and CDN patterns, see general/data.md. For AWS-specific ElastiCache configuration, see providers/aws/elasticache.md.

Checklist

Why This Matters

Redis is one of the most widely deployed in-memory data stores, used as a cache and, increasingly, as a primary database, but its apparent simplicity hides operational complexity that surfaces at scale. Commands execute on a single thread, which makes Redis fast and predictable but also means that a single slow command (KEYS, SMEMBERS on a large set, an unoptimized Lua script) blocks every other operation until it finishes. In production, a developer running KEYS in a debugging session can cause a multi-second outage across all services that depend on that Redis instance.
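One common way to avoid a blocking KEYS call is cursor-based iteration with SCAN. The sketch below assumes a redis-py-style client that exposes `scan_iter(match=..., count=...)`; the helper name `iter_keys_safely` and the `StubClient` class are hypothetical and exist only so the example runs without a server.

```python
from fnmatch import fnmatchcase
from typing import Iterable, Iterator, Optional


def iter_keys_safely(client, pattern: str, batch_hint: int = 500) -> Iterator[str]:
    """Yield keys matching `pattern` using incremental SCAN instead of KEYS."""
    # scan_iter drives the SCAN cursor loop; `count` is only a hint for how
    # much work the server does per call, so each call stays short.
    yield from client.scan_iter(match=pattern, count=batch_hint)


class StubClient:
    """Hypothetical in-memory stand-in for a redis-py client (illustration only)."""

    def __init__(self, keys: Iterable[str]) -> None:
        self._keys = list(keys)

    def scan_iter(self, match: Optional[str] = None,
                  count: Optional[int] = None) -> Iterator[str]:
        # fnmatch globbing approximates Redis glob patterns for this demo.
        for key in self._keys:
            if match is None or fnmatchcase(key, match):
                yield key


client = StubClient(["session:1", "session:2", "user:1"])
matching = sorted(iter_keys_safely(client, "session:*"))
```

With a real client, the same helper spreads the work over many short server calls, so other commands can interleave between them.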

Memory management is the primary operational challenge. Redis stores all data in RAM, so unlike a disk-based database, hitting the memory limit has immediate consequences: depending on the eviction policy, Redis either evicts keys (data loss) or rejects writes outright. The gap between the raw dataset size and the memory the process actually consumes is often surprising: memory fragmentation, replication buffers, and copy-on-write overhead from forking for persistence can push the footprint to 2-3x the raw data size. Organizations that provision Redis based on dataset size alone routinely encounter out-of-memory events when persistence operations or replica synchronization double the memory footprint.
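The sizing arithmetic can be sketched as follows. Every number here is an illustrative assumption rather than a measured value: the fragmentation ratio, the replication buffer size, and the fraction of pages copied during a fork all vary by workload, and the function name is hypothetical.

```python
def estimate_provisioned_memory_gb(
    dataset_gb: float,
    fragmentation_ratio: float = 1.2,    # assumed allocator overhead
    replication_buffer_gb: float = 1.0,  # assumed backlog and replica output buffers
    fork_cow_fraction: float = 1.0,      # assumed worst case: every page copied during fork
) -> float:
    """Rough upper-bound memory estimate for one Redis node, in GB."""
    resident = dataset_gb * fragmentation_ratio
    # Copy-on-write during an RDB save or AOF rewrite can duplicate
    # pages that are modified while the child process is running.
    fork_overhead = resident * fork_cow_fraction
    return resident + replication_buffer_gb + fork_overhead


estimate_gb = estimate_provisioned_memory_gb(10.0)
```

Under these assumptions a 10 GB dataset calls for roughly 25 GB of headroom, which is in line with the 2-3x range described above.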

The most dangerous anti-pattern is using Redis as a primary datastore without understanding its durability guarantees. AOF is not enabled by default; when it is enabled with the default appendfsync everysec policy, roughly one second of writes can be lost during a crash. With RDB-only persistence, the loss window is the interval between snapshots (often 5-15 minutes). Organizations that store financial transactions or session state in Redis without alternative persistence discover these gaps during their first server failure.
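To make the loss windows concrete, here is a back-of-the-envelope calculation. The write rate of 2,000 writes per second is a hypothetical figure chosen for illustration, and the function name is not from any library.

```python
def worst_case_lost_writes(writes_per_second: float, loss_window_seconds: float) -> int:
    """Upper bound on writes lost if the server crashes at the end of the window."""
    return int(writes_per_second * loss_window_seconds)


ASSUMED_WRITE_RATE = 2000  # hypothetical sustained writes per second

# AOF with appendfsync everysec: about a one-second window.
aof_everysec_loss = worst_case_lost_writes(ASSUMED_WRITE_RATE, 1)

# RDB-only with a snapshot every 5 minutes: a 300-second window.
rdb_5min_loss = worst_case_lost_writes(ASSUMED_WRITE_RATE, 5 * 60)
```

At this assumed rate, the AOF window puts about 2,000 writes at risk while a five-minute RDB interval puts about 600,000 at risk, a 300-fold difference that should inform the persistence choice.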

Common Decisions (ADR Triggers)

ADR: Redis Cluster vs Sentinel

Context: The organization needs Redis high availability and must choose between Sentinel (failover only) and Cluster (sharding + failover).

Options:

Criterion	Redis Sentinel	Redis Cluster
Data Sharding	No (single primary)	Yes (16,384 hash slots distributed across primaries)
Max Dataset Size	Limited by single node RAM	Scales across nodes
Multi-Key Operations	All keys on same instance	Only within same hash slot
Client Complexity	Sentinel-aware client	Cluster-aware client with redirection
Minimum Nodes	3 Sentinels + 1 primary + 1 replica	6 (3 primaries + 3 replicas)
Failover Time	10-30 seconds	1-15 seconds

Decision drivers: Dataset size (above or below single-node memory), need for multi-key atomic operations (MULTI/EXEC across keys), client library cluster support, and operational complexity tolerance.
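The multi-key restriction in Redis Cluster follows from how keys map to hash slots: the slot is CRC16 of the key modulo 16,384, and if the key contains a non-empty `{...}` hash tag, only the tag content is hashed. The sketch below is a plain-Python reimplementation of that rule for illustration; the function names are hypothetical and it is not the code any client library ships.

```python
HASH_SLOTS = 16384


def crc16_xmodem(data: bytes) -> int:
    """CRC16 with polynomial 0x1021 and initial value 0 (the XMODEM variant)."""
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            if crc & 0x8000:
                crc = ((crc << 1) ^ 0x1021) & 0xFFFF
            else:
                crc = (crc << 1) & 0xFFFF
    return crc


def key_hash_slot(key: str) -> int:
    """Return the cluster hash slot for `key`, honoring hash tags."""
    start = key.find("{")
    if start != -1:
        end = key.find("}", start + 1)
        # Only a non-empty tag between the first '{' and the next '}' counts.
        if end != -1 and end != start + 1:
            key = key[start + 1:end]
    return crc16_xmodem(key.encode("utf-8")) % HASH_SLOTS


# Keys sharing a hash tag land in the same slot, so multi-key
# operations across them are allowed in Cluster mode.
same_slot = key_hash_slot("{user1000}.following") == key_hash_slot("{user1000}.followers")
```

Designing key names around hash tags is the usual way to keep related keys together, at the cost of potentially uneven slot distribution if one tag becomes very hot.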

ADR: Cache vs Primary Datastore

Context: Redis is being considered for a workload, and the team must determine whether it serves as a cache layer or the primary system of record.

Options:

- Cache only: Data always exists in a backing database. TTLs on all keys. Eviction policy (allkeys-lru or volatile-lru) enabled. Cache misses are served from the backing store. Losing the cache increases latency (cold start) but loses no data. Simpler operations, because the cache can be rebuilt from scratch.
- Primary datastore: Redis is the source of truth. No eviction (noeviction policy). Persistence mandatory (AOF + RDB). Backup and restore procedures tested and documented. High availability with automatic failover. Data loss means real data loss. Requires the same operational rigor as any production database.
- Hybrid (same instance): Some keys are cache (with TTLs), others are primary data (without TTLs). Risk: eviction policies cannot distinguish between cache and primary data unless volatile-* policies are used consistently. Operational ambiguity about which keys are expendable.
- Hybrid (separate instances): Dedicated Redis for cache, separate Redis for primary data. Clear operational boundaries. Higher infrastructure cost. Recommended over single-instance hybrid.

Decision drivers: Whether a backing datastore exists, durability requirements, acceptable data loss window, operational maturity for managing Redis as a database, and cost of separate instances.
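The cache-only option typically follows the cache-aside pattern: read from Redis, fall back to the backing store on a miss, then populate the cache with a TTL. The sketch below assumes a redis-py-style client with `get(key)` and `set(key, value, ex=seconds)`; the function name, the `StubCache` class, the loader, and the 300-second TTL are all hypothetical choices made for illustration.

```python
import json
from typing import Any, Callable, Dict, List, Optional


def get_with_cache_aside(cache, key: str,
                         load_from_source: Callable[[str], Any],
                         ttl_seconds: int = 300) -> Any:
    """Return the value for `key`, consulting the cache before the backing store."""
    cached = cache.get(key)
    if cached is not None:
        return json.loads(cached)
    value = load_from_source(key)  # cache miss: hit the system of record
    # Always set a TTL so the key stays expendable under volatile-* policies.
    cache.set(key, json.dumps(value), ex=ttl_seconds)
    return value


class StubCache:
    """Hypothetical in-memory stand-in for a Redis client (illustration only)."""

    def __init__(self) -> None:
        self.data: Dict[str, str] = {}
        self.ttls: Dict[str, Optional[int]] = {}

    def get(self, key: str) -> Optional[str]:
        return self.data.get(key)

    def set(self, key: str, value: str, ex: Optional[int] = None) -> bool:
        self.data[key] = value
        self.ttls[key] = ex
        return True


backing_store = {"user:1": {"name": "Ada"}}
load_calls: List[str] = []


def load_user(key: str) -> Any:
    load_calls.append(key)
    return backing_store[key]


cache = StubCache()
first = get_with_cache_aside(cache, "user:1", load_user)   # miss, loads from source
second = get_with_cache_aside(cache, "user:1", load_user)  # hit, served from cache
```

Because every cached key carries a TTL and can be reloaded from the backing store, flushing or losing this instance costs latency rather than data.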

ADR: Redis vs Valkey

Context: Redis re-licensing (SSPL + RSALv2 dual license from Redis 7.4+) requires evaluating whether to continue with Redis or adopt the Valkey fork.

Options:

- Redis (Redis Ltd.): Original project with Redis Stack modules (Search, JSON, TimeSeries). Dual-licensed; the SSPL restricts offering Redis as a managed service. Redis Ltd. continues to add proprietary features. Enterprise support through Redis Ltd.
- Valkey (Linux Foundation): Fork of Redis 7.2 under the BSD 3-Clause license. Backed by AWS, Google, Oracle. Adopted by Amazon ElastiCache and MemoryDB. Community-driven development. May diverge from Redis in features and APIs over time.
- KeyDB (Snap): Multi-threaded Redis fork with higher throughput per node. Less community adoption than Valkey. Different performance characteristics due to multi-threading.

Decision drivers: Licensing compliance requirements, managed service provider preference (AWS now offers Valkey on ElastiCache and MemoryDB), need for Redis Stack modules (currently Redis-only), and long-term ecosystem stability preference.

Reference Links

Redis Documentation -- commands, data types, clustering, persistence, and administration
Valkey Documentation -- Valkey-specific features, migration from Redis, and community roadmap
Redis Cluster Specification -- hash slot distribution, gossip protocol, failover mechanics, and client redirection
Amazon MemoryDB Documentation -- durable Redis-compatible database with strong consistency
Redis University -- free courses on data structures, RediSearch, and Redis Streams