Security Operations

Scope

Covers security operations center (SOC) design, SIEM architecture, incident response planning, threat detection and hunting, SOAR integration, threat intelligence, and forensic readiness. Applicable to any organization establishing or maturing security operations capabilities — whether building an in-house SOC, operating a hybrid model with an MSSP, or deploying cloud-native security monitoring across multi-cloud environments.

Overview

Security operations is the continuous practice of detecting, analyzing, and responding to cybersecurity threats against an organization's infrastructure, applications, and data. A mature security operations program combines people (SOC analysts organized in tiers), processes (incident response playbooks aligned to frameworks like NIST SP 800-61), and technology (SIEM for log aggregation and correlation, SOAR for automated response, EDR/XDR for endpoint visibility, and threat intelligence feeds for context enrichment). The goal is to minimize mean time to detect (MTTD) and mean time to respond (MTTR) while maintaining forensic readiness and compliance with regulatory incident reporting requirements.
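
The MTTD and MTTR metrics named above can be computed from incident timestamps. The following is a minimal sketch, assuming a hypothetical Incident record with three timestamps; the class and field names are illustrative and not taken from any particular case management tool. MTTR is measured here from detection to resolution, which is one common definition among several.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import mean


@dataclass(frozen=True)
class Incident:
    """Hypothetical incident record; the field names are illustrative."""
    occurred_at: datetime  # earliest known attacker activity
    detected_at: datetime  # first alert or report that opened the case
    resolved_at: datetime  # containment and remediation complete


def mean_time_to_detect(incidents: list[Incident]) -> timedelta:
    """Average gap between first attacker activity and detection."""
    # statistics.mean raises StatisticsError if the list is empty.
    seconds = mean(
        (i.detected_at - i.occurred_at).total_seconds() for i in incidents
    )
    return timedelta(seconds=seconds)


def mean_time_to_respond(incidents: list[Incident]) -> timedelta:
    """Average gap between detection and resolution (an assumed definition)."""
    seconds = mean(
        (i.resolved_at - i.detected_at).total_seconds() for i in incidents
    )
    return timedelta(seconds=seconds)


if __name__ == "__main__":
    sample = [
        Incident(datetime(2024, 1, 1, 0), datetime(2024, 1, 1, 4), datetime(2024, 1, 1, 10)),
        Incident(datetime(2024, 1, 2, 0), datetime(2024, 1, 2, 2), datetime(2024, 1, 2, 4)),
    ]
    print("MTTD:", mean_time_to_detect(sample))
    print("MTTR:", mean_time_to_respond(sample))
```

Tracking both values per quarter, and per incident severity, tends to be more informative than a single overall average, because a few long-running cases can dominate the mean.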

Checklist

Why This Matters

Organizations without mature security operations can leave breaches undetected for 200 days or more, a figure in line with the mean time to identify a breach reported in some industry breach studies. That means attackers may operate undetected in the environment for over six months before discovery. During that time, they can establish persistence, move laterally, escalate privileges, and exfiltrate data. The cost differential between breaches detected internally and those discovered by external parties (law enforcement, customers, or the media) is substantial: internally detected breaches tend to be contained faster, affect fewer records, and cost less to remediate.

A well-designed SOC with properly tuned SIEM, documented playbooks, and trained analysts reduces dwell time to hours or days rather than months. Alert fatigue is the primary operational risk — when analysts are overwhelmed by thousands of low-fidelity alerts, they begin ignoring or auto-closing them, and real attacks slip through. Organizations that invest in detection engineering, alert tuning, and SOAR automation see measurable improvements in analyst effectiveness and incident response times.
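
Alert tuning usually starts by finding the rules that generate the most low-fidelity alerts. The sketch below assumes analysts record a true-positive or false-positive disposition when closing each alert; the function name, the input shape, and both thresholds are hypothetical and should be adjusted to local data.

```python
from collections import Counter
from typing import Iterable


def noisy_rules(
    dispositions: Iterable[tuple[str, bool]],
    min_alerts: int = 20,
    min_precision: float = 0.1,
) -> dict[str, float]:
    """Return rules whose share of true positives falls below min_precision.

    dispositions: (rule_name, is_true_positive) pairs from closed alerts.
    Rules with fewer than min_alerts closed alerts are skipped because
    their precision estimate is too unstable to act on.
    """
    total: Counter[str] = Counter()
    true_positives: Counter[str] = Counter()
    for rule, is_true_positive in dispositions:
        total[rule] += 1
        if is_true_positive:
            true_positives[rule] += 1

    flagged = {}
    for rule, count in total.items():
        precision = true_positives[rule] / count
        if count >= min_alerts and precision < min_precision:
            flagged[rule] = precision
    return flagged


if __name__ == "__main__":
    closed = [("dns_long_query", False)] * 49 + [("dns_long_query", True)]
    closed += [("edr_credential_dump", True)] * 8 + [("edr_credential_dump", False)] * 2
    for rule, precision in noisy_rules(closed).items():
        print(f"{rule}: {precision:.0%} true positives, candidate for tuning")
```

A flagged rule is a candidate for tuning, suppression of known-benign sources, or demotion to a hunting query; it is not automatically a rule to delete.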

Forensic readiness is frequently overlooked until a breach occurs. Without pre-positioned forensic tooling, proper log retention, and chain-of-custody procedures, organizations cannot determine the scope of a breach, satisfy regulatory investigation requirements, or provide evidence for law enforcement. The cost of retrofitting forensic capability during an active incident — including hiring external IR firms at premium rates — vastly exceeds the cost of building readiness into the architecture from the start.

MITRE ATT&CK has become the de facto standard for measuring detection coverage and communicating about adversary behavior. Organizations that map their detection rules to ATT&CK techniques can quantify their security posture, identify gaps, and prioritize detection engineering investments based on the techniques most commonly used by threat actors targeting their industry.
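
Mapping detection rules to ATT&CK techniques makes coverage measurable. The sketch below assumes a hypothetical rule inventory keyed by rule name and a hypothetical priority list of technique IDs; the rule names are invented for illustration, while the technique IDs (T1566 Phishing, T1059 Command and Scripting Interpreter, T1078 Valid Accounts, T1021 Remote Services, T1486 Data Encrypted for Impact) are real ATT&CK identifiers.

```python
from collections import defaultdict

# Hypothetical rule inventory: rule name -> ATT&CK technique IDs it detects.
DETECTION_RULES = {
    "suspicious_powershell_encoded_command": ["T1059"],
    "impossible_travel_login": ["T1078"],
    "rdp_lateral_movement": ["T1021"],
}

# Hypothetical priority list, for example derived from threat intelligence
# about actors targeting the organization's industry.
PRIORITY_TECHNIQUES = ["T1566", "T1059", "T1078", "T1021", "T1486"]


def coverage_report(rules: dict[str, list[str]], priority: list[str]) -> dict:
    """Summarize which priority techniques have at least one mapped rule."""
    rules_by_technique = defaultdict(list)
    for rule, techniques in rules.items():
        for technique in techniques:
            rules_by_technique[technique].append(rule)

    covered = [t for t in priority if t in rules_by_technique]
    gaps = [t for t in priority if t not in rules_by_technique]
    ratio = len(covered) / len(priority) if priority else 0.0
    return {"covered": covered, "gaps": gaps, "coverage": ratio}


if __name__ == "__main__":
    report = coverage_report(DETECTION_RULES, PRIORITY_TECHNIQUES)
    print(f"coverage: {report['coverage']:.0%}, gaps: {report['gaps']}")
```

A mapped rule only shows that some detection exists for a technique, not that it is effective, so a count like this is best treated as a starting point for prioritizing detection engineering work.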

Common Decisions (ADR Triggers)

ADR: SIEM Platform Selection

Context: The organization must select a SIEM platform for centralized log aggregation, correlation, and threat detection across on-premises and cloud environments.

Options:

Platform	Deployment Model	Strengths	Considerations
Splunk Enterprise Security	On-prem, cloud (Splunk Cloud), hybrid	Most mature search language (SPL), largest ecosystem of apps and integrations, strong for complex correlation	Expensive at high ingest volumes (ingest-based licensing), requires significant tuning expertise, hardware-intensive for on-prem
Microsoft Sentinel	Cloud-native (Azure)	Native integration with Microsoft 365 and Azure, built-in SOAR via Logic Apps, KQL query language, free ingestion for selected Microsoft data sources, best value for Microsoft-heavy environments	Weaker for non-Microsoft log sources, Azure dependency
Google Security Operations (formerly Chronicle)	Cloud-native (GCP)	Fixed-price storage model (not ingest-based), petabyte-scale search, YARA-L detection language, integrated SOAR	Smaller ecosystem than Splunk, relatively newer platform, best for organizations comfortable with Google Cloud
Elastic Security	Self-managed or Elastic Cloud	Open-source core, flexible deployment, strong for custom data sources, no per-GB licensing for self-managed	Requires significant operational investment for self-managed, cluster management complexity, security features require paid license
CrowdStrike Falcon LogScale (Humio)	Cloud-native or self-managed	Streaming architecture for real-time search, compression-efficient storage, fast query performance	Primarily log management rather than full SIEM, detection content library smaller than Splunk/Sentinel

Decision drivers: Daily log ingest volume and growth projections, existing technology stack (Microsoft vs multi-vendor), budget model preference (ingest-based vs fixed-price vs self-managed), team expertise with query languages (SPL, KQL, EQL), cloud strategy, and compliance requirements for data residency.
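
Because ingest volume and its growth drive both licensing and storage decisions, a simple compound-growth projection is a useful input to this ADR. The sketch below is illustrative only: the function name is hypothetical, and the starting volume and growth rate in the usage example are made-up numbers, not benchmarks.

```python
def project_daily_ingest(current_gb_per_day: float, annual_growth: float, years: int) -> list[float]:
    """Project daily ingest for each year, starting with the current year.

    annual_growth is a fraction, so 0.25 means 25% growth per year.
    Returns years + 1 values: today, then the end of each following year.
    """
    return [current_gb_per_day * (1 + annual_growth) ** year for year in range(years + 1)]


if __name__ == "__main__":
    # Hypothetical inputs: 100 GB/day today, growing 25% per year for 3 years.
    for year, volume in enumerate(project_daily_ingest(100.0, 0.25, 3)):
        print(f"year {year}: {volume:.1f} GB/day")
```

Comparing the projected volume at the end of the contract term against each vendor's pricing tiers can show whether an ingest-based or fixed-price model is likely to be cheaper over time.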

ADR: SOC Operating Model

Context: The organization must decide how to staff and operate its security operations center — fully in-house, fully outsourced to an MSSP/MDR, or a hybrid model.

Options:

In-house SOC: Full control over detection engineering, incident response, and threat hunting. Requires significant investment in hiring, training, and retaining skilled analysts (24x7 coverage requires a minimum of roughly 8-12 FTEs). Best for large organizations with complex environments and regulatory requirements that demand direct control.
MSSP/MDR (fully outsourced): Managed security service provider handles monitoring, alert triage, and initial response. Lower cost for 24x7 coverage, faster time to operational capability. Less customization, potential alert fatigue from generic rule sets, shared analyst attention across multiple clients. Best for small-to-mid-size organizations without dedicated security staff.
Hybrid SOC: MSSP provides 24x7 L1 monitoring and alert triage, in-house team handles L2/L3 investigation, threat hunting, and detection engineering. Balances cost and control. Requires clear escalation procedures and SLAs between MSSP and internal team. Most common model for mid-size organizations.

Decision drivers: Organization size and security team maturity, budget for 24x7 staffing, regulatory requirements for incident handling (some frameworks require internal IR capability), complexity of the technology environment, and tolerance for outsourced access to security telemetry.
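
The 8-12 FTE figure for in-house 24x7 coverage follows from simple shift arithmetic. The sketch below shows one way to derive it; the 40-hour week and the 80% availability factor (time left after leave, training, and sickness) are assumptions for illustration, not figures from a staffing standard.

```python
import math

HOURS_PER_WEEK = 24 * 7  # 168 hours of coverage needed per seat


def ftes_for_24x7(
    seats: int,
    weekly_hours_per_analyst: float = 40.0,
    availability: float = 0.8,
) -> int:
    """Estimate FTEs needed to keep `seats` analysts on shift around the clock.

    availability is the assumed fraction of contracted hours an analyst
    actually spends on shift after leave, training, and sickness.
    """
    productive_hours = weekly_hours_per_analyst * availability
    return math.ceil(seats * HOURS_PER_WEEK / productive_hours)


if __name__ == "__main__":
    print("2 seats, no absence allowance:", ftes_for_24x7(2, availability=1.0))
    print("2 seats, 80% availability:", ftes_for_24x7(2))
```

With two analysts on shift at all times, the estimate lands between 9 and 11 FTEs depending on the availability assumption, which is consistent with the 8-12 range quoted for an in-house SOC. Team leads, detection engineers, and threat hunters come on top of this shift roster.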

ADR: Log Retention and Storage Tiering Strategy

Context: Log data must be retained for operational investigation, threat hunting, and compliance mandates, but storage costs grow linearly with retention duration and ingest volume.

Options:

Single-tier hot storage: All logs searchable at full speed for the entire retention period. Simplest to operate but most expensive. Only viable for small-to-moderate ingest volumes (under 100 GB/day).
Two-tier (hot + cold): Hot storage for 30-90 days of full-fidelity indexed data, cold storage in object storage (S3, Azure Blob, GCS) for long-term compliance retention. Cold data requires rehydration for searching. Good balance of cost and accessibility.
Three-tier (hot + warm + cold): Hot for 7-30 days of real-time search, warm for 30-365 days of slower but still searchable data, cold archive for multi-year retention. Optimizes cost for high-volume environments. Adds operational complexity for managing tier transitions.

Decision drivers: Daily ingest volume, compliance retention mandates (commonly cited figures are PCI DSS 12 months with the most recent 3 months immediately available, HIPAA 6 years, and SOX 7 years; confirm which records each mandate actually covers), threat hunting requirements (hunters need access to historical data for retrospective IOC searches), budget constraints, and query performance requirements for different use cases.
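
The cost trade-off between tiering options can be estimated from daily ingest and the number of days held in each tier. The sketch below is a rough steady-state model: the Tier class, the tier durations, and especially the per-GB prices are hypothetical placeholders, and the model ignores compression, replication, indexing overhead, and rehydration or query charges.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tier:
    name: str
    days: int                # days of data held in this tier
    usd_per_gb_month: float  # hypothetical unit price, not a vendor quote


def steady_state_storage(daily_ingest_gb: float, tiers: list[Tier]) -> list[tuple[str, float, float]]:
    """Return (tier name, stored GB, monthly cost in USD) for each tier.

    At steady state a tier holds daily ingest multiplied by the number
    of days data spends in that tier.
    """
    rows = []
    for tier in tiers:
        stored_gb = daily_ingest_gb * tier.days
        rows.append((tier.name, stored_gb, stored_gb * tier.usd_per_gb_month))
    return rows


if __name__ == "__main__":
    # Hypothetical three-tier layout covering three years of retention.
    layout = [
        Tier("hot", days=30, usd_per_gb_month=0.10),
        Tier("warm", days=335, usd_per_gb_month=0.03),
        Tier("cold", days=730, usd_per_gb_month=0.004),
    ]
    for name, stored_gb, cost in steady_state_storage(100, layout):
        print(f"{name}: {stored_gb:,.0f} GB, ${cost:,.2f} per month")
```

Running the same calculation with a single hot tier for the full retention period gives a quick sense of how much the tiered layouts save at a given ingest volume.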

ADR: SOAR Platform Selection and Automation Scope

Context: Repetitive incident response tasks consume analyst time and delay response. A SOAR platform can automate enrichment, containment, and notification workflows, but scope must be carefully defined to avoid automating high-risk actions without human oversight.

Options:

Palo Alto XSOAR (formerly Demisto): Largest integration marketplace (700+ integrations), mature playbook visual editor, strong case management. Enterprise pricing, complex deployment.
Splunk SOAR (formerly Phantom): Tight integration with Splunk SIEM, visual playbook editor, community playbook sharing. Best paired with Splunk ES. Separate licensing from Splunk SIEM.
Microsoft Sentinel + Logic Apps: Native SOAR capability within Sentinel using Azure Logic Apps for workflow automation. No separate SOAR license for Sentinel customers, although Logic Apps runs are billed on consumption. Limited to Azure ecosystem for native integrations.
Tines: No-code automation platform, cloud-native, flexible API integration model, not SIEM-dependent. Strong for organizations that want SOAR without SIEM vendor lock-in.

Decision drivers: Existing SIEM platform (native SOAR integration reduces complexity), number of integrations needed, team capability for playbook development, budget for additional platform licensing, and whether SOAR must support non-security automation use cases.
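
Whichever platform is chosen, the automation scope concern in this ADR comes down to gating high-risk actions behind human approval. The sketch below shows that pattern in plain Python; it does not use any SOAR vendor's API, and the Action, Risk, and run_playbook names are hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable


class Risk(Enum):
    LOW = "low"    # enrichment, ticket updates, notifications
    HIGH = "high"  # containment with business impact, such as host isolation


@dataclass
class Action:
    name: str
    risk: Risk
    run: Callable[[], str]


@dataclass
class PlaybookResult:
    executed: list[str] = field(default_factory=list)
    pending_approval: list[str] = field(default_factory=list)


def run_playbook(actions: list[Action], approved: frozenset[str] = frozenset()) -> PlaybookResult:
    """Run low-risk actions automatically; hold high-risk ones for approval.

    approved contains the names of high-risk actions an analyst has
    explicitly signed off on for this incident.
    """
    result = PlaybookResult()
    for action in actions:
        if action.risk is Risk.HIGH and action.name not in approved:
            result.pending_approval.append(action.name)
            continue
        action.run()
        result.executed.append(action.name)
    return result


if __name__ == "__main__":
    steps = [
        Action("enrich_ip_reputation", Risk.LOW, lambda: "enriched"),
        Action("isolate_host", Risk.HIGH, lambda: "isolated"),
    ]
    print(run_playbook(steps))
    print(run_playbook(steps, approved=frozenset({"isolate_host"})))
```

Classifying each action's risk once, in the playbook definition, keeps the approval decision out of individual workflows and makes the automation scope reviewable as part of the ADR.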

ADR: Threat Intelligence Strategy

Context: Detection effectiveness depends on contextual threat intelligence — indicators of compromise, adversary TTPs, and industry-specific threat reporting. The organization must decide which intelligence sources to consume and how to operationalize them.

Options:

Commercial feeds only (Recorded Future, Mandiant, CrowdStrike): High-confidence indicators, curated industry reporting, finished intelligence products. Expensive but low operational overhead. Best for organizations without dedicated threat intelligence analysts.
Open-source feeds only (MISP community feeds, OTX, Abuse.ch, VirusTotal community): Free but requires significant curation effort to filter noise, score confidence, and age out stale indicators. High false positive risk without tuning. Best as a supplement rather than sole source.
Hybrid with TIP (MISP, ThreatConnect, Anomali): Aggregate commercial and open-source feeds into a threat intelligence platform that deduplicates, scores, and distributes indicators to SIEM and security tools. Highest operational maturity but requires dedicated TI analyst capacity.

Decision drivers: Security team size and TI expertise, budget for commercial feeds, industry vertical (financial services and defense have specialized threat actors requiring tailored intelligence), volume tolerance for IOC matching in SIEM, and whether the organization needs finished intelligence reports for executive communication.
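
The curation steps described for open-source and hybrid approaches (deduplicate, score, age out stale indicators) can be sketched as a small filter. This is not how any specific TIP implements it: the Indicator class, the 0-100 confidence scale, the 90-day maximum age, and the minimum confidence of 50 are all assumptions chosen for illustration. The sample values use documentation-reserved addresses and domains.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass(frozen=True)
class Indicator:
    value: str        # IP, domain, hash, and so on
    source: str       # feed that reported it
    confidence: int   # assumed 0-100 scale, set by the feed or an analyst
    last_seen: datetime


def curate(
    indicators: list[Indicator],
    now: datetime,
    max_age: timedelta = timedelta(days=90),
    min_confidence: int = 50,
) -> list[Indicator]:
    """Drop stale or low-confidence indicators and deduplicate by value.

    When several feeds report the same value, the highest-confidence
    report is kept.
    """
    best: dict[str, Indicator] = {}
    for indicator in indicators:
        if now - indicator.last_seen > max_age:
            continue  # aged out
        if indicator.confidence < min_confidence:
            continue  # too noisy to match on
        key = indicator.value.strip().lower()
        current = best.get(key)
        if current is None or indicator.confidence > current.confidence:
            best[key] = indicator
    return list(best.values())


if __name__ == "__main__":
    now = datetime(2024, 6, 1)
    feed = [
        Indicator("203.0.113.10", "commercial", 90, now - timedelta(days=1)),
        Indicator("203.0.113.10", "osint", 60, now - timedelta(days=2)),
        Indicator("old.example.com", "osint", 80, now - timedelta(days=200)),
        Indicator("noisy.example.net", "osint", 20, now - timedelta(days=1)),
    ]
    for indicator in curate(feed, now):
        print(indicator.value, indicator.source, indicator.confidence)
```

Filtering before distribution keeps the volume of IOC matches in the SIEM manageable, which is one of the decision drivers listed above.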
