ServiceNow CMDB, Discovery, and Service Mapping¶

Scope¶

This file covers the ServiceNow data layer that nearly every other Now Platform process depends on: the Configuration Management Database (CMDB) itself, the Discovery product that populates most of it, and the Service Mapping product that builds application-level relationship maps on top of it. Topics: CI class strategy and the Common Service Data Model (CSDM), the Identification and Reconciliation Engine (IRE) and data-source priority, CI relationships (cmdb_rel_ci) including depends-on / runs-on / hosted-on semantics, domain separation for multi-tenant CMDBs, the CMDB Health dashboard (Completeness, Compliance, Correctness), MID Server architecture and placement, Discovery patterns (schedules, IP ranges, credentials, exploration vs classification phases), Cloud Discovery vs on-prem Discovery, Service Mapping approaches (top-down, machine learning, traffic-based), and multi-source data quality when Discovery, IntegrationHub, manual entry, and file imports all write to the same tables. For platform-level architecture decisions (instance topology, licensing, Now Assist), see providers/servicenow/itsm.md. For operational depth (SLA engine, Performance Analytics, automation placement), see providers/servicenow/itsm-operations.md. For an end-to-end pattern that exercises all of this, see patterns/vmware-servicenow-chargeback.md.

Checklist¶

Why This Matters¶

CMDB quality is the foundation of every other ITSM process. Incident routing depends on the affected CI having an accurate support_group. Change impact analysis depends on the CI's relationships traversing correctly upward into business services and downward into infrastructure. Cost allocation depends on the CI's owned_by, cost_center, and business-unit attributes being current and authoritative. Vulnerability management depends on the CI's install_status so that retired hosts do not appear as open findings. When any of these processes produces wrong results, the platform team's first question is whether the CMDB record is correct -- and the answer is almost always no, because nobody has set up Completeness / Compliance / Correctness monitoring and nobody has documented who writes which attribute. The "CMDB is the foundation" cliche is true precisely because every downstream failure traces back to a Discovery rule, an IRE identifier, a reconciliation priority, or a relationship type that was never thought through.

The Identification and Reconciliation Engine is the single most consequential design choice in the data layer. IRE decides whether a re-discovered CI updates the existing record or creates a new one, and reconciliation decides which data source wins when two sources disagree. Both defaults are too permissive for production: the OOB IRE identifiers are sensible starting points but rarely cover the exact attribute mix the customer's environment provides, and the OOB reconciliation behavior is essentially last-write-wins, which means a nightly Discovery run can silently overwrite a manually entered ownership value. Organizations that skip the IRE-and-reconciliation-design step accumulate duplicate CIs steadily, never quite enough to be visible in any single week but cumulatively enough that after a year the CMDB has 30 percent more rows than it has actual configuration items.

The MID Server is the architectural component most often under-designed. MID Servers are the only way ServiceNow reaches on-premises infrastructure -- for Discovery, for IntegrationHub spokes that talk to internal systems, for Orchestration runbooks. They run as services on Windows or Linux, poll the ServiceNow instance over outbound HTTPS for work, and execute it locally. Placement matters because every probe runs from the MID Server: a MID Server in the wrong network zone forces every probe across a firewall, multiplies latency, and requires credential propagation across security boundaries. Sizing matters because Discovery, Orchestration, and IntegrationHub all queue work on the same MID Server, and a saturated MID Server silently drops or delays work. HA matters because a single MID Server is a single point of failure for everything that needs to reach on-prem, which becomes obvious only during the first MID Server outage.

Service Mapping is the product that builds application-level CI relationship maps that Discovery alone cannot. Discovery produces an inventory of hosts, processes, and network connections; Service Mapping turns that inventory into "the Online Banking service runs on these load balancers, which front these web servers, which call these app servers, which read from these databases." The mapping is what makes incident impact analysis say "this database outage affects Online Banking" rather than "this database outage affects an unidentified host." Service Mapping is expensive to maintain at scale because applications change, deployment topologies change, and traffic patterns change; the right strategy is to map only the tier-1 services that drive incident-management priority, with explicit ownership of the maps. Attempting to map everything produces stale maps that nobody trusts.

Cloud Discovery is the area where the CMDB design most often falls behind the actual environment. Cloud resources are created and destroyed faster than nightly network-based Discovery can keep up with, and most cloud services (S3 buckets, SQS queues, IAM roles, Lambda functions) are not reachable by network probes at all. The IntegrationHub cloud spokes use cloud-provider APIs with read-only IAM roles to enumerate resources, which is both faster and more complete. Organizations that attempt to discover cloud with the same nightly network probes used for on-prem typically end up with a CMDB that is 60-70 percent complete for cloud resources, with managed services entirely absent, which makes cloud cost allocation and cloud security findings unreliable.

Common Decisions (ADR Triggers)¶

CSDM maturity target -- Level 1 (Foundation) gets identity, ownership, and basic infrastructure CIs in place; Level 2 (Foundation + Design) adds business applications, technical services, and the application-to-infrastructure linkage that incident-impact analysis depends on; Level 3 (Service Mapping) adds dynamic application-relationship discovery; Level 4 (Service Offerings) adds the customer-facing service catalog linkage that chargeback and service-level reporting depend on. Choose the target up front; trying to skip levels produces a CMDB that does not support the processes the customer expects.
CI class customization: OOB hierarchy vs custom classes -- The OOB cmdb_ci_* hierarchy is deep and covers most enterprise CI types. Custom CI classes break OOB Discovery patterns, Service Mapping patterns, IRE identifiers, and Performance Analytics indicators; they also create upgrade conflicts. Choose custom only when the OOB hierarchy genuinely cannot express the CI type; document the rationale and the upgrade-impact assessment.
IRE identifier strategy: OOB identifiers vs custom identifiers -- OOB identifiers are sensible defaults but rarely match the exact attribute mix the customer's data sources provide. Custom identifiers can be precise for the customer's environment but require deliberate priority-order design and become a CMDB-team-owned artifact. Most production CMDBs end up with a mix; document each custom identifier with its rationale and its expected data sources.
Reconciliation priority: per-attribute vs blanket per-source -- Per-attribute reconciliation is precise (Discovery wins on cpu_count, HR spoke wins on owned_by, Vendor spoke wins on support_group) but requires per-attribute configuration; blanket per-source reconciliation is coarser (Discovery always wins, or manual entry always wins) but is harder to keep correct as new data sources are added. Per-attribute is recommended for any CMDB with three or more active data sources.
Domain separation vs company-based segregation -- Domain separation is the OOB mechanism for true multi-tenant data isolation and is supported by domain-aware Business Rules, reports, and ACLs; it is also invasive, hard to undo, and complicates upgrades. Company-based segregation (the company field plus ACLs and reference qualifiers) is lighter-weight but is not actually data isolation -- a privileged user can see across companies. Choose domain separation only when regulatory or contractual isolation is required; choose company-based segregation when "soft" multi-tenancy is sufficient.
MID Server topology: per-zone vs per-purpose vs shared -- Per-zone MID Servers (one HA pair per network security zone) is the standard pattern and is required for Discovery to reach hosts without crossing firewalls. Per-purpose MID Servers (a Discovery pool, an Orchestration pool, an IntegrationHub pool) prevent a single MID Server from being saturated by one workload but multiply infrastructure cost. Shared MID Servers are simpler but expose the platform to noisy-neighbor problems. Per-zone with workload-specific clusters within zones is the typical production design.
Discovery cadence: single nightly vs segmented schedules -- A single nightly schedule is the OOB default and is the easiest to operate, but routinely overruns its window in environments with more than ~50,000 CIs. Segmented schedules (fast cadence for change-prone segments, slow cadence for stable segments, separate classification-only and full-exploration runs) keep high-change segments fresh and prevent the all-IPs schedule from saturating MID Servers. Segmented is recommended once the single nightly schedule begins overrunning.
Cloud Discovery: IntegrationHub cloud spokes vs network-based Discovery vs Service Graph Connectors -- IntegrationHub cloud spokes (AWS, Azure, GCP) use provider APIs with read-only IAM roles and are the recommended path for IaaS / PaaS / managed-service inventory. Network-based Discovery against cloud IP ranges misses managed services entirely and triggers cloud-provider security alerts. Service Graph Connectors (third-party CMDB-population tools that publish to ServiceNow) are an alternative for organizations that already operate a multi-cloud CMDB population tool. Spokes are the default; mix in Service Graph Connectors only when the spoke does not cover the resource type.
Service Mapping methodology: top-down patterns vs ML-driven vs traffic-based -- Top-down patterns (entry-point-driven traversal of network connections) are the most precise and the most expensive to maintain. Machine-learning-driven mapping (Service Mapping with ML) reduces the upfront pattern-writing effort but produces lower-precision maps. Traffic-based discovery via Service Graph Connectors for Dynatrace / AppDynamics / Splunk uses APM data as the relationship source, which is the highest-precision approach for organizations that already operate APM but requires the APM tool. Most organizations end up with a mix; document the choice per service.
CMDB Health remediation ownership -- CMDB Health Completeness / Compliance / Correctness gaps can be owned by the platform team (centralized remediation), by the CI-class owners (each CI class has an accountable team that fixes its own data), or split (platform team owns Compliance, CI owners own Completeness). The split model scales best but requires explicit handoff; the centralized model works at smaller scale; the per-class model is the most accurate but requires mature CI ownership.

Reference Links¶

The following links point to ServiceNow public documentation. ServiceNow gates a portion of its product documentation behind a customer login; verify the current URL and bundle name against the customer's release (Yokohama, Zurich, etc.) before relying on a specific page.

ServiceNow CMDB product page -- CMDB product positioning, CSDM, and related modules
Common Service Data Model (CSDM) -- CSDM four-layer model, application service vs technical service vs service offering
ServiceNow Discovery -- agentless discovery, MID Server, Patterns, Cloud Discovery
ServiceNow Service Mapping -- top-down service mapping, ML mapping, Service Graph Connectors
ServiceNow Developer Portal -- CMDB schema, IRE configuration, CI class extension, and Pattern Designer references
ServiceNow Product Documentation -- bundle-specific CMDB / Discovery / Service Mapping configuration (some content gated)