Inmon


Relevant books:

  • Building the Data Warehouse, 4th edition (Wiley) by W.H.Inmon

  • Corporate Information Factory

  • The Unified Star Schema (Technics Publications) by W. H. (Bill) Inmon


Inmon Data Modeling (Corporate Information Factory)

Bill Inmon’s “Top-Down” Data Warehouse Architecture

Bill Inmon—often called the father of the data warehouse—defined one of the earliest and most influential approaches to enterprise analytics architecture. His method is typically summarized as top-down, enterprise-wide, and normalized.

At the time (late 1980s), organizations were querying production OLTP systems directly, slowing them down. Inmon’s data warehouse solved this by separating analytical workloads from operational systems.


Inmon’s Definition of a Data Warehouse

A data warehouse is:

Subject-oriented, integrated, nonvolatile, and time-variant data that supports management decision-making.

Let’s break down each term and its modeling implications.


1. Subject-Oriented

  • The warehouse is built around business subjects (e.g., Sales, Finance, Marketing—not around applications like SAP, Shopify, or HubSpot).

  • Logical models focus on one subject domain at a time.

  • Each domain contains business keys, attributes, and relationships across the enterprise.

Implication: Your data model must capture all enterprise facts about a single business area, not just data from one application.


2. Integrated

This is the most important Inmon principle.

  • Data is ingested from multiple disparate source systems.

  • During ETL, it is:

    • standardized

    • conformed

    • cleaned

    • deduplicated

    • sequenced and normalized

When it lands in the warehouse, it has a single unified corporate representation.

Integration creates the “Single Source of Truth.”

Implication: All data must be aligned (domains, data types, business keys). The warehouse becomes the closest thing to a unified enterprise data model.


3. Nonvolatile

  • Data, once stored, does not change (except for appending new history).

  • No transactional updates, deletes, or overwrites.

Implication: Design models expecting immutable data:

  • Surrogate keys

  • Audit fields

  • Load dates

  • Business-effective dates

  • Snapshot tables if needed

You build a stable layer where older facts remain intact for long-term analytics.


4. Time-Variant

  • Every record includes time context (load time, effective time).

  • Queries can span long historical ranges.

Implication: Modeling must include:

  • slowly changing dimensions (SCDs)

  • historical fact tables

  • lineage tracking

  • temporal data structures

The warehouse becomes a place to ask questions about the past, not just the present.


What the Inmon Warehouse Looks Like

Core Characteristics

  • Built using 3rd Normal Form (3NF) entity-relationship modeling.

  • Highly normalized → minimal redundancy → maximum consistency.

  • Mirrors business truth, often resembling normalized OLTP systems in structure (but not purpose).

  • Data arrives via heavy ETL: standardization, validation, integration.


Architecture: Top-Down

Below is a conceptual diagram showing how data flows in a classic Inmon (top-down) architecture:

This is the classic Inmon flow: Sources → ETL → 3NF EDW → ETL → Star Schemas → BI.

Key points:

  • EDW is the authoritative source for analytics.

  • Data marts depend on the EDW, not on operational systems.

  • Data marts may be:

    • star schema (common)

    • snowflake schema

    • or any structure optimized for consumption.


Example: E-commerce Business

Source systems:

  • Orders

  • Inventory

  • Marketing

These feed into the Enterprise Data Warehouse, where they become:

  • normalized

  • integrated

  • historical

  • immutable

Once in the EDW, downstream ETL builds subject-specific data marts:

  • Sales data mart

  • Marketing data mart

  • Purchasing data mart

Each may use a star schema for performance.


Why Inmon?

Pros

  • Strong enterprise consistency (integration).

  • High data quality.

  • Long-term historical accuracy.

  • Excellent for large organizations with complex domains.

Cons

  • Slowest to implement (years in large companies).

  • Heavy ETL maintenance.

  • Requires strict data governance.

  • Analysts often don’t like normalized structures for querying.


How It Compares to Kimball

  • Inmon: Top-down → EDW (3NF) → Data marts → Reports Enterprise first.

  • Kimball: Bottom-up → Data marts (star schemas) → Virtual EDW Analytics first.

Modern architectures often blend both.


Summary of Inmon Modeling

  • Build a centralized, enterprise-wide data warehouse.

  • Store granular, normalized (3NF) data.

  • Maintain immutable, time-variant history.

  • Use data marts for department-specific access.

  • Prioritize integration consistency above all else.



Last updated