Kimball Approach To Data Warehouse Lifecycle ✮ ❲ORIGINAL❳
This is where Kimball distinguishes itself from "big bang" Inmon approaches. A Kimball warehouse goes live in weeks or months, not years. Each iteration delivers concrete, queryable value. Phases: Program Management, Ongoing Support.
Simultaneously, the back room (ETL) and front room (BI) are developed in parallel. Kimball famously separates the (data staging area: messy, technical, high-volume) from the presentation area (dimensional models: clean, business-facing, accessible). The ETL system must handle slowly changing dimensions (SCDs)—tracking historical changes like a customer’s address over time—a signature Kimball contribution. Stage 3: Deployment & Iteration Phases: BI Application Development, Deployment, Maintenance & Growth.
Here, the famous Kimball dimensional model is created. A fact table is designed for a single business process (e.g., "Daily Sales Facts"). Dimensions are "conformed" so they can be used across multiple fact tables—ensuring that "Customer" means the same thing in Sales and Returns. kimball approach to data warehouse lifecycle
That methodology is the .
The final phase is often overlooked but crucial. Kimball insists on a that manages conformed dimensions, tracks business requirement changes, and oversees the growing bus matrix. Without this, the warehouse degrades into a set of isolated, inconsistent data marts—the very problem Kimball designed to solve. Why Kimball Wins in Practice 1. Understandability: Business users can read a star schema. They know that "Sales Amount" lives in the fact table and "Customer Name" lives in the customer dimension. Queries are simple joins. This is where Kimball distinguishes itself from "big
Unlike software applications with a clear "go-live" finish line, a Kimball data warehouse is built incrementally, evolves continuously, and remains tightly coupled to business value. The lifecycle is designed to prevent the most common cause of data warehouse failure: building what IT thinks is interesting, not what business users need to make decisions.
Adding a new data source or attribute? You often just add a row to a dimension or a column to a fact table. No massive schema redesign. Phases: Program Management, Ongoing Support
Another criticism: ETL for slowly changing dimensions can be complex. But this complexity is essential if you need to answer "What was the customer’s region at the time of that sale last year?" Kimball gives you a pattern; Inmon’s normalized approach often cannot answer that question without massive joins. Today, the Kimball lifecycle has been absorbed into almost every major data warehousing platform. Snowflake’s documentation? Full of star schema examples. dbt (data build tool)? Its core philosophy of modular, testable, SQL-based transformations is a direct expression of Kimball’s layered ETL approach. Even the term "conformed dimension" is standard vocabulary for any modern data engineer.
