Comparison of Bill Inmon and Ralph Kimball paradigm:
In the data warehousing field, we often hear about discussions on where a person / organization's philosophy falls into Bill Inmon's camp or into Ralph Kimball's camp. We describe below the difference between the two.
Bill Inmon's paradigm: Data warehouse is one part of the overall business intelligence system. An enterprise has one data warehouse, and data marts source their information from the data warehouse. In the data warehouse, information is stored in 3rd normal form.
Bill Inmon: Endorses a Top-Down design
Independent data marts cannot comprise an effective EDW. Organizations must focus on building EDW.
"You can catch all the minnows in the ocean and stack them together and they still do not make a whale,"
Bill Inmon, January 8, 1998.
Ralph Kimball's paradigm: Data warehouse is the conglomerate of all data marts within the enterprise . Information is always stored in the dimensional model.
Kimball model also proposes the data warehouse bus architecture. This architecture is comprised of:
- A staging area (which can have an E/R or relationally designed 3NF design or flat file format), which cannot be accessed by an end-user of the data warehouse bus (Presentation Layer) .
-The Data Warehouse Bus itself which includes several atomic data marts, several aggregated data marts and a personal
The Data Warehouse Bus:
- Is dimensional;
- Contains transaction and summary data;
- Includes data marts, which have single subject or fact tables; and
- Can consist of multiple data marts in a single data base.
There is no right or wrong between these two ideas, as they represent different data warehousing philosophies. In reality, the data warehouse in most enterprises are closer to Ralph Kimball's idea. This is because most data warehouses started out as a departmental effort, and hence they originated as a data mart. Only when more data marts are built later do they evolve into a data warehouse.
Ralph Kimball: Endorses a Bottom-Up design
EDW effectively grows up around many of the several independent data marts – such as for sales, inventory, or marketing
"...The data warehouse is nothing more than the union of all the data marts...,"
Ralph Kimball, December 29, 1997.
The main difference is Top-Down vs. Bottom-Up Approach.
Let us see the advantages & disadvantages:
Advantages of Top-Down Approach: [Bill Inmon Approach]:
- A truly corporate effort, an enterprise view of data
- Inherently architect ed- not a union of disparate DataMarts
- Central rules and control
- May be developed fast using iterative approach
- Takes longer to build even with iterative method
- High exposure/risk to failure
- Needs high level of cross functional skills
- High outlay without proof of concept
- Difficult to sell this approach to senior management and sponsors
- Faster and easier implementation of manageable pieces
- Favorable ROI and proof of concept
- Less risk of failure
- Inherently incremental; can schedule important DataMarts first
- Allows project team to learn and grow
- Each DataMart has its own narrow view of data
- Permeates redundant data in every DataMart
- Difficult to integrate if the overall requirements are not considered in the beginning
while gathering business requirements while developing/designing data marts using Bus Architecture to integrate in to Enterprise Data warehouse.
Vertical
- Don’t just rely on the business data analyst to determine requirements
- Inputs from senior managers about their vision, objectives, and challenges are critical
- Ignoring this vertical span might cause failure in understanding the organization’s direction and likely future trends
- Look horizontally across the departments before designing the Data Warehouse
- Critical in establishing the enterprise view i.e Bus Architecture
- Challenging to do if one particular department if funding the project. [Even one departments funds this project , but all the departments together become the company]
- Ignoring horizontal span will create isolated, department-centric databases that are inconsistent and can’t be integrated while considering for Enterprise Data Warehouse
- Complete coverage in a large organization is difficult , but CEO or CTO represents all of the departments. They should have the proper understanding of the full business and recommend what is good for whole company not just one department.
- One rep. from each dept. interacting with the core development team can be of immense help
- Creating a Business Decision Making group when there is a difference in the perspective from each department would help in resolve these kind of conflicts.
- Plan and define requirements at the overall corporate level
- Create a surrounding architecture for a complete warehouse
- Conform and standardize the data content
- Implement the Data Warehouse as a series of Supermarts, one at a time
SUPERMARTS
- Totally monolithic approach vs. totally stovepipe approach
- A step-by-step approach for building an Enterprise Data Warehouse from granular data
- A Supermart s a data mart that has been carefully built with a disciplined architectural framework
- A Supermart is naturally a complete subset of the Data Warehouse.
- A Supermart is based on the most granular data that can possible be collected and stored
- Conformed dimensions and standardized fact definitions
Operational Source System (OSS) To Directly transforms data to Enterprise Data Warehouse Let us call -- ETL Process (1) .
Enterprise Data warehouse to Data Marts
Let us call -- ETL Process (2).
Enterprise Data Warehouse
[Further comparison will be added often ]
0 comments:
Post a Comment