-
Marketplace
-
Channel Resources
Articles from this Site
Netezza Enters Location Intelligence Market
Data Warehousing Meets Data Archiving in Information Lifecycle Management
The Role of Packaged BI Applications in Delivering Enterprise-Wide BI and DW
What are your views on the advantages and/or disadvantages ETL tools and data modeling versus code?
Where is the best place for a BI application to reside?
White Papers
Data Warehouses: What are they and how will they benefit your organization?
Advances in Data Warehouse Performance
Books
Resist the Urge to Start Coding
Kimball Perspectives
DM Review welcomes Ralph Kimball as a new columnist. The Kimball Perspectives series will systematically describe classic best practices as well as new trends in technologies. The most important first step in designing a data warehouse (DW)/business intelligence (BI) system, paradoxically, is to stop. Step back for a week, and be absolutely sure you have a sufficiently broad perspective on all the requirements that surround your project. The DW/BI design task is a daunting intellectual challenge, and it is not easy to step far enough back from the problem to protect yourself from embarrassing or career-threatening changes discovered after the project is underway. Before cutting any code, designing any tables or making a major hardware or software purchase, take a week to write down thoughtful, high-quality answers to the following 10 questions, each of which is a reality that will come to control your project at some point. These define the classic set of simultaneous constraints faced by every DW/BI effort.Classic Questions to Ask
- Business requirements. Are you in touch with the key performance indicators (KPIs) your end users actually need to make the decisions currently important to your enterprise? Although all 10 questions are important, understanding the business requirements is the most fundamental and far reaching. If you have a good answer to this question, then you can identify the data assets needed to support decision-making, and you will be able to decide which measurement process to tackle first.
- Strategic data profiling. Have you verified that your available data assets are capable of supporting the answers to question number one? The goal of strategic data profiling is to make go or no-go decisions very early in the DW project whether to proceed with a subject area.
- Tactical data profiling. Is there a clear executive mandate to support the necessary business process re-engineering required for an effective data quality culture, perhaps even driving for Six Sigma data quality? The only real way to improve data quality is to go back to the source and figure out why better data isnt being entered. Data entry clerks are not the cause of poor data quality! Rather, the fixes require an end-to-end awareness of the need for better quality data and a commitment from the highest levels to change how business processes work.
- Integration. Is there a clear executive mandate in your organization to define common descriptors and measures across all your customer-facing processes? All of the organizations within your enterprise who participate in data integration must come to agreement on key descriptors and measures. Have your executives made it clear that this must happen?
- Latency. Do you have a realistic set of requirements from end users for how quickly data must be published by the data warehouse, including as-of-yesterday, many-times-per-day and truly instantaneous?
- Compliance. Have you received clear guidance from senior management as to which data is compliance sensitive, and where you must guarantee that you have protected the chain of custody?
- Security. Do you know how you are going to protect confidential as well as proprietary data in the ETL back room, at the end-users desktops, over the Web and on all permanent media?
- Archiving. Do you have a realistic plan for very long-term archiving of important data, and do you know what data should be archived?
- Supporting end users. Have you profiled all your end-user communities to determine their abilities to use spreadsheets, construct database requests in ad hoc query tools or just view reports on their screens?
- IT licenses and skill sets. Are you prepared to rely on the major technology site licenses your organization has already committed to, and do you have enough staff with advanced skills to exploit the technical choices you make?
Ralph Kimball is the founder of the Kimball Group and Kimball University where he has taught data warehouse design to more than 10,000 students. He is known for the best selling series of data warehouse "Toolkit" books. He started with a Ph.D. in man-machine systems from Stanford in 1973 and has spent the last 34 years designing systems for end users that are simple and fast. You can reach him at ralph@kimballgroup.com.
For more information on related topics, visit the following channels:


