-
Marketplace
-
Channel Resources
Articles from this Site
Information Builders to Extend WebFOCUS to Predictive Analysis
A Statistical Stocking Stuffer for the Holidays
What are your views on the advantages and/or disadvantages ETL tools and data modeling versus code?
New Interfaces Integrated into MEGA International Modeling Software
Which would be a better choice of classes for career growth in data warehousing - ETL architecture or dimensional modeling?
White Papers
Best Practices: Eight Tips for Improving Your Professional Services Business
Metadata Management for Enterprise Applications
UML for C#
PHP Code Design
Domain-Specific Modeling: 10x Faster than UML
Web Seminars
Modeling Unstructured Data
Creative Strategies for Achieving 24/7 Uptime
Closing the Loop: Real-Time Event Detection and Response
Learning from Others: Best Practices for Data Governance
Books
Data Mining Cookbook: Modeling Data for Marketing, Risk and Customer Relationship Management
Data Modeler's Workbench: Tools and Techniques for Analysis and Design
The Data Modeling Handbook: A Best-Practice Approach to Building Quality Data Models
Data Mining Using SAS Applications
Data Mining: Concepts, Models, Methods and Algorithms
Do you have any specific suggestions about the use of a primary key in fact tables?
Q: We are trying to find some theory about the benefits to have a Primary Key to identify records in a Fact Tables. We do not use it because we presume all the integrity issues are treated in ETL and the volume of Index will greatly increase the size of my DW. Do you have any specific suggestions about the use of a PK in Fact Tables? (I'm not talking about FK with dimensions.)
Chuck Kelley's Answer: You will find many different answers to this question. Personally, I would prefer to have the primary key defined on my Fact tables to make sure that the ETL is doing the job correctly. It is my opinion that the database should always be correct and I don't always trust the developers to get it right. I have seen the developers put in Distinct clauses to make them distinct before writing them out, thus dropping some data.
Having said that, I have created Fact tables without a primary key and only the FK to the dimensions defined with Indexes.
Chuck Kelley is an internationally known expert in database and data warehousing technology. He has 30 years of experience in designing and implementing operational/production systems and data warehouses. Kelley has worked in some facet of the design and implementation phase of more than 50 data warehouses and data marts. He also teaches seminars, co-authored four books on data warehousing and has been published in many trade magazines on database technology, data warehousing and enterprise data strategies. He can be contacted at chuckkelley@usa.net.
For more information on related topics, visit the following channels:


