A Modern Approach to Dimensional Modeling - In a Columnar Database (repeat)


TRACKData Warehousing - Analytics and BI
INDUSTRYEnterprise Technology
TECHNOLOGIESSQL Analytics / BI / Visualizations
SKILL LEVELIntermediate

This session is repeated.


Is using dimensional modeling and star schemas, not a good architecture for data marts or data products in Databricks? Think again. Star schemas still make the best data model for your gold layer. The world is turning data-driven, and “everybody” is doing analytics. Many newcomers to analytics are led to think analytics is only about making data available. Data marts with so-called wide tables are popping up in gold layers, providing minimal analytical value, and are a nightmare when it comes to data governance and data quality. This session will give you an overview of why star schemas are the winning data model for your gold layer. The differences between relational and columnar databases mean that you can simplify some of the physical implementation of your star schemas. Therefore, we will also give you some design techniques that you should consider in Databricks and touch on some best practices for your physical implementation.


Truls Bergersen

Okeanos AS