Data Modeling Strategies

This course walks practitioners through the full spectrum of data modelling approaches on the Databricks Data Intelligence Platform - from classical data warehouse techniques (Inmon, Kimball, Data Vault 2.0), through Feature Store-driven ML use cases, to productising data via Data Products on Unity Catalog.


Each modelling approach is introduced with a lecture, then reinforced with a hands-on demo against a shared dataset (TPC-H samples). The course finishes with a comprehensive end-to-end lab that exercises ERM, dimensional modelling, Data Vault 2.0, and the Feature Store in a single integrated workflow.


Note: Databricks Academy is transitioning to a notebook-based format for classroom sessions within the Databricks environment, discontinuing the use of slide decks for lectures. You can access the lecture notebooks in the Vocareum lab environment.


Languages Available: English | 日本語 | Português BR | 한국어

Skill Level
Associate
Duration
4h
Prerequisites

This course was developed for participants with the following skills, knowledge, and abilities:

• Working knowledge of SQL and relational database concepts

• Familiarity with Databricks fundamentals (workspaces, notebooks, Unity Catalog basics)

• Conceptual understanding of OLTP vs OLAP and the medallion architecture

• Basic exposure to Python and PySpark is helpful but not required

• Awareness of dimensional modelling concepts is helpful but not required

Outline

Data Warehouse Data Modelling

• Lakehouse Architecture Recap

• Data Warehouse Overview

• Inmon's Corporate Information Factory

• Demo: Entity Relationship Modelling and Constraints

• Kimball's Dimensional Modelling

• Demo: Dimensional Modelling and ETL

• Lab: Dimensional Modelling and ETL

• Data Vault 2.0

• Demo: Data Vault 2.0


Modern Data Architecture Use Cases

• Modern Gold-Layer Use Cases

• Combining Modelling Approaches

• Demo: Combining Modelling Approaches


Data Products

• Defining Data Products

• Summary and Next Steps


Comprehensive Lab

• Lab: Warehouse Modelling End-to-End

Upcoming Public Classes

Date | Time | Your Local Time | Language | Price
Jul 06 | 01 PM - 05 PM (Australia/Sydney) | - | English | $750.00
Jul 06 | 09 AM - 01 PM (America/New_York) | - | English | $750.00
Aug 21 | 09 AM - 01 PM (Asia/Singapore) | - | English | $750.00
Aug 21 | 09 AM - 01 PM (America/Los_Angeles) | - | English | $750.00
Sep 25 | 01 PM - 05 PM (Europe/Paris) | - | English | $750.00
Oct 23 | 01 PM - 05 PM (America/New_York) | - | English | $750.00
Oct 30 | 09 AM - 01 PM (Asia/Kolkata) | - | English | $750.00

Public Class Registration

If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.

Private Class Request

If your company is interested in private training, please submit a request.

Registration options

Databricks has a delivery method for wherever you are on your learning journey.

Self-Paced

Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos

Register now

Instructor-Led

Public and private courses taught by expert instructors, ranging from half a day to two days

Register now

Blended Learning

Self-paced and weekly instructor-led sessions for every style of learner to optimize course completion and knowledge retention. Go to the Subscriptions Catalog tab to purchase.

Purchase now

Skills@Scale

Comprehensive training offering for large-scale customers that includes learning elements for every style of learning. Inquire with your account executive for details.

Questions?

If you have any questions, please refer to our Frequently Asked Questions page.