Skip to main content

Get Started with Lakehouse Architecture on Databricks

In this course, you will explore the Databricks Data Intelligence Platform from the perspective of platform architecture, specifically related to the platform foundation in lakehouse architecture. You will learn about the scope, vision, and capabilities of a lakehouse-based platform, examine how Databricks integrates with major cloud providers, and discover the key features of a successful lakehouse implementation through the well-architected lakehouse framework. The course also covers essential architectural principles, best practices, and data architecture strategies to help accelerate your organization’s data and AI initiatives using Databricks.


Languages Available: English | 日本語 Português BR | 한국어

Skill Level
Onboarding
Duration
2h
Prerequisites

The content was developed for participants with these skills/knowledge/abilities: 

• Familiarity with traditional data management architectures, specifically the distinctions between data warehouses and data lakes.

• Beginner familiarity with cloud computing concepts such as object storage (S3, ADLS, GCS) and cloud provider environments (AWS, Azure, GCP).

• Intermediate experience with SQL concepts, including ANSI SQL commands, views, and database management functionality.

• A basic understanding of data engineering principles and topics such as data collection, extraction, ingestion, and transformation.

• Basic understanding of data governance principles, including access control, data lineage, and auditing.

• Basic knowledge of artificial intelligence and machine learning workflows, including Generative AI concepts and MLOps.

• Understanding of core data team personas and their responsibilities, such as Data Engineers, Data Scientists, and Business Analysts.

• Familiarity with open data standards and file formats, such as Apache Parquet, Delta Lake, and Apache Iceberg.

• Prior experience in data platform solutions architecture or similar areas of study.

Outline

Databricks Overview
Databricks Infrastructure
Databricks Data Intelligence Platform
Unity Catalog Overview
Databricks Workspace Walkthrough

Exploring Lakehouse Architecture
The Scope of Lakehouse Architecture
The Modern Data and AI Platform
Exploring the Guiding Principles of the Lakehouse

Building a Well-Architected Lakehouse
Integrating Databricks with Cloud Provider Environments
A Well-Architected Lakehouse Framework
Data Architecture Strategies

Public Class Registration

If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.

Private Class Request

If your company is interested in private training, please submit a request.

See all our registration options

Registration options

Databricks has a delivery method for wherever you are on your learning journey

Runtime

Self-Paced

Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos

Register now

Instructors

Instructor-Led

Public and private courses taught by expert instructors across half-day to two-day courses

Register now

Learning

Blended Learning

Self-paced and weekly instructor-led sessions for every style of learner to optimize course completion and knowledge retention. Go to Subscriptions Catalog tab to purchase

Purchase now

Scale

Skills@Scale

Comprehensive training offering for large scale customers that includes learning elements for every style of learning. Inquire with your account executive for details

Upcoming Public Classes

Data Analyst

AI/BI for Data Analysts

This course teaches data analysts how to design, build, publish, and operate AI/BI Dashboards in Databricks. AI/BI Dashboards combine governed Unity Catalog data with interactive visualizations, filters, and Genie integration so business users can explore answers without writing code.

The course follows a single end-to-end build. You start with source tables in Unity Catalog and finish with a published, monitored multi-page dashboard. Along the way you learn how dashboards fit into the broader Databricks AI/BI product family and where Genie, datasets, visualizations, and filters each fit in the workflow.

The content covers:

• AI/BI Dashboard fundamentals and how they relate to Genie and the rest of the Databricks platform.

• Exploring source data in Unity Catalog and designing reusable dashboard datasets with SQL.

• Authoring visualizations (KPIs, trends, breakdowns) and laying out a clean multi-page dashboard.

• Using Genie Code to draft SQL, charts, and filters from natural language prompts.

• Adding filters to make dashboards interactive and responsive to viewer questions.

• Publishing, sharing, and managing permissions so the right people can view and edit the dashboard.

• Running the dashboard in production with scheduled refresh, caching, and usage monitoring.

Note: For SCORM lecture files, please ensure that you close the SCORM window after completing the content. Do not click the ‘Next Lesson’ button, as doing so may prevent the SCORM module from being marked as complete.

Languages Available: English | 日本語 | Português BR | 한국어 | Español | française

Free
2h
Associate

Questions?

If you have any questions, please refer to our Frequently Asked Questions page.