Get Started with Databricks for Machine Learning

In this course, you will develop the foundational skills needed to use the Databricks Data Intelligence Platform for executing basic machine learning workflows and supporting data science workloads. You will explore the platform from the perspective of a machine learning practitioner, covering topics such as feature engineering with Databricks Notebooks and model lifecycle tracking with MLflow. Additionally, you will learn about real-time model inference with Mosaic AI Model Serving and experience Databricks’ “glass box” approach to model development through AutoML. The course includes three instructor-led demonstrations, culminating in a comprehensive lab that reinforces the concepts covered in the demos.

Languages Available: English | 日本語 | Português BR | 한국어

Skill Level

Onboarding

Duration

Prerequisites

A beginner-level understanding of Python.
Basic understanding of DS/ML concepts (e.g. classification and regression models), common model metrics (e.g. F1-score), and Python libraries (e.g. scikit-learn and XGBoost).

Outline

Databricks Overview

Databricks Data Intelligence Platform
Demo: Databricks Workspace Walkthrough

Using Databricks for Machine Learning

Introduction to Machine Learning with Databricks
Exploratory Data Analysis (EDA) and Feature Engineering on Databricks
Demo: EDA and Feature Engineering
Introduction to MLflow on Databricks
Demo: Tracking and Managing Models with MLflow
Introduction to Mosaic AI AutoML
Demo: Experimentation with Mosaic AI AutoML
Introduction to Mosaic AI Model Serving
Demo: Getting Started with Mosaic AI Model Serving
Comprehensive Lab: Getting Started with Databricks for ML

Upcoming Public Classes

Date	Time	Language	Price
Date	Time	Language	Price	Dec 12	09 AM - 11 AM (America/Los_Angeles)	English	Free
Dec 15	03 PM - 05 PM (Europe/London)	English	Free
Jan 05	09 AM - 11 AM (America/Los_Angeles)	English	Free
Jan 14	12 PM - 02 PM (Asia/Singapore)	English	Free

Public Class Registration

If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.

Customer registration Partner registration

Private Class Request

If your company is interested in private training, please submit a request.

Request Private Training

See all our registration options

Registration options

Databricks has a delivery method for wherever you are on your learning journey

Self-Paced

Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos

Instructor-Led

Public and private courses taught by expert instructors across half-day to two-day courses

Blended Learning

Self-paced and weekly instructor-led sessions for every style of learner to optimize course completion and knowledge retention. Go to Subscriptions Catalog tab to purchase

Purchase now

Skills@Scale

Comprehensive training offering for large scale customers that includes learning elements for every style of learning. Inquire with your account executive for details

Upcoming Public Classes

Model Development at Scale

In this course, you will develop an in-depth understanding of how to design, implement, and govern scalable machine learning systems that operate effectively at enterprise scale. The curriculum is organized into three experiential modules: developing distributed ML workflows with frameworks such as Apache SparkML and Ray, transitioning local ML development to distributed compute using tools like Pandas on Spark, and operationalizing and governing production models with Databricks’ MLOps ecosystem.

Through hands-on projects, you will construct end-to-end distributed ML pipelines using the SparkML workflow, applying Transformers, Estimators, and the fit/transform paradigm for both classification and regression tasks. You will version, compare, and manage experiments using MLflow 3.0 to ensure reproducibility and governance, capturing lineage between data, features, and model artifacts. Additionally, you will apply scalable Hyperparameter Optimization frameworks to improve model performance at scale.

The course concludes by demonstrating complete lifecycle management, from experimentation to production deployment, using Unity Catalog and Model Serving. You will learn to operationalize trained models, monitor their performance, and implement strong governance over models, features, and Delta assets within the Databricks environment.

Free

Professional

Data Engineer

Get Started with Data Governance on Databricks - Japanese

このコースでは、ハンズオンデモとキャップストーンラボを使用して、Unity Catalog および Databricks でのきめ細かなアクセス制御について説明します。テーブルの種類、カタログとスキーマの構成、グループベースのアクセス管理、およびアクセス制御の移行戦略について学習します。このコースには、行レベルのセキュリティと列マスキングによるきめ細かなアクセス制御の適用、属性ベースのアクセス制御、制御の組み合わせ、制御の移行、および包括的なガバナンス実装のためのラボに関するデモが含まれています。

Free

Onboarding

Platform Administrator