Machine Learning at Scale
In this course, you will gain theoretical and practical knowledge of Apache Spark’s architecture and its application to machine learning workloads within Databricks. You will learn when to use Spark for data preparation, model training, and deployment, while also gaining hands-on experience with Spark ML and pandas APIs on Spark. This course will introduce you to advanced concepts like hyperparameter tuning and scaling Optuna with Spark. This course will use features and concepts introduced in the associate course such as MLflow and Unity Catalog for comprehensive model packaging and governance.
The content was developed for participants with these skills/knowledge/abilities:
- A beginner-level understanding of Python.
- Basic understanding of DS/ML concepts (e.g. classification and regression models), common model metrics (e.g. F1-score), and Python libraries (e.g. scikit-learn and XGBoost).
Outline
Machine Learning Development with Spark
A Brief Overview of Spark Architecture for Machine Learning
Introduction to Spark ML for Model Development
Model Tracking and Packaging with MLflow and Unity Catalog on Databricks
Model Development with Spark
Distributed Model Tuning on Databricks
Overview of Hyperparameter Tuning
Scalable HPO Frameworks on Databricks
Optuna and Hyperopt with Spark ML
HPO with Ray Tune
Deploying Machine Learning Models with Spark
Deployment with Spark
Inference with Spark
Model Deployment with Spark
Optimization Strategies with Spark and Delta Lake
Model Deployment with Spark
Pandas on Spark
Scaling with Pandas APIs
Pandas UDFs and Function APIs
Pandas APIs
Upcoming Public Classes
Date | Time | Language | Price |
---|---|---|---|
May 26 | 09 AM - 01 PM (America/New_York) | English | $750.00 |
May 28 | 09 AM - 01 PM (Europe/London) | English | $750.00 |
May 30 | 09 AM - 01 PM (Asia/Kolkata) | English | $750.00 |
Jun 30 | 01 PM - 05 PM (Europe/London) | English | $750.00 |
Jul 02 | 01 PM - 05 PM (Asia/Kolkata) | English | $750.00 |
Jul 08 | 01 PM - 05 PM (America/New_York) | English | $750.00 |
Jul 28 | 09 AM - 01 PM (America/New_York) | English | $750.00 |
Jul 30 | 09 AM - 01 PM (Europe/London) | English | $750.00 |
Jul 31 | 09 AM - 01 PM (Asia/Kolkata) | English | $750.00 |
Aug 18 | 01 PM - 05 PM (Asia/Singapore) | English | $750.00 |
Aug 19 | 09 AM - 01 PM (Europe/London) | English | $750.00 |
Aug 21 | 09 AM - 01 PM (America/New_York) | English | $750.00 |
Public Class Registration
If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.
Private Class Request
If your company is interested in private training, please submit a request.
Registration options
Databricks has a delivery method for wherever you are on your learning journey
Self-Paced
Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos
Register nowInstructor-Led
Public and private courses taught by expert instructors across half-day to two-day courses
Register nowBlended Learning
Self-paced and weekly instructor-led sessions for every style of learner to optimize course completion and knowledge retention. Go to Subscriptions Catalog tab to purchase
Purchase nowSkills@Scale
Comprehensive training offering for large scale customers that includes learning elements for every style of learning. Inquire with your account executive for details