Introduction to Python for Data Science and Data Engineering

This course is intended for complete beginners to Python and covers the basics of interacting with data programmatically. It begins with an introduction to expressions, variables, and data types, then moves on to conditional and control statements, followed by methods and functions. You will learn the basics of data structures, classes, and common string and utility functions. Finally, you will gain experience using the pandas library for data analysis and visualization, and you will learn the fundamentals of cloud computing. Throughout the course, you will get hands-on practice through lab exercises, and you will receive additional resources to deepen your programming knowledge after the class.

Skill Level

Associate

Duration

16h

Prerequisites

None

Outline

Day 1

Introduction to the Databricks environment
Python overview
Variables and data types
Complex data types
Control flow
Loops
Functions
Classes
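
To give a sense of the Day 1 topics, here is a minimal sketch that combines variables, a dictionary, control flow, a loop, functions, and a class. It is an illustration only, not course material: the names Reading, label, and summarize and the sample temperatures are hypothetical.

```python
class Reading:
    """A single temperature reading (hypothetical example data)."""

    def __init__(self, city, celsius):
        # Variables and basic data types: a string and a float.
        self.city = city
        self.celsius = celsius

    def fahrenheit(self):
        # A method that converts the stored value to Fahrenheit.
        return self.celsius * 9 / 5 + 32


def label(reading):
    # Control flow: pick a label based on the temperature.
    if reading.celsius >= 25:
        return "warm"
    elif reading.celsius >= 10:
        return "mild"
    return "cold"


def summarize(readings):
    # Complex data types and loops: count readings per label in a dict.
    counts = {}
    for reading in readings:
        key = label(reading)
        counts[key] = counts.get(key, 0) + 1
    return counts


# Made-up sample values, used only to exercise the functions above.
readings = [
    Reading("Paris", 12.5),
    Reading("Singapore", 31.0),
    Reading("London", 8.0),
]
summary = summarize(readings)
print(summary)
```

Each piece corresponds to one outline item, so the sketch can be read top to bottom alongside the list above.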

Day 2

Using libraries
Data analysis with pandas
Advanced methods in pandas
Data visualization
Cloud computing 101
Capstone and next steps

Upcoming Public Classes

Date	Time	Language	Price
Feb 24 - 27	02 PM - 06 PM (America/New_York)	English	$1500.00
Mar 24 - 25	09 AM - 05 PM (Europe/Paris)	English	$1500.00
Apr 07 - 10	02 PM - 06 PM (America/New_York)	English	$1500.00
Apr 21 - 22	09 AM - 05 PM (Europe/London)	English	$1500.00
Apr 27 - 30	11 AM - 03 PM (Asia/Singapore)	English	$1500.00

Public Class Registration

If your company has purchased success credits or has a learning subscription, please fill out the Training Request form. Otherwise, you can register below.

Customer registration | Partner registration

Private Class Request

If your company is interested in private training, please submit a request.

Request Private Training

See all our registration options

Registration options

Databricks has a delivery method for wherever you are on your learning journey

Self-Paced

Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos

Instructor-Led

Public and private courses taught by expert instructors, ranging from half-day to two-day formats

Blended Learning

Self-paced and weekly instructor-led sessions for every style of learner to optimize course completion and knowledge retention. Go to the Subscriptions Catalog tab to purchase.

Purchase now

Skills@Scale

Comprehensive training offering for large-scale customers that includes learning elements for every style of learning. Inquire with your account executive for details.

Upcoming Public Classes

Apache Spark Developer

Developing Applications with Apache Spark™

Master scalable data processing with Apache Spark in this hands-on course. Learn to build efficient ETL pipelines, perform advanced analytics, and optimize distributed data transformations using Spark’s DataFrame API. Explore grouping, aggregation, joins, set operations, and window functions. Work with complex data types like arrays, maps, and structs while applying best practices for performance optimization.

Languages Available: English | 日本語 | 한국어

SQL Analytics on Databricks

In this course, you'll learn how to effectively use Databricks for data analytics, with a specific focus on Databricks SQL. As a Databricks Data Analyst, your responsibilities will include finding relevant data, analyzing it for potential applications, and transforming it into formats that provide valuable business insights.

You will also understand your role in managing data objects and how to manipulate them within the Databricks Data Intelligence Platform, using tools such as Notebooks, the SQL Editor, and Databricks SQL.

Additionally, you will learn about the importance of Unity Catalog in managing data assets and the overall platform. Finally, the course will provide an overview of how Databricks facilitates performance optimization and teach you how to access Query Insights to understand the processes occurring behind the scenes when executing SQL analytics on Databricks.

Languages Available: English | 日本語 | Português BR | 한국어

Apache Spark Developer

Introduction to Apache Spark™

This course offers essential knowledge of Apache Spark, with a focus on its distributed architecture and practical applications for large-scale data processing. Participants will explore programming frameworks, learn the Spark DataFrame API, and develop skills for reading, writing, and transforming data using Python-based Spark workflows.

Languages Available: English | 日本語 | 한국어