
Building Single-Agent Applications on Databricks

This course provides hands-on training for building single-agent applications on the Databricks Data Intelligence Platform. Students will learn to create AI agents that leverage Unity Catalog functions as tools, implement comprehensive tracing and monitoring with MLflow, and deploy agents using both traditional frameworks like LangChain and modern solutions like Agent Bricks. The course covers the complete agent lifecycle from initial tool creation and testing in AI Playground through production deployment with governance, evaluation, and continuous improvement capabilities.
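
To make this concrete, here is a minimal sketch (illustrative only, not course material) of the pattern the course teaches: a Unity Catalog function exposed as a LangChain tool, with MLflow tracing enabled. The serving endpoint and function names below are assumptions, not values from the course.

    import mlflow
    from databricks_langchain import ChatDatabricks, UCFunctionToolkit

    mlflow.langchain.autolog()  # log a trace for every model and tool invocation

    # Hypothetical serving endpoint and Unity Catalog function names.
    llm = ChatDatabricks(endpoint="databricks-meta-llama-3-3-70b-instruct")
    toolkit = UCFunctionToolkit(function_names=["main.default.lookup_order_status"])

    # Bind the UC-backed tools so the model can decide to call them; a full
    # agent loop (e.g., LangGraph) would then execute the calls it requests.
    llm_with_tools = llm.bind_tools(toolkit.tools)
    msg = llm_with_tools.invoke("What is the status of order 1234?")
    print(msg.tool_calls)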


Note: This is the second course in the 'Generative AI Engineering with Databricks' series. It was previously named 'Generative AI Application Development'.

Skill Level
Associate
Duration
3h
Prerequisites

The content was developed for participants with the following skills, knowledge, and abilities:

1. Python-Specific Prerequisites

Learners must be comfortable writing production-quality Python, not just one-off scripts; the short sketch after this list illustrates the expected level.

• Core Python syntax and data structures

• Functions, classes, and basic OOP patterns

• Exception handling and error propagation

• Decorators

• Type hints and docstrings
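
The following sketch (illustrative only, not course code) combines the items above — type hints, a docstring, a decorator, and explicit exception handling — in one small unit:

    import functools
    import logging

    logger = logging.getLogger(__name__)

    def log_errors(func):
        """Decorator that logs and re-raises any exception from the wrapped call."""
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            try:
                return func(*args, **kwargs)
            except Exception:
                logger.exception("Error in %s", func.__name__)
                raise
        return wrapper

    @log_errors
    def average(values: list[float]) -> float:
        """Return the arithmetic mean of a non-empty list of numbers."""
        if not values:
            raise ValueError("values must be non-empty")
        return sum(values) / len(values)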


2. SQL-Specific Prerequisites

Learners must be able to define reusable SQL logic, not just query tables; a short sketch follows this list.

• Writing SELECT queries with filters and aggregations

• Understanding SQL data types and NULL handling

• Creating parameterized SQL functions

• Using CREATE OR REPLACE FUNCTION syntax

• Writing clear SQL comments for documentation
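
The following sketch (illustrative only) shows the assumed level, registering a parameterized, documented SQL function from a notebook. The catalog, schema, and function names are hypothetical; `spark` is the SparkSession that Databricks notebooks provide.

    spark.sql("""
        CREATE OR REPLACE FUNCTION main.default.order_total(
            price DOUBLE COMMENT 'Unit price',
            quantity INT COMMENT 'Units ordered'
        )
        RETURNS DOUBLE
        COMMENT 'Order line total; treats NULL price or quantity as zero.'
        RETURN COALESCE(price, 0) * COALESCE(quantity, 0)
    """)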


3. Databricks-Specific Prerequisites

Learners must be comfortable operating inside the Databricks platform.

• Navigating the Databricks workspace and notebooks

• Running notebook cells and interpreting outputs

• Understanding basic compute concepts (especially serverless)

• Using Catalog Explorer to inspect registered assets

• Awareness of Databricks-managed services (Model Serving, AI Playground)


4. GenAI / Agent-Specific Prerequisites

Learners must understand how LLM-powered agents behave, even though agent frameworks themselves are taught in the course; a minimal REST example follows this list.

• What Large Language Models are and what they can and cannot do

• Basic prompt engineering concepts

• High-level understanding of Retrieval-Augmented Generation (RAG)

• Conceptual understanding of agent reasoning and tool invocation

• Familiarity with REST APIs and JSON payloads
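
The following sketch (illustrative only) shows the REST and JSON familiarity assumed. The workspace host, token, and endpoint name are placeholders, not real values.

    import requests

    resp = requests.post(
        "https://<workspace-host>/serving-endpoints/<endpoint-name>/invocations",
        headers={"Authorization": "Bearer <token>"},
        json={"messages": [{"role": "user", "content": "Hello"}]},
        timeout=30,
    )
    resp.raise_for_status()
    print(resp.json())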


5. Optional but Helpful (Not Required)

• MLflow fundamentals (tracking, model registry, tracing)

• Agent frameworks (e.g., LangChain)


6. Recommended Databricks training: AI Agents Fundamentals, Get Started with AI Agents


Registration options

Databricks has a delivery method for wherever you are on your learning journey

Self-Paced

Custom-fit learning paths for data, analytics, and AI roles and career paths through on-demand videos

Register now

Instructor-Led

Public and private courses taught by expert instructors across half-day to two-day courses

Register now

Blended Learning

Self-paced content plus weekly instructor-led sessions for every style of learner, to optimize course completion and knowledge retention. Go to the Subscriptions Catalog tab to purchase.

Purchase now

Skills@Scale

Comprehensive training offering for large-scale customers that includes learning elements for every style of learner. Inquire with your account executive for details.

Upcoming Public Classes

Data Engineer

DevOps Essentials for Data Engineering

This course explores software engineering best practices and DevOps principles, specifically designed for data engineers working with Databricks. Participants will build a strong foundation in key topics such as code quality, version control, documentation, and testing. The course emphasizes DevOps, covering core components, benefits, and the role of continuous integration and delivery (CI/CD) in optimizing data engineering workflows.

You will learn how to apply modularity principles in PySpark to create reusable components and structure code efficiently. Hands-on experience includes designing and implementing unit tests for PySpark functions using the pytest framework, followed by integration testing for Databricks data pipelines with Spark Declarative Pipelines and Jobs to ensure reliability.
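
As a taste of the testing portion, here is a minimal sketch (not course code) of a pytest unit test for a small PySpark function; the function and column names are hypothetical.

    import pytest
    from pyspark.sql import DataFrame, SparkSession
    from pyspark.sql import functions as F

    def add_total(df: DataFrame) -> DataFrame:
        """Add a `total` column computed from price * quantity."""
        return df.withColumn("total", F.col("price") * F.col("quantity"))

    @pytest.fixture(scope="session")
    def spark():
        # A small local session is enough for unit-testing transformations.
        return SparkSession.builder.master("local[1]").getOrCreate()

    def test_add_total(spark):
        df = spark.createDataFrame([(2.0, 3)], ["price", "quantity"])
        result = add_total(df).collect()[0]
        assert result["total"] == 6.0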

The course also covers essential Git operations within Databricks, including using Databricks Git Folders to integrate continuous integration practices. Finally, you will take a high-level look at various deployment methods for Databricks assets, such as the REST API, CLI, SDK, and Databricks Asset Bundles (DABs), providing you with techniques to deploy and manage your pipelines.
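
As one illustration of the SDK path (the other deployment methods follow the same spirit), here is a hedged sketch using the Databricks Python SDK; the job name and notebook path are hypothetical.

    from databricks.sdk import WorkspaceClient
    from databricks.sdk.service import jobs

    # Reads credentials from the environment or ~/.databrickscfg.
    w = WorkspaceClient()

    job = w.jobs.create(
        name="nightly-etl",  # hypothetical job name
        tasks=[
            jobs.Task(
                task_key="ingest",
                notebook_task=jobs.NotebookTask(
                    notebook_path="/Workspace/etl/ingest"  # hypothetical path
                ),
            )
        ],
    )
    print(f"Created job {job.job_id}")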

By the end of the course, you will be proficient in software engineering and DevOps best practices, enabling you to build scalable, maintainable, and efficient data engineering solutions.

Note: 

1. This is the fourth course in the 'Data Engineering with Databricks' series.

2. Databricks Academy is transitioning from video lectures to a more streamlined PDF format with slides and notes for all self-paced courses. Please note that demo videos will still be available in their original format. We would love to hear your thoughts on this change, so please share your feedback through the course survey at the end. Thank you for being a part of our learning community!

Languages Available: English | 日本語 | Português BR | 한국어

Paid & Subscription
3h
Lab
Associate

Questions?

If you have any questions, please refer to our Frequently Asked Questions page.