Agentic Systems and Evaluation with Databricks AI

What you’ll learn

Databricks lets you build and deploy your AI Agent System for any use case, leveraging foundation LLMs, PDF extraction, Vector Search, and Mosaic AI Agent Evaluation.

In this demo, you'll learn how to:

  • Build and deploy your first tools, and save them as a Unity Catalog function

  • Create your first agent, leveraging these tools through LangChain

  • Evaluate your agent and build an evaluation loop to ensure new versions perform better on your dataset

  • Leverage MLflow 3.0 tracing, feedback, and scorers, including evaluation dataset generation

  • Package and deploy your chatbot as a Databricks application

  • Scan and extract information using Databricks' built-in document_ai_parse function

  • Monitor your live, production agent behavior with Databricks Monitoring

 

To run the demo, get a free Databricks workspace and execute the following two commands in a Python notebook:

Recommended

<p>Discover Databricks Agent Bricks: Auto Optimized Agents Using Your Data</p>

Product Tour

Discover Databricks Agent Bricks: Auto Optimized Agents Using Your Data

<p>Building Agents on Databricks Natively With Claude 3.7 Sonnet</p>

On-Demand Video

Building Agents on Databricks Natively With Claude 3.7 Sonnet

<p>Build a Multi-Agent GenAI System</p>

On-Demand Video

Build a Multi-Agent GenAI System

Disclaimer: This tutorial leverages features that are currently in private preview. Databricks Private Preview terms apply.
For more details, open the introduction notebook.

Ready to get started?