HomepageData + AI Summit 2022 Logo
Watch on demand

Data-Centric Principles for AI Engineering

On Demand

Type

  • Session

Format

  • In-Person

Track

  • Data Science, Machine Learning and MLOps

Room

  • Moscone South | Upper Mezzanine | 152

Duration

  • 35 min
Download session slides

Overview

While some AI problems can be solved with end-to-end deep learning models that go from raw inputs to outputs, practitioners (including our customers!) find that such "mega models" are, on their own, not enough to build production-ready AI applications. In practice, it’s critical that AI engineers can inspect, test, and refactor the modular components of their applications, as they would with any piece of infrastructure or software.

In this talk, we’ll introduce a data-centric approach to AI engineering that highlights the advantages of modular components, fine-grained evaluation, and rapid iteration through programmatic labeling. We'll discuss the practical trade-offs of incrementally building and testing pipelines composed of models, preprocessing steps, and business logic. Along the way, we’ll share examples of these principles in practice through real-world case studies.

Session Speakers

Vincent Chen

Head of ML Engineering / Founding Engineer

Snorkel AI

See the best of Data+AI Summit

Watch on demand