Skip to main content
Company Blog

The Data + AI Summit happening this week had its origins as the Spark + AI Summit, but we thought it was important to expand the conference due to the convergence of data topics we’ve been seeing in the community. The Summit now covers all things data -- from data science and data engineering to data analytics and machine learning.

This year, we’re lucky to be joined by some of the leading innovators in the field of machine learning, such as Rajat Monga (co-creator of TensorFlow), Soumith Chintala (co-creator of PyTorch), Clément Delangue (co-creator of Hugging Face Transformers NLP), Matei Zaharia (co-creator of MLflow), Manasi Vartak (creator of ModelDB) and more.

In addition to these well-known creators, there are many other amazing practitioners joining Data + AI Summit to share their knowledge. Here’s some of the 200+ sessions at Data + AI Summit.

29+ Talks on Deep Learning, Mlops, Nlp and Other Machine Learning Topics

Machine learning at large
Keynote: AI is Eating Software by Rajat Monga, co-creator of TensorFlow

Keynote: Using Mathematics to Address the Growing Distrust in Algorithms by Turing Award winner Shafi Goldwasser, CS Professor at MIT, UC Berkeley and Weizmann

A Vision for the Future of ML Frameworks by Soumith Chintala, co-creator of PyTorch and lead of PyTorch at Facebook AI

Meetup: Machine Learning Frameworks, Model Management and Ops by Clément Delangue (Hugging Face), Max Fisher (Databricks), Zhihao Jia (Facebook)

Keynote: AI for Intelligent Financial Services by Dr. Manuela Veloso (JP Morgan, Carnegie Mellon University)

Deep learning
Drug Repurposing using Deep Learning on Knowledge Graphs by Alexander Thomas (Wisecube) and Vishnu Vettrivel (Wisecube)

Object Detection with Transformers by Dr Liam Li (Determined AI)

Automated Background Removal Using PyTorch by Oleksander Miroshnychenko (GlobalLogic), Simona Stolnicu (Levi9)

ML operations (MLOps)
CI/CD in MLOps: Implementing a Framework for Self-Service Everything by Cara Phillips (Artis Consulting) and Wesly Clark (J.B. Hunt)

Why APM is Not the Same as ML Monitoring by Cory Johannsen (Verta)

The Function, the Context, and the Data - Enabling MLOps at Stitch Fix by Elijah Ben Izzy

Consolidating MLOps at One of Europe’s Biggest Airports by Floris Hoogenboom (Schipol) and Sebastiaan Grasdijk (Schipol)

Catch Me If You Can: Keeping Up With ML Models in Production by Shreya Shankar (former Viaduct, Google Brain, Stanford)

Machine Learning CI/CD for Email Attack Detection by Jeshua Bratman (Abnormal Security) and Justin Young (Abnormal Security)

Scale
Anomaly Detection at Scale! By Opher Dubrovsky (Nielsen) and Max Peres (Nielsen)

Intuitive & Scalable Hyperparameter Tuning with Apache Spark + Fugue by Han Wang (Lyft)

The Rise of Vector Data by Edo Liberty (Pinecone)

Model Monitoring at Scale with Apache Spark and Verta by Manasi Vartak (Verta)

Natural Language Processing (NLP)
Efficient Large-Scale Language Model Training on GPU Clusters by Deepak Narayanan (PhD Student @ Stanford)

Automatic ICD-10 Code Assignment to Consultations in healthcare by Joinal Ahmed (Halodoc) and Nirav Kumar (Halodoc)

NLP-Focused Applied ML at Scale for Global Fleet Analytics at ExxonMobil by Sumeet Trehan

Advanced Natural Language Processing with Apache Spark NLP by David Talby (John Snow Labs)

Conversational AI with Transformer Models by Rajesh Shreedhar Bhat (Walmart) and Dinesh Ladi (Walmart)

Recommendation Systems
Towards Personalization in Global Digital Health by Africa Perianez (benshi.ai)

Scaling Online ML Predictions at DoorDash by Hien Luu and Arbaz Khan

Offer Recommendation System with Apache Spark at Burger King by Luyang Wang and Kai Huang (Intel)

Recommender-Based Transformers by Denis Rothman

Building A Product Assortment Recommendation Engine by Ethan Dubois (Anheuser Busch) and Justin Morse (Anheuser Busch)

I hope you enjoy this selection of amazing talks on the past, present and future of machine learning. You’ll also want to be sure to attend the Apache Spark Data Science and Machine Learning keynotes on Thursday AM (PDT) where you’ll hear the latest announcements and product releases from Databricks and related open source projects.

See you in the Dev Hub at Summit!

Register today