Session

Beyond Batch: Engineering Self-Evolving Ingestion with Databricks Auto Loader

Overview

ExperienceIn Person
TrackData Engineering & Streaming
IndustryEnterprise Technology, Financial Services
TechnologiesDatabricks SQL, Delta Sharing, Unity Catalog
Skill LevelIntermediate

Is your data engineering team trapped in brittle schema fixes and rising costs? As data sources multiply, traditional ingestion stalls enterprise insights. At Capital One Software, we’ve moved beyond static ETL to a "self-evolving" framework that treats data as a dynamic stream. This session reveals our modular architecture using Databricks Auto Loader to bridge multi-platform data into S3 with zero manual overhead. We’ll dive into how Schema Evolution and Rescue Columns handle upstream drift automatically, keeping pipelines live when source systems change.

Key Takeaways:

  • Modular Design: Decouple ingestion from transformation for reusable, enterprise-scale patterns.
  • Dynamic Schema Management: Strategies to detect and adapt to drift without breaking dependencies.
  • Cost Optimization: Real-world tactics for balancing trigger intervals and compute for peak efficiency. Transition from reactive maintenance to automated data empowerment.

Session Speakers

Speaker placeholderIMAGE COMING SOON

Yudhish Batra

/Distinguished Engineer
Capital One

Speaker placeholderIMAGE COMING SOON

Syed Mehmood

/Director of Software Engineering & Data
Capital One