Accelerating Corteva’s Data Source Onboarding With Lakeflow
Overview
| Experience | In Person |
|---|---|
| Track | Data Engineering & Streaming |
| Industry | Enterprise Technology, Healthcare & Life Sciences, Manufacturing |
| Technologies | Databricks SQL, Lakeflow, Unity Catalog |
| Skill Level | Intermediate |
Onboarding a new data source at Corteva used to take 30–45 days. Now it’s 4–7 days, about 85% faster. The Lakeflow-first strategy and Unity Catalog helped standardize the ingestion‑to‑consumption pathway across 6 domains. Lakeflow Connect, Spark Declarative Pipelines and Jobs make pipeline runs repeatable and config‑driven. A lightweight metadata layer centralizes data cleansing, and an incremental DDL tool pushes schema changes safely through each environment.This pattern now underpins Corteva’s operational analytics. It accelerates root‑cause analysis linking field variability to dryer‑bin stabilization, improves throughput across 35 seed sites, and strengthens seed quality. Today, the setup covers 5,000+ tables from 15+ sources, shortening time‑to‑insight and speeding partner onboarding. Attendees will leave with a reusable Lakeflow‑first ingestion pattern, a metadata‑driven data quality blueprint and a safe schema‑evolution workflow across environments.
Session Speakers
Mehul Bhuva
/Data and AI Platform Engineer
Corteva AgriScience
Harshit Rai
/Resident Solutions Architect
Databricks, Inc.