Data + AI Summit 2023
SAN FRANCISCO, JUNE 26-29
VIRTUAL, JUNE 28-29

Leveraging IoT Data at Scale to Mitigate Global Water Risks Using Apache Spark™ Streaming and Delta

Thursday, June 29 @1:30 PM

Overview

Every year, billions of dollars are lost to water risks from storms, floods, and droughts. Both scarcity and excess of water data are problems that risk models cannot overcome, creating a world of uncertainty. Divirod is building a water data platform by normalizing diverse data sources of varying velocity into one unified data asset. In addition to publicly available third-party datasets, we are rapidly deploying our own IoT sensors. These sensors feed about 100,000 messages per hour into preprocessing, signal-processing, analytics, and postprocessing workloads in a single Spark Streaming pipeline, enabling critical real-time decision-making. By adopting a streaming architecture, we reduced end-to-end latency from tens of minutes to just a few seconds.
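
To put the quoted rate in perspective, 100,000 messages per hour works out to roughly 28 messages per second. The four workloads named above can be pictured as stages chained over each message. The following is a minimal sketch in plain Python, not Spark code and not Divirod's implementation: the stage names come from the abstract, while the stage bodies, the field names (`sensor_id`, `raw_mm`, `level_m`, `alert`), and the alert threshold are hypothetical placeholders chosen only for illustration.

```python
from functools import reduce
from typing import Any, Callable, Dict, Iterable, Iterator, List

Message = Dict[str, Any]
Stage = Callable[[Message], Message]

MESSAGES_PER_HOUR = 100_000  # rate quoted in the abstract
ALERT_LEVEL_M = 2.0          # hypothetical threshold, for illustration only


def messages_per_second(per_hour: int) -> float:
    """Convert an hourly message rate to a per-second rate."""
    return per_hour / 3600.0


def preprocess(msg: Message) -> Message:
    # Hypothetical: convert a raw reading in millimeters to meters.
    return {**msg, "level_m": msg["raw_mm"] / 1000.0}


def signal_process(msg: Message) -> Message:
    # Hypothetical: clamp physically implausible negative levels to zero.
    return {**msg, "level_m": max(msg["level_m"], 0.0)}


def analytics(msg: Message) -> Message:
    # Hypothetical: flag readings above the alert threshold.
    return {**msg, "alert": msg["level_m"] > ALERT_LEVEL_M}


def postprocess(msg: Message) -> Message:
    # Keep only the fields a downstream consumer would need.
    return {
        "sensor_id": msg["sensor_id"],
        "level_m": round(msg["level_m"], 3),
        "alert": msg["alert"],
    }


STAGES: List[Stage] = [preprocess, signal_process, analytics, postprocess]


def run_pipeline(messages: Iterable[Message]) -> Iterator[Message]:
    """Apply every stage, in order, to each message as it arrives."""
    for msg in messages:
        yield reduce(lambda m, stage: stage(m), STAGES, msg)
```

Because `run_pipeline` is a generator, each message flows through all four stages as soon as it arrives instead of waiting for a batch to fill, which is the general idea behind the latency reduction described above.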

We use Delta Lake to provide a single query interface across multiple tables of this continuously changing data. This enables data science and analytics workloads to always use the most current and comprehensive information available. Beyond the necessary schema transformations, we implement data quality metrics and datum conversions to provide a trustworthy, unified dataset.
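
As an illustration of what a datum conversion and a data quality metric might look like, here is a minimal sketch in plain Python. It assumes that "datum" refers to a vertical reference level for water heights and that converting between datums at a given site amounts to adding a fixed offset. The datum names, offset values, record fields, and the choice of completeness as the metric are all hypothetical; the abstract does not say which datums or metrics Divirod uses.

```python
from typing import Dict, List, Optional

# Hypothetical offsets in meters: height of each datum's zero point
# above a common reference datum. Values are made up for illustration.
DATUM_OFFSET_M: Dict[str, float] = {
    "REF": 0.0,
    "LOCAL_A": 1.25,
    "LOCAL_B": -0.40,
}


def to_reference(level_m: float, datum: str) -> float:
    """Re-express a water level relative to the common reference datum."""
    return level_m + DATUM_OFFSET_M[datum]


def completeness(levels: List[Optional[float]]) -> float:
    """Fraction of readings that are present (not None); 0.0 if empty."""
    if not levels:
        return 0.0
    return sum(v is not None for v in levels) / len(levels)


def unify(records: List[dict]) -> List[dict]:
    """Drop missing readings and put the rest on the reference datum."""
    return [
        {
            "source": r["source"],
            "level_ref_m": to_reference(r["level_m"], r["datum"]),
        }
        for r in records
        if r["level_m"] is not None
    ]
```

With every source expressed against one reference, readings from different sensors and third-party datasets become directly comparable, and a per-source completeness score gives consumers a simple signal of how much to trust each feed.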


Type

  • Breakout

Experience

  • In Person

Track

  • Data Streaming

Industry

  • Enterprise Technology, Manufacturing, Public Sector

Difficulty

  • Intermediate

Duration

  • 40 min

Session Speakers

Heiko Udluft

Chief Technology Officer

Divirod, Inc.

Adam Wilson

Co-Founder, Chief of Product

Divirod, Inc.
