Aaron is a Sr. Director of Data Engineering who has built multiple data lakes and Data Warehouses for major companies across the financial space. He has spent the last couple of years working passionately inside evolving technologies such as Lakehouse, Data mesh, and scalable data system. Aaron holds a master’s degree in information technology from University of Wisconsin.
May 27, 2021 03:50 PM PT
Privacy has become one of the most important critical topics in data today. It is more than how do we ingest and consume data but the important factors about how you protect your customer’s rights while balancing the business need. In our session, we will bring CTO, Privacera, Don Bosco Durai together with Northwestern Mutual to detail an important use case in privacy and then show how to scale Privacy with a focus on the business needs. We will make the ability to scale effortless.
April 24, 2019 05:00 PM PT
Life occurs in real-time, and not surprisingly, more solutions are being built using streaming technologies. Event-based architectures are becoming the norm, and customers are expecting immediate access to their data. This new world offers many exciting opportunities, but also some new challenges. What do you do when your streaming data is not complete? What if it relies on another data source? Does the dependent data exist yet, and does it come from a 3rd party? How do we merge a complete picture of data when data is sourcing from multiple places at the same time? A new norm in the world of distributed services.
Join us as we dive deep into the technical details around these scenarios and more. Expect to learn about stream-stream joins, enriching stream data using local or remote data, and ways to anticipate and correct errors within the stream. Leave with a better understanding of managing data dependencies within a Spark Structured Streaming application.
June 5, 2018 05:00 PM PT
A mobile application is only as good as our design and how customers use it. But how do they use it? We've got over 35 million devices running our mobile banking platform, and we need to understand each and every one of them. Is the customer enjoying their experience, are they lost, or are they a fraudulent hacker 3000 miles away?
We developed an algorithm to examine the user's workflow so we can perform near real-time analysis of their online activities. We leverage Spark's Structured Streaming, ML Pipelines & GraphFrames, and good old fashioned grit to gain insights that allow us to improve our mobile app. For good measure, we added fraud detection to the mix so we can use artificial intelligence to detect any strange or alarming patterns.
Session hashtag: #DevSAIS18