Apache Avro as a Built-in Data Source in Apache Spark 2.4November 30, 2018 by Gengliang Wang, Wenchen Fan and Michael Armbrust in Solutions Try this notebook in Databricks Apache Avro is a popular data serialization format. It is widely used in the Apache Spark and Apache...
Introducing Apache Spark 2.4November 8, 2018 by Wenchen Fan, Xiao Li and Reynold Xin in Engineering Blog UPDATED: 11/19/2018 We are excited to announce the availability of Apache Spark 2.4 on Databricks as part of the Databricks Runtime 5.0...
Building a Real-Time Attribution Pipeline with Databricks DeltaAugust 9, 2018 by Caryl Yuhas and Denny Lee in Platform Blog Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. In digital advertising, one...
Processing Petabytes of Data in Seconds with Databricks DeltaJuly 31, 2018 by Adrian Ionescu in Engineering Blog Introduction Databricks Delta Lake is a unified data management system that brings data reliability and fast analytics to cloud data lakes . In...
Simplify Advertising Analytics Click Prediction with Databricks Unified Analytics PlatformJuly 19, 2018 by Tony Cruz and Denny Lee in Platform Blog Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...