Skip to main content
<
Page 3
>
Engineering blog

Make Your Data Lakehouse Run, Faster With Delta Lake 1.1

Delta Lake 1.1 improves performance for merge operations, adds the support for generated columns and improves nested field resolution With the tremendous contributions...
Engineering blog

The Ubiquity of Delta Standalone: Java, Scala, Hive, Presto, Trino, Power BI, and More!

The Delta Standalone library is a single-node Java library that can be used to read from and write to Delta tables. Specifically, this...
Engineering blog

Creating a Faster TAR Extractor

January 26, 2022 by Christopher Denny in Engineering Blog
Tarballs are used industry-wide for packaging and distributing files, and this is no different at Databricks. Every day we launch millions of VMs...
Engineering blog

Extending Delta Sharing for Azure

We are excited for the release of Delta Sharing 0.3.0, which introduces several key improvements and bug fixes, including the following features: Delta...
Engineering blog

Log4j2 Vulnerability (CVE-2021-44228) Research and Assessment

December 23, 2021 by Fermin J. Serna in Engineering Blog
This blog relates to an ongoing investigation. We will update it with any significant updates, including detection rules to help people investigate potential...
Engineering blog

Scala at Scale at Databricks

December 3, 2021 by Li Haoyi in Engineering Blog
With hundreds of developers and millions of lines of code, Databricks is one of the largest Scala shops around. This post will be...
Engineering blog

The Foundation of Your Lakehouse Starts With Delta Lake

December 1, 2021 by Denny Lee and Vini Jaiswal in Engineering Blog
It’s been an exciting last few years with the Delta Lake project. The release of Delta Lake 1.0 as announced by Michael Armbrust...
Engineering blog

Turning 2 Trillion Data Points of Traffic Intelligence into Critical Business Insights

November 3, 2021 by Stephanie Mak in Engineering Blog
This is a guest authored post by Stephanie Mak , Senior Data Engineer, formerly at Intelematics. This blog post offers my experience of...
Engineering blog

Introducing Apache Spark™ 3.2

We are excited to announce the availability of Apache Spark™ 3.2 on Databricks as part of Databricks Runtime 10.0 . We want to...
Engineering blog

Native Support of Session Window in Spark Structured Streaming

Apache Spark™ Structured Streaming allowed users to do aggregations on windows over event-time . Before Apache Spark 3.2™, Spark supported tumbling windows and...