Engineering blog

Easily Clone your Delta Lake for Testing, Sharing, and ML Reproducibility

September 15, 2020 by Burak Yavuz and Pranav Anand in Engineering Blog
Introducing Clones: an efficient way to make copies of large datasets for testing, sharing, and reproducing ML experiments. We are excited to introduce...
Engineering blog

Enabling Spark SQL DDL and DML in Delta Lake on Apache Spark 3.0

Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Last week, we had...
Engineering blog

Time Traveling with Delta Lake: A Retrospective of the Last Year

June 18, 2020 by Burak Yavuz and Denny Lee in Engineering Blog
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Try out Delta Lake...
Company blog

Diving Into Delta Lake: Schema Enforcement & Evolution

September 24, 2019 by Burak Yavuz, Brenner Heintz and Denny Lee in Company Blog
Try this notebook series in Databricks. Data, like our experiences, is always evolving and accumulating. To keep up, our mental models of the...
Company blog

Diving Into Delta Lake: Unpacking The Transaction Log

The transaction log is key to understanding Delta Lake because it is the common thread that runs through many of its most important...
Company blog

Introducing Delta Time Travel for Large Scale Data Lakes

February 4, 2019 by Burak Yavuz and Prakash Chockalingam in Company Blog
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Data versioning for...
Engineering blog

Benchmarking Structured Streaming on Databricks Runtime Against State-of-the-Art Streaming Systems

October 11, 2017 by Burak Yavuz in Engineering Blog
Update Dec 14, 2017: As a result of a fix in the toolkit's data generator, Apache Flink's performance on a cluster of...
Engineering blog

Running Streaming Jobs Once a Day For 10x Cost Savings

This is the sixth post in a multi-part series about how you can perform complex streaming analytics using Apache Spark. Traditionally, when people...
Engineering blog

Working with Complex Data Formats with Structured Streaming in Apache Spark 2.1

In part 1 of this series on Structured Streaming blog posts, we demonstrated how easy it is to write an end-to-end streaming ETL...
Engineering blog

New Features in Machine Learning Pipelines in Apache Spark 1.4

Apache Spark 1.2 introduced Machine Learning (ML) Pipelines to facilitate the creation, tuning, and inspection of practical ML workflows. Spark’s latest release, Spark...
Company blog

Using 3rd Party Libraries in Databricks: Apache Spark Packages and Maven Libraries

July 28, 2015 by Burak Yavuz in Company Blog
In an earlier post, we described how you can easily integrate your favorite IDE with Databricks to speed up your application development. In...
Company blog

Making Databricks Better for Developers: IDE Integration

June 5, 2015 by Burak Yavuz in Company Blog
We have been working hard at Databricks to make our product more user-friendly for developers. Recently, we have added two new features that...
Engineering blog

Statistical and Mathematical Functions with DataFrames in Apache Spark

We introduced DataFrames in Apache Spark 1.3 to make Apache Spark much easier to use. Inspired by data frames in R and Python...
Engineering blog

Apache Spark 1.1: MLlib Performance Improvements

September 22, 2014 by Burak Yavuz in Engineering Blog
With an ever-growing community, Apache Spark has had its 1.1 release. MLlib has had its fair share of contributions and now supports...
Engineering blog

Statistics Functionality in Apache Spark 1.1

One of our philosophies in Apache Spark is to provide rich and friendly built-in libraries so that users can easily assemble data pipelines. With Spark, and MLlib in particular, quickly gaining traction among data scientists and machine learning practitioners, we're observing a growing demand for data analysis support outside of model fitting. To address this need, we have started to add scalable implementations of common statistical functions to facilitate...
Engineering blog

Scalable Collaborative Filtering with Apache Spark MLlib

July 23, 2014 by Burak Yavuz and Reynold Xin in Engineering Blog
Recommendation systems are among the most popular applications of machine learning. The idea is to predict whether a customer would like a certain item: a product, a movie, or a song. Scale is a key concern for recommendation systems, since computational complexity increases with the size of a company's customer base. In this blog post, we discuss how Apache Spark MLlib enables building recommendation models from billions of records in just a few lines of Python...