Engineering blog

Project Lightspeed Update - Advancing Apache Spark Structured Streaming

In this blog post, we will review the advancements in Spark Structured Streaming since we announced Project Lightspeed a year ago, from performance...
Platform blog

Announcing the Public Preview of Predictive I/O for Updates

Previously, we’ve shown you how a new technology called Predictive I/O could improve selective reads by up to 35x for CDW customers without...
Company blog

Real-Time Insights: The Top Three Reasons Why Customers Love Data Streaming with Databricks

The world operates in real-time. The ability to make real-time decisions in today's fast-paced world is more critical than ever before. Today's...
Platform blog

Why We Migrated From Apache Airflow to Databricks Workflows at YipitData

December 7, 2022 by Hillevi Crognale and Frank Munz in Platform Blog
This is a collaborative post from Databricks and YipitData. We thank Engineering Manager Hillevi Crognale at YipitData for her contributions. YipitData is the...
Company blog

Databricks at Current 2022

Current 2022, organized by Confluent, is the first-ever data streaming industry event – and it's coming up soon! No matter where you...
Platform blog

Low-latency Streaming Data Pipelines with Delta Live Tables and Apache Kafka

August 9, 2022 by Frank Munz in Product
Delta Live Tables (DLT) is the first ETL framework that uses a simple declarative approach for creating reliable data pipelines and fully manages...
Platform blog

Introducing Databricks Workflows

Today we are excited to introduce Databricks Workflows, the fully managed orchestration service that is deeply integrated with the Databricks Lakehouse Platform. Workflows...
Engineering blog

How We Built Databricks on Google Kubernetes Engine (GKE)

August 6, 2021 by Frank Munz and Li Gao in Engineering Blog
Our release of Databricks on Google Cloud Platform (GCP) was a major milestone toward a unified data, analytics and AI platform that is...