Engineering blog

Introducing Databricks Ingest: Easy and Efficient Data Ingestion from Different Sources into Delta Lake

February 24, 2020 by Prakash Chockalingam in Engineering Blog
Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. We are excited to...
Engineering blog

Announcing the Delta Lake 0.3.0 Release

We are excited to...
Company blog

Getting Data Ready for Data Science: On-Demand Webinar and Q&A Now Available

On June 25th, our team hosted a live webinar — Getting Data Ready for Data Science — with Prakash Chockalingam, Product Manager at...
Company blog

Open Sourcing Delta Lake

April 24, 2019 by Prakash Chockalingam in Company Blog
Build reliable data lakes...
Company blog

Efficient Upserts into Data Lakes with Databricks Delta

Simplify building big data...
Company blog

Introducing Delta Time Travel for Large Scale Data Lakes

February 4, 2019 by Burak Yavuz and Prakash Chockalingam in Company Blog
Data versioning for...
Engineering blog

Apache Spark™ Clusters in Autopilot Mode

Apache Spark™ is a unified analytics engine that helps users use a single distributed computing framework for various use cases. With the advent...
Company blog

Introducing Databricks Optimized Autoscaling on Apache Spark™

Databricks is thrilled to announce our new optimized autoscaling feature. The new Apache Spark™-aware resource manager leverages Spark shuffle and executor statistics to...
Company blog

Transparent Autoscaling of Instance Storage

Big data workloads require access to disk space for a variety of operations, generally when intermediate results will not fit in memory. When...
Company blog

What AWS Per-Second Billing Means for Big Data Processing

November 6, 2017 by Prakash Chockalingam in Company Blog
Databricks, the Unified Analytics Platform, has always been a cloud-first platform. We believe in the scalability and elasticity of the cloud so that...
Company blog

Access Control for Databricks Jobs

Secure your production workloads end-to-end with Databricks’ comprehensive access control system. Databricks offers role-based access control for clusters and workspace to secure infrastructure...
Company blog

Continuous Integration & Continuous Delivery with Databricks

Continuous integration and continuous delivery (CI/CD) is a practice that enables an organization to rapidly iterate on software changes while maintaining stability, performance...
Company blog

Databricks Serverless: Next Generation Resource Management for Apache Spark

As the amount of data in an organization grows, more and more engineers, analysts and data scientists need to analyze this data using...
Company blog

Persistent Clusters: Simplifying Cluster Management for Analytics

Today we are excited to announce persistent clusters for analytics in Databricks. With persistent clusters, users no longer need to go through the...