Benchmarking Big Data SQL Platforms in the CloudJuly 12, 2017 by Juliusz Sompolski and Reynold Xin in Engineering Blog For a deeper dive on these benchmarks, watch the webinar featuring Reynold Xin. Performance is often a key factor in choosing big data...
Introducing Apache Spark 2.2July 11, 2017 by Michael Armbrust in Engineering Blog Today we are happy to announce the availability of Apache Spark 2.2.0 on Databricks as part of the Databricks Runtime 3.0. This release...
Declarative Infrastructure with the Jsonnet Templating LanguageJune 26, 2017 by Eric Liang and Aaron Davidson in Platform Blog This blog post is part of our series of internal engineering blogs on Databricks platform, infrastructure management, integration, tooling, monitoring, and provisioning. At...
Five Spark SQL Utility Functions to Extract and Explore Complex Data TypesJune 13, 2017 by Jules Damji in Engineering Blog Try this notebook on Databricks For developers, often the how is as important as the why . While our in-depth blog explains the...
A Vision for Making Deep Learning SimpleJune 6, 2017 by Sue Ann Hong, Tim Hunter and Reynold Xin in Engineering Blog Try this notebook on Databricks When MapReduce was introduced 15 years ago, it showed the world a glimpse into the future. For the...