Technical Preview of Apache Spark 2.0 Now on Databricks (May 11, 2016, by Reynold Xin in Engineering Blog): For the past few months, we have been busy contributing to the next major release of the big data open source software we...
New Content in Databricks Community Edition (April 12, 2016, by Ion Stoica in Engineering Blog): At the Spark Summit New York, we announced Databricks Community Edition (CE) beta. CE is a free version of the Databricks service...
The Unreasonable Effectiveness of Deep Learning on Apache Spark (April 1, 2016, by Miles Yucht and Reynold Xin in Engineering Blog): Update: this post is an April Fools' joke. It is not an actual project we're working on. For the past three years, our...
Apache Spark Trending in the Stack Overflow Survey (March 22, 2016, by Reynold Xin in Solutions): Last week, Stack Overflow released the result of their 2016 developer survey. This is one of the most significant surveys in the...
On-Time Flight Performance with GraphFrames for Apache Spark (March 16, 2016, by Joseph Bradley, Bill Chambers and Denny Lee in Engineering Blog): Introduction: Graph structures are a more intuitive approach to many classes of data problems. Whether traversing social networks, restaurant recommendations, or flight paths...