Approximate Algorithms in Apache Spark: HyperLogLog and QuantilesMay 19, 2016 by Tim Hunter, Hossein Falaki and Joseph Bradley in Solutions Introduction Apache Spark is fast, but applications such as preliminary data exploration need to be even faster and are willing to sacrifice some...
Apache Spark MLlib: From Quick Start to Scikit-LearnMay 18, 2016 by Wayne Chan in Company Blog A few months ago, we held a live webinar – Apache Spark MLlib: From Quick Start to Scikit-Learn – to give a quick...
Introducing the Spark Live 2016 TourMay 4, 2016 by Wayne Chan in Company Blog You’ve asked and we’re making it happen - Databricks, the company founded by the team that created Apache Spark, is hitting the road...
New eBook Released: Mastering Advanced Analytics with Apache SparkApril 27, 2016 by Dave Wang in Company Blog We are excited to announce that the second eBook in our technical blog book series, Mastering Advanced Analytics with Apache Spark , has...
GraphFrames On-Demand Webinar and FAQApril 21, 2016 by Dave Wang in Company Blog Last week, we held a live webinar – GraphFrames: DataFrame-based graphs for Apache Spark – to give an overview, a live demo, and...
New Content in Databricks Community EditionApril 12, 2016 by Ion Stoica in Engineering Blog At the Spark Summit New York , we announced Databricks Community Edition (CE) beta. CE is a free version of the Databricks service...
How DNV GL Uses Databricks to Build Tomorrow’s Energy GridApril 7, 2016 by Dave Wang in Energy We are proud to announce that DNV GL , a provider of software and independent expert advisory services to the maritime, oil &...
Agenda Announced for #SparkSummit 2016 in San FranciscoApril 4, 2016 by Scott Walent in Announcements San Francisco, as a cosmopolitan metropolis, has its draw not only to artists and tourists but engineers and high-tech entrepreneurs. So, get ready...
The Unreasonable Effectiveness of Deep Learning on Apache SparkApril 1, 2016 by Miles Yucht and Reynold Xin in Engineering Blog Update: this post is an April Fools joke. It is not an actual project we're working on. For the past three years, our...
Introducing our new eBook: Apache Spark Analytics Made SimpleMarch 31, 2016 by Wayne Chan and Dave Wang in Product Apache Spark ™ has rapidly emerged as the de facto standard for big data processing and data sciences across all industries. The use...