Apache Spark 2015 Year In ReviewJanuary 5, 2016 by Reynold Xin, Matei Zaharia and Patrick Wendell in Solutions To learn more about Apache Spark, attend Spark Summit East in New York in Feb 2016 . 2015 has been a year of...
Introducing Apache Spark DatasetsJanuary 4, 2016 by Michael Armbrust, Wenchen Fan, Reynold Xin and Matei Zaharia in Engineering Blog Developers have always loved Apache Spark for providing APIs that are simple yet powerful, a combination of traits that makes complex analysis possible...
Spark Survey 2015 Results are now availableSeptember 24, 2015 by Matei Zaharia, Patrick Wendell and Denny Lee in Announcements We ran the Spark Survey 2015 this summer to gain insights on how organizations are using Apache Spark. The results of this year’s...
Diving into Apache Spark Streaming's Execution ModelJuly 30, 2015 by Tathagata Das, Matei Zaharia and Patrick Wendell in Engineering Blog With so many distributed stream processing engines available, people often ask us about the unique benefits of Apache Spark Streaming . From early...
Databricks is now Generally AvailableJune 15, 2015 by Ion Stoica and Matei Zaharia in Product We are excited to announce today, at Spark Summit 2015 , the general availability of the Databricks – a hosted data platform from...
Deep Dive into Spark SQL's Catalyst OptimizerApril 13, 2015 by Michael Armbrust, Yin Huai, Cheng Liang, Reynold Xin and Matei Zaharia in Engineering Blog Check out the Why the Data Lakehouse is Your Next Data Warehouse ebook to discover the inner workings of the Databricks Lakehouse Platform...
Apache Spark Turns Five Years Old!March 31, 2015 by Matei Zaharia in Engineering Blog Today, we’re celebrating an important milestone for the Apache Spark project -- it’s now been five years since Spark was first open sourced...
Apache Spark: A review of 2014 and looking ahead to 2015 prioritiesFebruary 13, 2015 by Patrick Wendell and Matei Zaharia in Engineering Blog 2014 has been a year of tremendous growth for Apache Spark. It became the most active open source project in the Big Data...
"Learning Spark" book available from O'ReillyFebruary 9, 2015 by Holden Karau, Andy Konwinski, Patrick Wendell and Matei Zaharia in Company Blog Today we are happy to announce that the complete Learning Spark book is available from O’Reilly in e-book form with the print copy...
The State of Apache Spark in 2014July 18, 2014 by Matei Zaharia in Engineering Blog This post originally appeared in insideBIGDATA and is reposted here with permission. With the second Spark Summit behind us, we wanted to take...