Understanding your Apache Spark Application Through Visualization
June 22, 2015 by Andrew Or in Engineering Blog
"The greatest value of a picture is when it forces us to notice what we never expected to see." - John Tukey In...
Announcing Apache Spark 1.4
June 11, 2015 by Patrick Wendell in Engineering Blog
Today I’m excited to announce the general availability of Apache Spark 1.4! Spark 1.4 introduces SparkR, an R API targeted towards data scientists...
Announcing SparkR: R on Apache Spark
June 9, 2015 by Shivaram Venkataraman in Engineering Blog
I am excited to announce that the upcoming Apache Spark 1.4 release will include SparkR, an R package that allows data scientists to...
Statistical and Mathematical Functions with DataFrames in Apache Spark
June 2, 2015 by Burak Yavuz and Reynold Xin in Engineering Blog
We introduced DataFrames in Apache Spark 1.3 to make Apache Spark much easier to use. Inspired by data frames in R and Python...
Project Tungsten: Bringing Apache Spark Closer to Bare Metal
April 28, 2015 by Reynold Xin and Josh Rosen in Engineering Blog
In a previous blog post, we looked back and surveyed performance improvements made to Apache Spark in the past year. In this...