Auto-scaling scikit-learn with Apache SparkFebruary 8, 2016 by Tim Hunter and Joseph Bradley in Engineering Blog Data scientists often spend hours or days tuning models to get the highest accuracy. This tuning typically involves running a large number of...
Faster Stateful Stream Processing in Apache Spark StreamingFebruary 1, 2016 by Tathagata Das and Shixiong Zhu in Engineering Blog Many complex stream processing pipelines must maintain state across a period of time. For example, if you are interested in understanding user behavior...
Deep Learning with Apache Spark and TensorFlowJanuary 25, 2016 by Tim Hunter in Engineering Blog Neural networks have seen spectacular progress during the last few years and they are now the state of the art in image recognition...
MLlib Highlights in Apache Spark 1.6January 21, 2016 by Joseph Bradley in Engineering Blog To learn more about Apache Spark, attend Spark Summit East in New York in Feb 2016 . With the latest release, Apache Spark’s...
Apache Spark 2015 Year In ReviewJanuary 5, 2016 by Reynold Xin, Matei Zaharia and Patrick Wendell in Solutions To learn more about Apache Spark, attend Spark Summit East in New York in Feb 2016 . 2015 has been a year of...