Improvements to Kafka integration of Spark Streaming

Apache Kafka is rapidly becoming one of the most popular open source stream ingestion platforms. We see the same trend among the users...

Introducing Streaming k-means in Apache Spark 1.2

January 28, 2015 by Jeremy Freeman
Many real world data are acquired sequentially over time, whether messages from social media users, time series from wearable sensors, or — in...

Improved Fault-tolerance and Zero Data Loss in Apache Spark Streaming

January 15, 2015 by Tathagata Das
Real-time stream processing systems must be operational 24/7, which requires them to recover from all kinds of failures in the system. Since its...

Apache Spark as a platform for large-scale neuroscience

October 1, 2014 by Jeremy Freeman
The brain is the most complicated organ of the body, and probably one of the most complicated structures in the universe. Its millions...

Apache Spark 1.1: The State of Spark Streaming

September 16, 2014 by Tathagata Das and Patrick Wendell
With Apache Spark 1.1 recently released, we’d like to take this occasion to feature one of the most popular Spark components - Spark...

Announcing Apache Spark 1.1

September 11, 2014 by Patrick Wendell
Today we’re thrilled to announce the release of Apache Spark 1.1! Apache Spark 1.1 introduces many new features along with scale and stability improvements. This post will introduce some key features of Apache Spark 1.1 and provide context on the priorities of Spark for this and the next release.