How to use SparkSession in Apache Spark 2.0August 15, 2016 by Jules Damji in Engineering Blog Generally, a session is an interaction between two or more entities. In computer parlance, its usage is prominent in the realm of networked...
Databricks Bi-Weekly Digest: 8/8/16August 8, 2016 by Jules Damji in Engineering Blog Continuing with our bi-weekly digest series, here’s our recap of what’s transpired over the last two weeks with Apache Spark since our previous...
Spark Structured StreamingJuly 28, 2016 by Matei Zaharia, Tathagata Das, Michael Lumb and Reynold Xin in Engineering Blog Apache Spark 2.0 adds the first version of a new higher-level API, Structured Streaming, for building continuous applications . The main goal is...
Continuous Applications: Evolving Streaming in Apache Spark 2.0July 28, 2016 by Matei Zaharia in Engineering Blog Since its release, Spark Streaming has become one of the most widely used distributed streaming engines, thanks to its high-level API and exactly-once...
Introducing Apache Spark 2.0July 26, 2016 by Reynold Xin, Michael Lumb and Matei Zaharia in Engineering Blog Today, we're excited to announce the general availability of Apache Spark 2.0 on Databricks. This release builds on what the community has learned...