Processing Data in Apache Kafka with Structured Streaming in Apache Spark 2.2. April 26, 2017, by Kunal Khamar, Tyson Condie and Michael Armbrust in Engineering Blog. This is the third post in a multi-part series about how you can perform complex streaming analytics using Apache Spark. In this blog...
Real-Time End-to-End Integration with Apache Kafka in Apache Spark’s Structured Streaming. April 4, 2017, by Sunil Sitaula in Engineering Blog. Structured Streaming APIs enable building end-to-end streaming applications called continuous applications in a consistent, fault-tolerant manner...
Next Generation Physical Planning in Apache Spark. April 1, 2017, by Aaron Davidson, Eric Liang and Thomas Desrosiers in Engineering Blog. "Never underestimate the bandwidth of a station wagon full of tapes hurtling down the highway." (Andrew Tanenbaum, 1981) Imagine a cold, windy...
On-Demand Webinar and FAQ: Apache Spark MLlib 2.x: How to Productionize your Machine Learning Models. March 28, 2017, by Richard Garris and Jules Damji in Engineering Blog. On March 9th, we hosted a live webinar, "Apache Spark MLlib 2.x: How to Productionize your Machine Learning Models," to address the following...
Analyse One Year of Radio Station Songs Aired with Apache Spark, Spark SQL, Spotify, and Databricks. March 27, 2017, by Paul Leclercq in Engineering Blog.