Spark Live 2016 Tour Recap
January 3, 2017 by Wayne Chan in Company Blog
The Apache Spark community had quite the year in 2016. It has maintained its billing as the largest and most active open source...

Top 10 Apache Spark Blog Posts from 2016
December 30, 2016 by Jules Damji in Engineering Blog
Spark Summit will be held in Dublin, Ireland on Oct 24-26, 2017. Get your ticket before it sells out! Here’s...

Introducing Apache Spark 2.1
December 29, 2016 by Reynold Xin in Engineering Blog
Spark Summit will be held in Boston on Feb 7-9, 2017. Check out the full agenda and get your ticket before it sells...

10 Things I Wish I Knew Before Using Apache SparkR
December 28, 2016 by Neil Dewar in Engineering Blog
This is a guest post from Neil Dewar, a senior data science manager at a global asset management firm. In this blog...

Deep Learning on Databricks
December 21, 2016 by Joseph Bradley and Tim Hunter in Engineering Blog
We are excited to announce the general availability of Graphics Processing Unit (GPU) and deep learning support on Databricks! This blog post will...

Scalable Partition Handling for Cloud-Native Architecture in Apache Spark 2.1
December 15, 2016 by Eric Liang, Michael Allman and Wenchen Fan in Engineering Blog
Apache Spark 2.1 is just around the corner: the community is going through the voting process for the release candidates. This blog post discusses...

On Demand Webinar and FAQ: Apache Spark MLlib 2.x: Migrating ML Workloads to DataFrames
December 14, 2016 by Joseph Bradley and Jules Damji in Company Blog
Last week, we held a live webinar, Apache Spark MLlib 2.x: Migrating ML Workloads to DataFrames, to demonstrate the ease with which...

Apache Spark Scala Library Development with Databricks
December 12, 2016 by Jason Pohl in Company Blog
The movie Toy Story was released in 1995 by Pixar as the first feature-length computer-animated film. Even...

Integrating Apache Airflow and Databricks: Building ETL pipelines with Apache Spark
December 8, 2016 by Peyman Mohajerian in Product
This is one of a series of blogs on integrating Databricks with commonly used software packages. See the “What’s Next” section at the...

On-Demand Webinar and FAQ: How to Evaluate Cloud-based Apache Spark Platforms
November 23, 2016 by Wayne Chan in Company Blog
Last week, we held a live webinar, How to Evaluate Cloud-based Apache Spark Platforms, to help those who are currently evaluating various...