Shell Oil Use Case: Parallelizing Large Simulations with Apache SparkR on DatabricksJune 23, 2017 by Wayne W. Jones, Dennis Vallinga and Hossein Falaki in Product This blog post is a joint engineering effort between Shell’s Data Science Team ( Wayne W. Jones and Dennis Vallinga ) and Databricks...
Managing and Securing Credentials in Databricks for Apache Spark JobsJune 20, 2017 by Jason Pohl in Platform Blog Since Apache Spark separates compute from storage, every Spark Job requires a set of credentials to connect to disparate data sources. Storing those...
Analysing Metro Operations Using Apache Spark on DatabricksJune 14, 2017 by Even Vinge, Senior Manager - EY Advisory, Data & Analytics in Company Blog This is a guest blog from EY Advisory Data & Analytics team, who have been working with Sporveien in Oslo building a platform...
Databricks Serverless: Next Generation Resource Management for Apache SparkJune 7, 2017 by Greg Owen, Eric Liang, Prakash Chockalingam and Srinath Shankar in Product As the amount of data in an organization grows, more and more engineers, analysts and data scientists need to analyze this data using...
Sharing Knowledge with the Community in a Preview of Apache Spark: The Definitive GuideJune 5, 2017 by Bill Chambers and Matei Zaharia in Announcements Apache Spark has seen immense growth over the past several years. The size and scale of this Spark Summit is a true reflection...