Databricks Delta: A Unified Data Management System for Real-time Big Data. October 25, 2017 by Michael Armbrust, Bill Chambers and Matei Zaharia in Platform Blog. Combining the best of data warehouses, data lakes and streaming. For an in-depth look and demo, join the webinar. Today we are...
Introducing the Natural Language Processing Library for Apache Spark. October 19, 2017 by David Talby in Solutions. This is a community blog and effort from the engineering team at John Snow Labs, explaining their contribution to an open-source Apache Spark...
3 Things CISO’s expect from Tech Companies in a Cloudy World. October 17, 2017 by David Cook in Company Blog. Adding new software to an enterprise is a difficult process. In the past, choosing new software only required budget approval before it could...
Building Complex Data Pipelines with Unified Analytics Platform. October 5, 2017 by Jules Damji and Jason Pohl in Platform Blog. Introduction: Big data practitioners often post recurring questions on Quora: What is data engineering? How to become a data scientist? What’s a data...
Databricks invites Colleen Lewis to Speak about Diversity in the Workplace. September 15, 2017 by Angelos Mikelatos in Company Blog. First I'll start with the sad truth. The technology industry at large has taken many hits over the years for discriminatory practices and...
Looker and Databricks Partner to Bring Data Scientists and Business Users Together. September 14, 2017 by Brian Dirking in Company Blog. We are very excited today as we announce a partnership between Databricks and Looker. We have seen customers using these products together to...
Learn about Apache Spark APIs and Best Practices. September 12, 2017 by Jules Damji and Silvio Fiorito in Company Blog. Since Apache Spark 1.3, Spark and its APIs have evolved to make them easier, faster, and smarter. The goal has been to unify...
Build, Scale, and Deploy Deep Learning Pipelines with Ease. September 6, 2017 by Sue Ann Hong and Tim Hunter in Announcements. At the Spark Summit in San Francisco in June, we announced an open-source project, Deep Learning Pipelines. Deep Learning Pipelines provides...
A Summer of Personal and Professional Growth at Databricks. September 5, 2017 by Karen Feng in Company Blog. This summer, I worked at Databricks as a software engineering intern on the Growth team. By introducing two new features, user groups and...
Do your Streaming ETL at Scale with Apache Spark’s Structured Streaming. September 1, 2017 by Tathagata Das in Announcements. At the Spark Summit in San Francisco in June, we announced that Apache Spark’s Structured Streaming is marked as production-ready and shared...