Persistent Clusters: Simplifying Cluster Management for AnalyticsMay 19, 2017 by Evan Ye, Haogang Chen, Henry Davidge and Prakash Chockalingam in Company Blog Today we are excited to announce persistent clusters for analytics in Databricks. With persistent clusters, users no longer need to go through the...
Query Watchdog: Handling Disruptive Queries in Spark SQLApril 17, 2017 by Alicja Luszczak, Srinath Shankar and Bill Chambers in Product Read Rise of the Data Lakehouse to explore why lakehouses are the data architecture of the future with the father of the data...
Delivering a Personalized Shopping Experience with Apache Spark on DatabricksMarch 31, 2017 by Brett Bevers, Engineering Manager, Data Engineering at Dollar Shave Club in Product This is a guest blog from our friends at Dollar Shave Club. Dollar Shave Club (DSC) is a men's lifestyle brand and e-commerce...
How Apache Spark on Databricks is Taming the Wild West of Wi-FiFebruary 27, 2017 by Tomasz Magdanski in Company Blog iPass is the world’s largest Wi-Fi provider, yet we don’t own a single hotspot. You can think of us as the Uber of...
Anonymizing Datasets at Scale Leveraging Databricks InteroperabilityFebruary 13, 2017 by Don Hillborn in Product A key challenge for data-driven companies across a wide range of industries is how to leverage the benefits of analytics at scale when...
Announcing the Spark Live 2017 World TourJanuary 31, 2017 by Wayne Chan in Company Blog Due to the enthusiasm and positive feedback from last year’s Spark Live tour, we will be hitting the road again in 2017 to...
Integrating Your Central Apache Hive Metastore with Apache Spark on DatabricksJanuary 30, 2017 by Miklos Christine in Company Blog Databricks provides a managed Apache Spark platform to simplify running production applications, real-time data exploration, and infrastructure complexity. A key piece of the...
Delivering Exceptional Care Through Data-Driven MedicineJanuary 25, 2017 by Jorge Caballero in Company Blog This is a guest blog from our friends at Distal. Today, 96% of U.S. health care providers use electronic health records (EHRs) -...
Integration of AWS Data Pipeline with Databricks: Building ETL pipelines with Apache SparkJanuary 23, 2017 by Peyman Mohajerian in Product This is one of a series of blogs on integrating Databricks with commonly used software packages. See the “What’s Next” section at the...
On-Demand Webinar and FAQ: Apache Spark - The Unified Engine for All WorkloadsJanuary 18, 2017 by Wayne Chan in Company Blog Last week, we held a live webinar — Apache Spark - The Unified Engine for All Workloads — to explain the real-world benefits...