Introducing Glow: An Open-Source Toolkit for Large-Scale Genomic AnalysisOctober 18, 2019 by Frank Austin Nothaft, Karen Feng, Henry Davidge, Ion Stoica, Dr. Jeff Reid, Dr. Lukas Habegger, Evan Maxwell, Leland Barnard and Kiavash Kianfar in Announcements The key to solving some of today’s most challenging medical problems lies in the analysis of genomics data. Understanding the impact of the...
Introducing the MLflow Model RegistryOctober 17, 2019 by Clemens Mewald, Matei Zaharia and Cyrielle Simeone in Announcements Watch the announcement and demo At today’s Spark + AI Summit in Amsterdam , we announced the availability of the MLflow Model Registry...
Delta Lake Now Hosted by the Linux Foundation to Become the Open Standard for Data LakesOctober 16, 2019 by Michael Armbrust and Reynold Xin in Platform Blog Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. At today’s Spark +...
How Informatica Data Engineering Goes Hadoop-less with DatabricksOctober 10, 2019 by Hiral Jasani in Company Blog Back in May, we announced our partnership with Informatica to build out a rich set of integrations between our two platforms. It’s been...
Simple, Reliable Upserts and Deletes on Delta Lake Tables using Python APIsOctober 3, 2019 by Tathagata Das and Denny Lee in Solutions We are excited to announce the release of Delta Lake 0.4.0 which introduces Python APIs for manipulating and managing data in Delta tables...