Glow 0.3.0 Introduces New Large-Scale Genomic Analysis Features
April 23, 2020 by Kiavash Kianfar in Engineering Blog
In October of last year, Databricks and the Regeneron Genetics Center® partnered to introduce Project Glow, an open-source analysis tool...

How a Fresh Approach to Safety Stock Analysis Can Optimize Inventory
April 22, 2020 by Bryan Smith and Rob Saker in Engineering Blog
Check out the solution accelerator for the accompanying notebook. A manufacturer is working on an order for a customer only to find that...

Building a Modern Clinical Health Data Lake with Delta Lake
April 21, 2020 by Frank Austin Nothaft, Michael Ortega and Amir Kermany in Platform Blog
The healthcare industry is one of the biggest producers of data. In fact, the average healthcare organization is sitting on nearly 9 petabytes...

COVID-19 Datasets Now Available on Databricks: How the Data Community Can Help
April 14, 2020 by Denny Lee in Engineering Blog
Initially published April 14th, 2020; updated April 21st, 2020. With the massive disruption of the current COVID-19 pandemic, many data engineers and data...

10 Minutes from pandas to Koalas on Apache Spark
March 31, 2020 by Haejoon Lee, Yifan Cao, Hyukjin Kwon and Takuya Ueshin in Solutions
This is a guest community post from Haejoon Lee, a software engineer at Mobigen in South Korea and a Koalas contributor. pandas is...