Accelerating developers by ditching the data centerJune 10, 2020 by R Tyler Croy in Company Blog Guest blog by R Tyler Croy, Director of Platform Engineering at Scribd People don’t tend to get excited about the data platform. It...
How the Minnesota Twins Scaled Pitch Scenario Analysis to Measure Player Performance - Part 1June 4, 2020 by Rafi Kurlansik, Tushar Madan and Hector Leano in Company Blog Statistical Analysis in the Game of Baseball A single pitch in Major League Baseball (MLB) generates tens of megabytes of data, from pitch...
Data Science with Azure Databricks at Clifford ChanceMarch 31, 2020 by Mirko Bernardoni and Lulu Wan in Company Blog Guest blog by Mirko Bernardoni (Fiume Ltd) and Lulu Wan (Clifford Chance) Introduction With headquarters in London, Clifford Chance is a member of...
Engineering population scale Genome-Wide Association Studies with Apache Spark™, Delta Lake, and MLflowSeptember 20, 2019 by Frank Austin Nothaft, Henry Davidge and William Brandler in Engineering Blog Get an early preview of O'Reilly's new ebook for the step-by-step guidance you need to start using Delta Lake. Try this notebook series...
Building Foot-Traffic Insights DatasetAugust 25, 2019 by Safegraph Engineering in Customers Where should I build my next coffee shop? Businesses want to understand both the physical world around them and how people interact with...