
Advanced Analytics with HyperLogLog Functions in Apache Spark

May 8, 2019 by Sim Simeonov

Spark + AI Summit 2019 Product Announcements and Recap. Watch the keynote recordings today!

May 7, 2019 by James Nguyen
Spark + AI Summit 2019, the world’s largest data and machine learning conference for the Apache Spark™ Community, brought nearly 5000 registered data...

Efficient Databricks Deployment Automation with Terraform

Managing cloud infrastructure and provisioning resources can be a headache that DevOps engineers are all too familiar with. Even the most capable cloud...

Real-Time Distributed Monitoring and Logging in the Azure Cloud

How can you observe the unobservable? At Databricks we rely heavily on detailed metrics from our internal services to maintain high availability and...

Single Sign-On (SSO) to Third Party Applications

May 3, 2019 by Kyle Lim
This past winter, I was a software engineering intern at Databricks on the Identity and Access Management (IAM) team. During my time here...

Detecting Financial Fraud at Scale with Decision Trees and MLflow on Databricks

Try this notebook in Databricks. Detecting fraudulent patterns at scale using artificial intelligence is a challenge, no matter the use case. The massive...

Understanding Dynamic Time Warping

Try this notebook in Databricks. This blog is part 1 of our two-part series Using Dynamic Time Warping and MLflow to Detect Sales...

Using Dynamic Time Warping and MLflow to Detect Sales Trends

Try this notebook series (in DBC format) in Databricks. This blog is part 2 of our two-part series Using Dynamic Time Warping and...

Introducing MLflow Run Sidebar in Databricks Notebooks

April 30, 2019 by Andrew Chen and Matei Zaharia
At Spark + AI Summit 2019, we announced the GA of Managed MLflow on Databricks in which we take the latest and greatest of open...

Announcing General Availability of Managed MLflow on Databricks

Try this tutorial in Databricks. MLflow is an open source platform to help manage the complete machine learning lifecycle. With MLflow, data scientists...