Databricks Sets Official Data Warehousing Performance Record

November 2, 2021 by Reynold Xin and Mostafa Mokhtar
Today, we are proud to announce that Databricks SQL has set a new world record in 100TB TPC-DS, the gold standard performance...

Now Generally Available: Simplify Data and Machine Learning Pipelines With Jobs Orchestration

November 1, 2021 by Roland Fäustlin
We are excited to announce the general availability of Jobs orchestration, a new capability that lets Databricks customers easily build data and...

Moneyball 2.0: Real-time Decision Making With MLB’s Statcast Data

October 28, 2021 by Max Wittenberg
The Oakland Athletics baseball team in 2002 used data analysis and quantitative modeling to identify undervalued players and create a competitive lineup on...

100 Years of Horror Films: An Analysis Using Databricks SQL

When it comes to the history of film, perhaps no genre says more about us as humans than horror, which taps into our...

GPU-accelerated Sentiment Analysis Using Pytorch and Huggingface on Databricks

October 28, 2021 by Srijith Rajamohan, Ph.D.
Sentiment analysis is commonly used to analyze the sentiment present within a body of text, which could range from a review, an email...

Curating More Inclusive and Safer Online Communities With Databricks and Labelbox

October 21, 2021 by JT Vega
This is a guest authored post by JT Vega, Support Engineering Manager, Labelbox. While video games and digital content are a source...

Simplifying Data + AI, One Line of TypeScript at a Time

October 21, 2021 by Reynold Xin and Matei Zaharia
Today, Databricks is known for our backend engineering, building and operating cloud systems that span millions of virtual machines processing exabytes of data...

Introducing SQL User-Defined Functions

October 20, 2021 by Serge Rielau and Allison Wang
A user-defined function (UDF) is a means for a user to extend the native capabilities of Apache Spark™ SQL. SQL on Databricks has...

Introducing Apache Spark™ 3.2

We are excited to announce the availability of Apache Spark™ 3.2 on Databricks as part of Databricks Runtime 10.0. We want to...

MLflow for Bayesian Experiment Tracking

October 18, 2021 by Srijith Rajamohan, Ph.D.
This post is the third in a series on Bayesian inference ([1], [2]). Here we will illustrate how to use...