Matei Zaharia

Matei is the CTO and co-founder of Databricks and an Associate Professor of Computer Science at UC Berkeley. He started the Apache Spark project in 2009 during his Ph.D. at UC Berkeley and has worked on other widely used data and AI software, including MLflow, Delta Lake, and DBRX. His most recent research focuses on combining large language models (LLMs) with external data sources, such as search systems, and on improving their efficiency and result quality. Matei’s research has been recognized with the 2014 ACM Doctoral Dissertation Award and the U.S. Presidential Early Career Award for Scientists and Engineers (PECASE).

Matei Zaharia's posts

Cybersecurity | July 14, 2026
Blocking Slow-Burn Attacks: Contextual Policies in Omnigent
7 min read

Announcements | July 8, 2026
Benchmarking Coding Agents on Databricks’ Multi-Million Line Codebase
9 min read

Product | July 7, 2026
Contextual Policies in Omnigent: Using session state to better govern AI agents
10 min read

AI Engineering | June 13, 2026
Introducing Omnigent: A Meta-Harness to Combine, Control and Share Your Agents
5 min read

Announcements | March 11, 2026
Introducing Genie Code
10 min read

Announcements | March 11, 2026
Databricks acquires Quotient AI to power AI agent evaluations
3 min read

Announcements | February 23, 2026
Spark Declarative Pipelines: Why Data Engineering Needs to Become End-to-End Declarative
6 min read

Announcements | December 2, 2025
Completing the Lakehouse Vision: Open Storage, Open Access, Unified Governance
6 min read

Announcements | June 12, 2025
A New Era of Databases: Lakebase
6 min read

Announcements | June 11, 2025
MLflow 3.0: Build, Evaluate, and Deploy Generative AI with Confidence
12 min read

Showing 1 - 10 of 20 results
