Open-Source
Einige der weltweit beliebtesten Open-Source-Datentechnologien wurden ursprünglich von Databricks-Ingenieuren erfunden
Built on open data and AI projects trusted by millions of developers
Apache Spark™
Apache Spark is a unified engine for executing data engineering, data science and ML workloads.
Delta Lake
Delta Lake lets you build a lakehouse architecture on top of storage systems such as AWS S3, ADLS, GCS and HDFS.
Apache Iceberg™
Apache Iceberg lets you build a lakehouse architecture on top of storage systems such as AWS S3, ADLS, GCS and HDFS.
Unity Catalog
Unity Catalog is the industry’s only universal catalog for data and AI.
MLflow
MLflow manages the ML lifecycle, including experimentation, reproducibility, deployment and a central model registry.
Delta Sharing
Delta Sharing is the industry’s first open protocol for secure data sharing, making it simple to share data with other organizations.
Redash
Redash enables anyone to leverage SQL to explore, query, visualize and share data from both big and small data sources.