Introducing Spark Connect – The Power of Apache Spark, Everywhere
At last week’s Data and AI Summit, we highlighted a new project called Spark Connect in the opening keynote. This blog post walks…
At last week’s Data and AI Summit, we highlighted a new project called Spark Connect in the opening keynote. This blog post walks…
Making an open data marketplace Stepping into this brave new digital world we are certain that data will be a central product for…
Today we are thrilled to announce a full lineup of open source connectors for Go, Node.js, Python, as well as a new CLI…
Today we are happy to announce the availability of Apache Spark™ 3.3 on Databricks as part of Databricks Runtime 11.0. We want to…
Data + AI Summit is the global event for the data community, where practitioners, leaders and visionaries come together to engage in thought-provoking…
Streaming is one of the most important data processing techniques for ingestion and analysis. It provides users and developers with low latency and…
This blog article has been cross-posted from the Delta.io blog. We are excited for the release of Delta Sharing 0.4.0 for the open-source…
As with all parts of our platform, we are constantly raising the bar and adding new features to enhance developers’ abilities to build…
Delta Lake 1.1 improves performance for merge operations, adds the support for generated columns and improves nested field resolution With the tremendous contributions…
The Delta Standalone library is a single-node Java library that can be used to read from and write to Delta tables. Specifically, this…