We are proud to announce the release of Databricks Runtime 5.3, which includes several new features and improvements, including:
Delta Time Travel has graduated to general availability. It adds the ability to query a snapshot of a table using a timestamp string or a version, using SQL syntax as well as DataFrameReader options for timestamp expressions.
Time Travel has many use cases, including:
https://www.youtube.com/watch?v=TrBDuaF7g5A
For more details, see Query an older snapshot of a table (time travel), and Merge Into (Databricks Delta).
Databricks Runtime 5.3 lets you stream data from a MySQL table directly into Delta for downstream consumption in Spark analytics or data science workflows. Leveraging the same strategy that MySQL uses for replication to other instances, the binlog is used to identify updates that are then processed and streamed to Databricks as follows:
A new FUSE mount optimized for data loading, model checkpointing, and logging from each worker to a shared storage location, file:/dbfs/ml provides high-performance I/O for deep learning workloads.
For details, see Prepare Storage for Data Loading and Model Checkpointing.
Apart from the above, Databricks Runtime 5.3 also includes:
To learn more about the release, please see the Databricks Runtime 5.3 Release Notes.