Map Your Lakehouse Content with DiscoverX
Overview
An enterprise lakehouse holds many datasets that come from different sources and may belong to different business units. These datasets can span hundreds of tables, each with its own schema, and those schemas evolve over time. Cybersecurity is a good example: data arrives from many different source systems and lands in the lakehouse. In such a complex dataset ecosystem, answering simple questions like “Have we ever detected this IP address?” or “Which columns contain IP addresses?” can become impractical and expensive.
DiscoverX can automate the discovery of all columns that might contain specific patterns (e.g., IP addresses, MAC addresses, or fully qualified domain names) and automatically generate search and indexing queries that span multiple tables and columns.
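To make the idea concrete, the following is a minimal sketch of pattern-based column discovery in plain Python. It does not use the DiscoverX API: the regular expressions, the `classify_columns` and `build_search_query` helpers, the match threshold, and the table and column names are all assumptions made for illustration. It samples values per column, labels a column when most sampled values match a pattern, and then generates one search query across every column that carries that label.

```python
import re

# Hypothetical patterns for illustration; real detection rules may differ.
PATTERNS = {
    "ip_v4": re.compile(
        r"^(?:(?:25[0-5]|2[0-4]\d|1?\d?\d)\.){3}(?:25[0-5]|2[0-4]\d|1?\d?\d)$"
    ),
    "mac": re.compile(r"^(?:[0-9A-Fa-f]{2}[:-]){5}[0-9A-Fa-f]{2}$"),
}


def classify_columns(samples, threshold=0.9):
    """Label each (table, column) whose sampled values mostly match a pattern.

    samples maps (table, column) to a list of sampled values.
    threshold is the assumed minimum share of non-null values that must match.
    """
    classified = {}
    for key, values in samples.items():
        non_null = [str(v) for v in values if v is not None]
        if not non_null:
            continue
        for name, pattern in PATTERNS.items():
            hits = sum(1 for v in non_null if pattern.match(v))
            if hits / len(non_null) >= threshold:
                classified[key] = name
                break
    return classified


def build_search_query(classified, pattern_name, value):
    """Build one SQL statement that searches every column with the given label."""
    escaped = value.replace("'", "''")  # escape single quotes in the literal
    selects = [
        f"SELECT '{table}' AS table_name, '{column}' AS column_name, "
        f"COUNT(*) AS hits FROM {table} WHERE {column} = '{escaped}'"
        for (table, column), name in sorted(classified.items())
        if name == pattern_name
    ]
    return "\nUNION ALL\n".join(selects)


# Hypothetical sampled values from three columns in two tables.
samples = {
    ("fw.logs", "src"): ["10.0.0.1", "192.168.1.254", None],
    ("fw.logs", "user"): ["alice", "bob"],
    ("dhcp.leases", "hw_addr"): ["aa:bb:cc:dd:ee:ff", "00-11-22-33-44-55"],
}

classified = classify_columns(samples)
query = build_search_query(classified, "ip_v4", "10.0.0.1")
print(classified)
print(query)
```

In a lakehouse, the sampled values would come from the tables themselves rather than from an in-memory dictionary, and the generated statement would be run by the query engine. Sampling with a threshold, rather than requiring every value to match, is a design choice that tolerates a few malformed rows at the cost of occasional false positives.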
Type
- Breakout
Experience
- In Person
Track
- Data Governance, Databricks Experience (DBX)
Industry
- Enterprise Technology, Professional Services
Difficulty
- Intermediate
Duration
- 40 min