Nancy Hensley12
Let's face it, #governance projects are complicated and difficult to get started, that's why we have created this new test drive, in less than 10 minutes you can see what a data catalog can do for your company - how cool is that?
Srinivas Varanasi
How frequently you update this catalogue in data lake?
Srinivas Varanasi
As one of the key definition 's the variety of the data on BIG data keeps changing too.
Jo Ramos
IBM Unified Governance enables organizations to build a master catalog that includes all the data (structured and unstructured).
Jo Ramos
Each users in the enterprise can easily find data, understand what is the quality of that data, engage with other users to collaborate on data and analytics projects, share insights about data.
Jo Ramos
. It also enables data governance officers to ensure that data is managed and complies with corporate and regulatory mandates.
Jo Ramos
The catalog needs to be updated as you ingesting new data in the Data Lake. Ideally you want to do it data ingestion time. At the same time you ingesting, you want to profile, understand data quality, classify the data and publish to catalog.
Jo Ramos
Self-services users (data scientists, business analyst ) also augment the data lake catalog. As they prepare and create new data assets, that should be published to the catalog so that other users can find and leverage the new data.
Ron Reuben
To add to this - once data is landed in the lake and the metadata is extracted and cataloged, keeping the catalog updated is a 'it depends' answer. Depends on characteristics of data lake usage, how frequently data is changing and so on...
John Furrier
Automation is key to scaling data