IBM InfoSphere Information Governance Catalog Provides a window to the data in your enterprise so you can easily create, share and manage all your information assets through an interactive, web-based tool
SAo many companies are these days. IT is the business. Look at the IT "failure" at British Airways a few weeks back. Without working IT they could not operate. Massive impact (and like was said about data science... a lot of this is a human
Exciting to see Seth & ING talk about the virtualization layer and open metadata... This is the open source project I'm working on with others from IBM, ING, Hortonworks and anyone else who'd like to.. See https://cwiki.apache...
Let's face it, #governance projects are complicated and difficult to get started, that's why we have created this new test drive, in less than 10 minutes you can see what a data catalog can do for your company - how cool is that?
Each users in the enterprise can easily find data, understand what is the quality of that data, engage with other users to collaborate on data and analytics projects, share insights about data.
The catalog needs to be updated as you ingesting new data in the Data Lake. Ideally you want to do it data ingestion time. At the same time you ingesting, you want to profile, understand data quality, classify the data and publish to catalog.
Self-services users (data scientists, business analyst ) also augment the data lake catalog. As they prepare and create new data assets, that should be published to the catalog so that other users can find and leverage the new data.
To add to this - once data is landed in the lake and the metadata is extracted and cataloged, keeping the catalog updated is a 'it depends' answer. Depends on characteristics of data lake usage, how frequently data is changing and so on...
You can't be data driven without a data catalog and now you can test drive ours https://www.ibm.com/...
IBM InfoSphere Information Governance Catalog Provides a window to the data in your enterprise so you can easily create, share and manage all your information assets through an interactive, web-based tool