FU

Thursday, July 29th, 2021 2:49 PM

Using Data Catalog for unstructured data

Can anyone point me to resources online that show what catalog can do with unstructured data (e.g. images, py files). I’m also weighing the value of connecting to an Azure ADLS instance through SQL to pull down file metadata and data versus using file drivers directly to blob storage for csv, xls, and json. Has anyone implemented this and if so what ‘lift’ do you get. For example does it make scan refreshes more effective when new containers/folders are created, do you get more context on folder security, more metadata on files in the folders, etc?

Thanks

2 Messages

3 years ago

Hi Lance,

Not sure if it is relevant, but you might want to check the unstructured data plugin for Collibra. You can find out more here: https://marketplace.collibra.com/listings/data-x-ray-unstructured-data-classification/.

Let me know if you want to chat about it directly ([email protected]).

Kyle

Loading...