Data lineage automation using the OpenLineage framework
We aim to get the full story around our data with Collibra Data Lineage, using as much automation as possible.
I need to ingest data lineage metadata into DGC using the OpenLineage framework, in order to display real-time technical data lineage paths in the DGC centralized governance framework.
Have you already experimented with such a pipeline solution, or with any alternative tool as a metadata repository reference?
SheeriCabral
3 years ago
Hello @jean-luc.garnier.safrangroup.com ! I’m the Product Manager for technical lineage in Collibra.
At this time, OpenLineage provides “operational lineage” - you can observe what is happening with pipelines, e.g. when they last ran successfully.
Collibra offers “definitional lineage” - you can see how your data flows through your environment. The use cases are impact analysis (“if I change this column’s name, what is affected by that?”) and security/privacy (“column X in table A is PII, and it’s transformed and loaded into column Y of table B, so table B/column Y should be marked as PII as well.”)
We are thinking about adding operational lineage - we would integrate with OpenLineage, at least using their schema/framework - but we do not have any firm plans or commitments.
Hope that helps!
-Sheeri Cabral
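To make the two “definitional lineage” use cases above concrete, here is a minimal sketch in Python. It is only an illustration and not how Collibra implements lineage: the `LINEAGE` mapping, the column names, and both helper functions are hypothetical. Impact analysis becomes a walk over the columns downstream of the one being changed, and the PII case is the same walk with a tag carried along.

```python
from collections import deque

# Hypothetical column-level lineage: each source column maps to the
# columns it is transformed and loaded into.
LINEAGE = {
    "table_a.x": ["table_b.y"],
    "table_b.y": ["report_c.z"],
    "table_a.id": ["table_b.id"],
}


def downstream(column, lineage):
    """Return every column reachable from `column` (breadth-first walk)."""
    seen = set()
    queue = deque([column])
    while queue:
        current = queue.popleft()
        for target in lineage.get(current, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen


def propagate_pii(pii_columns, lineage):
    """Return the PII columns plus everything derived from them."""
    tagged = set(pii_columns)
    for column in pii_columns:
        tagged |= downstream(column, lineage)
    return tagged


# Impact analysis: what is affected if table_a.x is renamed?
affected = downstream("table_a.x", LINEAGE)

# Security/privacy: which columns should be marked PII if table_a.x is?
pii = propagate_pii({"table_a.x"}, LINEAGE)
```

With this sample mapping, `affected` contains `table_b.y` and `report_c.z`, and `pii` additionally contains `table_a.x` itself.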
miguelguillen
2 years ago
Hi @sheeri.cabral , hope you are well. Are there any new updates or plans available to share? Thank you!
tomkuppens1
2 years ago
Hi,
We are also interested in such an integration.
We are planning a POC to hack Collibra in such a way as to integrate operational lineage, i.e., to include job information (job start and end times, whether a job failed). We think it’s possible, but it will cost us, and we would prefer an integrated solution: this type of lineage is much needed.
Thanks,
Tom
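The job information described above (start and end times, whether a job failed) could be derived from OpenLineage-style run events roughly as follows. This is a sketch under assumptions, not the planned POC: the events are hand-written dictionaries that use a subset of the OpenLineage run event fields (`eventType`, `eventTime`, `run.runId`, `job.namespace`, `job.name`), the run IDs are simplified placeholders rather than real UUIDs, and `summarize_runs` is a hypothetical helper. How the result would be loaded into Collibra is left out.

```python
from datetime import datetime

# Hand-written sample events shaped like OpenLineage run events.
# Real events carry more fields (producer, schemaURL, inputs, outputs)
# and use UUIDs for runId; these values are placeholders.
EVENTS = [
    {"eventType": "START", "eventTime": "2024-01-01T02:00:00+00:00",
     "run": {"runId": "run-1"},
     "job": {"namespace": "etl", "name": "load_table_b"}},
    {"eventType": "COMPLETE", "eventTime": "2024-01-01T02:05:00+00:00",
     "run": {"runId": "run-1"},
     "job": {"namespace": "etl", "name": "load_table_b"}},
    {"eventType": "START", "eventTime": "2024-01-02T02:00:00+00:00",
     "run": {"runId": "run-2"},
     "job": {"namespace": "etl", "name": "load_table_b"}},
    {"eventType": "FAIL", "eventTime": "2024-01-02T02:01:30+00:00",
     "run": {"runId": "run-2"},
     "job": {"namespace": "etl", "name": "load_table_b"}},
]


def summarize_runs(events):
    """Collapse run events into one record per run: job, start, end, status."""
    runs = {}
    ordered = sorted(events, key=lambda e: datetime.fromisoformat(e["eventTime"]))
    for event in ordered:
        timestamp = datetime.fromisoformat(event["eventTime"])
        job = event["job"]
        summary = runs.setdefault(event["run"]["runId"], {
            "job": f'{job["namespace"]}.{job["name"]}',
            "start": None,
            "end": None,
            "status": "RUNNING",
        })
        if event["eventType"] == "START":
            summary["start"] = timestamp
        elif event["eventType"] in ("COMPLETE", "FAIL", "ABORT"):
            # Terminal events close the run and record its outcome.
            summary["end"] = timestamp
            summary["status"] = event["eventType"]
    return runs


runs = summarize_runs(EVENTS)
failed_runs = [run_id for run_id, r in runs.items() if r["status"] == "FAIL"]
```

Each record in `runs` could then be attached to the corresponding job asset as attributes; which asset types and attributes to use is a design decision for the integration.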