38 Messages
Collibra Data Lineage
Please post here your remarks, questions or feedback regarding the Marketplace listing Collibra Data Lineage
38 Messages
Please post here your remarks, questions or feedback regarding the Marketplace listing Collibra Data Lineage
Former User
1.2K Messages
4 years ago
Hello,
My use case is to connect to Snowflake and get the technical lineage information and publish in collibra.
I’ve configured the lineage harvester tool as per documentation provided on link(https://productresources.collibra.com/docs/cloud-user/2020.12/Content/CollibraDataLineage/TechnicalLineage/InstallationAndConfiguration/ta_prepare-config-file.htm)
Now when I run the harvester tool, in the show lineage tab of asset I see below parsing error.
'[msg-P00452]: Nothing to process here (input was null) A common cause is that the Harvester’s SQL user doesn’t have the right permissions"
The snowflake user has full read permissions on database.
Did anyone faced such issue? Happy to know how to resolve such issue?
3
0
Former User
1.2K Messages
3 years ago
Hello @martin.masarik.collibra.com Does lineage harvester support NTLM login(window login) for SQL server ?
I’ve been trying A-Z to connect to SQL server from harvester tool but no luck so far, Interesting point is that NTLM is still supported by collibra provided JDBC driver for SQL Server.
I’ve been in contact with collibra support but they also told me that NTLM is NOT supported with harvester.Any idea when this would be supported.Since NTLM is supported with driver we can’t now step back on this.
1
0
noor
262 Messages
3 years ago
How do we handle technical lineage for data sources where there are no out of the box connectors? Can you please elaborate?
1
0
dianereynolds
7 Messages
3 years ago
@noor.shaik - If you find out, let me know?
0
Former User
1.2K Messages
3 years ago
Hi,
Is there a way to ‘default’ a view when a user goes into Diagram i.e. we want the first display to be LOB / Application / Data Domain and NOT have any Data Models open unless the user chooses to click on the “+” symbol?
2
0
Former User
1.2K Messages
3 years ago
Hi @paulo.taylor Is there possibility to feed technical lineage with transformations from external systems let’ say with collibra system APIs, without using lineage harvester?
0
0
paulotaylor
38 Messages
3 years ago
@martin.masarik.collibra.com
1
0
alexandruveruzi
2 Messages
3 years ago
Hi there @paulo.taylor, @martin.masarik.collibra.com,
For more than 1 month now, we cannot run the lineage harvester successfully to sync the technical lineage inside the DGC (failing 9 out of 10 times in average.). I’m talking about the “master batch”. The error seems to be a communication failure between DGC and Techlin.
Full error message: Received ‘410 Gone’ response code to GET https://techlin-aws-us.collibra.com/api/batch/dd9716a7-eea3-4351-a17a-431f8cbae488/status/ request. Response body:
{“slug”: “dd9716a7-eea3-4351-a17a-431f8cbae488”, “status”: “failed”, “message”: “PROCESSING ERROR: “HttpServerError: Received ‘500 Internal Server Error’ response code to POST https://XXXX.collibra.com/rest/catalog/1.0/internal/technicalLineage/relations/importRelations request. Server error after 5 tries. Response body:\n{“statusCode”:500}””}
We raised support cases with Collibra Support but no luck. Sometimes even the technical lineage ran forever and we killed it after 40 hours or so. Usually it should run in approx. 3 hours. This all started when we added Power BI in the mix.
That time the lineage harvester completed successfully in like 3 hours but now it is either running wild for more than 40 hours and then we stop it, or it is failing with those 500/410/424 errors, or it is completing successfully 1 out of 10 times.
This is very unpredictable for us and we cannot use it like this.
Here are some of the cases raised: #67173, #69188, #68439,
One other note is that on DGC it seems to be completing successfully as I see “Synchronization of batch for id: techlin COMPLETED”
How can we get additional help in fixing this issue?
Seems like a communication issue between DGC (hosted on cloud) and Techlin server.
Thanks,
Alex
2
0
Community_Alex
677 Messages
•
14.2K Points
3 years ago
Can anyone help Shruti with this ask re ‘Collibra Edge lineage harvest capability’? @martin.masarik.collibra.com, is there any documentation we can direct Shruti to?
1
0
noor
262 Messages
3 years ago
@martin.masarik.collibra.com @paulo.taylor
Hi Martin
Can you please shed some light on technical lineage harvester’s support for SparkSQL -
Can it simply read from a Databricks hive metastore, parse the table structures & link them across several databases (say, 3) by just doing name matching (assume table names, column names are same)? Or, it needs SQL files as input so it can understand the tables across those 3 databases are really linked via the SQLs? In my case, I have a metadata driven ETL load processes (that is fully parameterized) and I think there is anything meaningful there that can be fed to the lineage harvester (unless the harvester can parse parameterization)
2
0
martinmasarik
9 Messages
3 years ago
@noor.shaik Indeed output of that SELECT query is used to extract view definitions from the database. Then we scan for lineage and will capture and display lineage from underlying tables to that view. What you can upload as a file is any plain DML operations like INSERT, UPDATE, etc. Stored procedures are not currently supported for SparkSQL.
0
balasubrahmanyamestamsetty
1 Message
3 years ago
@martin.masarik.collibra.com Hi Martin,
I modified the suggested customConnectionProperties for azure sql server
“customConnectionProperties” : “instanceName=my-instance;databaseName=my-database;integratedSecurity=true;domain=my-domain;authenticationscheme=ActiveDirectoryPassword;user=my-user;password=my-password”
I took the following as reference : https://docs.microsoft.com/en-us/sql/connect/jdbc/setting-the-connection-properties?view=sql-server-ver16&viewFallbackFrom=sql-server-ver17
But I was not successful . It throwed the following error:
Caused by: com.microsoft.sqlserver.jdbc.SQLServerException: Cannot open server “******.com” requested by the login. The login failed. ClientConnectionId:c1699240-8a35-4a27-bf97-2e6af14c6517
1
0