Tuesday, June 21st, 2022 4:13 PM

Azure Purview to Collibra Integration - cma file installation not working

Hi Team,

The .cma file provided as part of this integration (https://marketplace.collibra.com/listings/azure-purview-to-collibra-integration/#) does not work during installation on DGC version 2022.06 or 2021.09.
Could you please check and let us know the resolution?

368 Messages

2 years ago

Hello @indranil.chakraborty1.metlife.com,

Thanks for pointing this out.

We are currently looking into this and will keep you updated.

368 Messages

2 years ago

Hello @indranil.chakraborty1.metlife.com,

The Purview to Collibra integration is highly customisable. It is up to the user to pick and choose what kind of entities should be synced from Purview to Collibra and down to which level. Therefore, it makes more sense for users to choose themselves the asset/attribute/relation types they need (OOTB or otherwise), rather than having one CMA file containing many custom asset/attribute/relation types which might not be useful for most users. Having said that, certain OOTB asset/attribute/relation types are pre-set in the properties file by default, though they can still be configured by the user.

Regarding the error when importing the CMA file, can you please try creating the following attributes on the Settings page of your Collibra instance:

  • Size (collibra.attribute.size)
  • Version (collibra.attribute.version)
  • Nickname (collibra.attribute.nickname)

Additionally, can you please also try creating the following relation (in case it does not already exist):

  • Power BI capacity Workspace (collibra.relation.powerbi.capacity.workspace)

Finally, can you please update the corresponding properties in the Purview integration's application.properties file? Thanks

As a side note, these are the resources that the CMA file creates. The Purview integration also supports other properties, which are commented out by default in the properties file.
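To double-check that step, here is a minimal Java sketch that reports which of the four property keys listed above are missing or empty in a properties file. The class name, method name, and placeholder value are hypothetical and not part of the integration. What value each property expects (most likely the ID of the type created in Collibra) is an assumption, so please follow the comments in the shipped application.properties.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Properties;

// Hypothetical helper, not part of the Purview integration.
final class RequiredPropertiesCheck {

    // The four keys named in this thread.
    static final List<String> REQUIRED_KEYS = Arrays.asList(
            "collibra.attribute.size",
            "collibra.attribute.version",
            "collibra.attribute.nickname",
            "collibra.relation.powerbi.capacity.workspace");

    // Returns the required keys that are absent or have a blank value.
    static List<String> missingKeys(Properties props) {
        List<String> missing = new ArrayList<>();
        for (String key : REQUIRED_KEYS) {
            String value = props.getProperty(key);
            if (value == null || value.trim().isEmpty()) {
                missing.add(key);
            }
        }
        return missing;
    }

    public static void main(String[] args) throws IOException {
        // Sample content; the value is a placeholder, not a real ID.
        String sample =
                "collibra.attribute.size=<size-attribute-type-id>\n"
              + "#collibra.attribute.version=\n"
              + "collibra.attribute.nickname=\n";
        Properties props = new Properties();
        props.load(new StringReader(sample));
        System.out.println("Missing or empty: " + missingKeys(props));
    }
}
```

In practice you would load the real file with a FileReader instead of the sample string.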

2 years ago

@spring-team.collibra.com - Thank you, team, for the detailed explanation. The Azure Purview integration has started working. After importing some metadata from Purview to Collibra, I see the error below for tags on some runs but not on others. Why is the behavior inconsistent? Any suggestions for fixing the issues below?

C:\Users\ICHAKR~1\AppData\Local\Temp\1\loadFile7228438158363517241
java.lang.RuntimeException: {"type":"MESSAGE","message":"An asset cant have the same tags assigned twice."}
at com.collibra.marketplace.azure.purview.util.CollibraImportUtil.checkJobStatus(CollibraImportUtil.java:231)
at com.collibra.marketplace.azure.purview.component.CollibraGenericImportComponent.genericImportCall(CollibraGenericImportComponent.java:123)

Another error:

C:\Users\ICHAKR~1\AppData\Local\Temp\1\loadFile4659971217203693464
java.lang.RuntimeException: {"type":"MESSAGE","message":"An asset with signifier 'mssql://xxxxxx:xxxx/MSSQLSERVER/DMSKSQLT/dbo/tbaddressdev' already exists for domain 'MSSQL Table'."}
at com.collibra.marketplace.azure.purview.util.CollibraImportUtil.checkJobStatus(CollibraImportUtil.java:231)
at com.collibra.marketplace.azure.purview.component.CollibraGenericImportComponent.genericImportCall(CollibraGenericImportComponent.java:123)
at com.collibra.marketplace.azure.purview.FullSyncProcessor.start(FullSyncProcessor.java:1084)
at com.collibra.marketplace.azure.purview.controller.EntryPointController.fullSyncTriggeredByApiRequest(EntryPointController.java:64)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

368 Messages

2 years ago

Hello @indranil.chakraborty1.metlife.com,

Thanks for the update. We will look into the import errors you encountered and get back to you.

2 years ago

@spring-team.collibra.com - It looks to me like duplicate assets are appearing in the import JSON, which is causing this issue. Do you have any insights on how to fix it, please?

2 years ago

@spring-team.collibra.com - It looks like the import API call uses the batch size parameter, which is deprecated in the latest DGC version, 2022.06. I think this is what causes the duplicate assets in the import JSON. Is there any way to fix it?

368 Messages

2 years ago

Hello @indranil.chakraborty1.metlife.com,

Thanks for the update.

Can you please (as a private message) share the latest loadFile that the integration created in a temporary folder just before it failed?

The path of this loadFile can be retrieved from the last log message starting with "Temporary Import Load File located at ::". The file contains the Collibra Import API payload that is failing. Thanks
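If the log is long, a small helper can pull out that path. This is only a sketch: it assumes the path follows directly after the "::" marker on the same log line, which is not confirmed here, and the class name, method name, and sample log lines are made up for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical helper, not part of the Purview integration.
final class LoadFilePathFinder {

    static final String MARKER = "Temporary Import Load File located at ::";

    // Returns the text after the marker on the LAST log line containing it.
    // Assumption: the path is whatever follows the marker on that line.
    static Optional<String> lastLoadFilePath(List<String> logLines) {
        String last = null;
        for (String line : logLines) {
            int idx = line.indexOf(MARKER);
            if (idx >= 0) {
                last = line.substring(idx + MARKER.length()).trim();
            }
        }
        return Optional.ofNullable(last);
    }

    public static void main(String[] args) {
        // Made-up log lines for illustration only.
        List<String> log = Arrays.asList(
                "INFO Temporary Import Load File located at :: /tmp/loadFile111",
                "INFO some other message",
                "INFO Temporary Import Load File located at :: /tmp/loadFile222");
        System.out.println(lastLoadFilePath(log).orElse("no load file logged"));
    }
}
```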

2 years ago

sametaggedassignedtwice.txt (1.0 MB)

java.lang.RuntimeException: {"type":"MESSAGE","message":"An asset cant have the same tags assigned twice."}

2 years ago

@spring-team.collibra.com - Please check the temp JSON file above, which is causing the issue.

2 years ago

@spring-team.collibra.com - Do you have any update on this? Duplicate assets are appearing in the import files, which is causing the issue.

368 Messages

Hello @indranil.chakraborty1.metlife.com

Responded via a private message.

Thanks

2 years ago

@kristen.freer or @spring-team.collibra.com - I have responded to you via email. After replacing the source code, compiling, and running, I am still getting the duplicate asset error in the import JSON file.

Regards,
Indranil

2 years ago

@kristen.freer or @spring-team.collibra.com - I have replied to your latest email. Please let me know if you didn’t receive it.

2 years ago

@spring-team.collibra.com - Any update on this issue, please?

@indranil.chakraborty1.metlife.com Did you check your Community DMs? @james.scicluna from @spring-team.collibra.com mentioned that he sent you a private message there.

2 years ago

Yes @kristen.freer. I have sent the details to @james.scicluna through email. Waiting for the response.

368 Messages

Hi @indranil.chakraborty1.metlife.com

Thanks for sending the details. Responded again via a private message. Please let us know if the new fix solved the issue you are facing.

Thanks

368 Messages

@indranil.chakraborty1.metlife.com confirmed that the issue has been resolved.

Cause:

The duplicate assets issue was caused by Purview’s paged search API returning the same entities on different pages. For example, the following two search queries on the POST /search/advanced endpoint both return the same table in their results, even though the search pages defined by the offset and limit do not overlap:

  1. {"filter": {"and": [ { "entityType": "mssql_table", "includeSubTypes": false } ] }, "keywords": "*", "offset": 50, "limit": 50 }

  2. {"filter": {"and": [ { "entityType": "mssql_table", "includeSubTypes": false } ] }, "keywords": "*", "offset": 100, "limit": 50 }

Resolution:

A hotfix was implemented in the Purview Connector to remove any duplicate search results before import (even though the Purview API should in principle not return duplicates on different pages).
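For readers hitting the same problem, the general idea of removing duplicates across pages can be sketched as below. This is not the actual hotfix code. The SearchHit class and its fields are simplified stand-ins, and using an id field as the unique key is an assumption about the search response.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only; not the connector's implementation.
final class PagedSearchDedup {

    // Simplified stand-in for one search result (field names are assumptions).
    static final class SearchHit {
        final String id;
        final String qualifiedName;

        SearchHit(String id, String qualifiedName) {
            this.id = id;
            this.qualifiedName = qualifiedName;
        }
    }

    // Merges pages in order and keeps only the first hit seen for each id.
    static List<SearchHit> mergePages(List<List<SearchHit>> pages) {
        Map<String, SearchHit> byId = new LinkedHashMap<>();
        for (List<SearchHit> page : pages) {
            for (SearchHit hit : page) {
                byId.putIfAbsent(hit.id, hit);
            }
        }
        return new ArrayList<>(byId.values());
    }

    public static void main(String[] args) {
        // Two non-overlapping pages that still share one entity (guid-2).
        List<SearchHit> pageA = Arrays.asList(
                new SearchHit("guid-1", "mssql://host/db/dbo/table_a"),
                new SearchHit("guid-2", "mssql://host/db/dbo/table_b"));
        List<SearchHit> pageB = Arrays.asList(
                new SearchHit("guid-2", "mssql://host/db/dbo/table_b"),
                new SearchHit("guid-3", "mssql://host/db/dbo/table_c"));

        List<SearchHit> merged = mergePages(Arrays.asList(pageA, pageB));
        System.out.println(merged.size() + " unique hits");
    }
}
```

A LinkedHashMap keeps the original result order, so the import payload is built in the same sequence as before, just without the repeated entities.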

@spring-team.collibra.com: Could you please share the hotfix with me as well? I am getting the same error.

368 Messages

Hi @rohit.chandra.1,

I have just shared the Purview Connector hotfix with you via a private message.
