Community Resources
Version update: March 31, 2025

What is the Collibra Community?
The Collibra Community is your dedicated space to learn best practices, share insights and collaborate with 11,000+ data citizens worldwide. You can use the Community to access valuable content, tutorials, best practices, product documentation and more.

How do I create an account and join the Community?
Visit community.collibra.com and click the person icon in the top left corner to register for the Community.

How do I log in to the platform?
Log in with this login link and use the SSO method, which lets you use the same credentials as your other Collibra sites without creating new logins and passwords.

How do I update my profile?
Once you are logged in to the Community, you can view your profile in the navigation menu. Click the person icon to the right of the bell icon, click "View profile" and edit using the green "Edit" button. We encourage you to complete your profile with a headshot and an About description for a more engaged Community experience.

How do I post a discussion?
1. Visit community.collibra.com/discussion/
2. Click your preferred category, or view all categories from the left side panel.
3. Click "Start a discussion" to create a post.
Visit this discussion post for more information.

Can I edit my notification settings?
Yes. You can customize your preferences for receiving updates, alerts, and notifications related to discussions, mentions, and activity within the Community. Click your profile icon > "Profile Settings" > "Updates" to explore your options.

How do I search for specific topics or discussions?
Use our search bar to search for specific topics, or visit the discussion page to view all of our active forums.

How do I report bugs or issues on the new platform?
If you encounter any bugs or issues on the new platform, please report them to [email protected]. Your feedback is invaluable in helping us improve the platform for everyone.
What should I do if I encounter inappropriate behavior or content?
If you encounter inappropriate behavior or content in the Collibra Community, please let us know by clicking on the discussion post and selecting "Mark as Spam" or "Flag as Inappropriate." You can also email us directly at [email protected].

Why can't I find my previous gamification points and badges in the Community?
We recognize the importance of reputation points as a measure of community engagement. We are improving our gamification features to enhance your Community experience, and we plan to bring back your gamification information soon. Check back for more updates!

How can I provide feedback on the Community?
We strive to provide our community members with the best experience possible, and we would love your feedback to help identify areas for improvement and gather suggestions for new features and enhancements. We send a bi-annual Community member survey in July and December of each year. You can also email us directly at [email protected].

What are certified Data Citizens® User Groups?
Data Citizens® User Groups are interactive virtual groups designed to bring together fellow Collibra data enthusiasts to share insights, collaborate on best practices and drive innovation. Certified groups gain exclusive access to resources, support, beta testing, product roadmaps and expert guidance, with Collibra providing the tools for success.

How do I join a user group?
Browse our user group list and select the group you are interested in joining. Sign up via the dedicated landing page, and we'll send you next steps from there. Once you join, you can participate in discussions, attend events and connect with other members.

How do I stay up to date with Community news?
The Collibra Community sends our members a quarterly newsletter with the latest community news, events and more. We also post weekly announcements on the Community.
We're excited to invite you to an insightful Ask Me Anything session with Databricks and Collibra. Use this opportunity to connect with subject matter experts from both organizations, where you can ask questions about how your organization can tap into their combined power and better understand:
- Why your organization needs Collibra alongside Databricks Unity Catalog
- How you can scale AI initiatives and enable your people to build on each other's successes
- Real-world use cases that showcase the Collibra and Databricks advantage
This Q&A promises invaluable insights and an open forum for all your questions. You can watch the video [here] and read the live questions from the session below.

_________________________________________________________________________________________________________

Question: Both Collibra and Databricks talk about access governance. What are the differences between their approaches, and what specific capabilities does each platform offer in this area?
Answer: Collibra gives users visibility and context around data access, including who should have access and why. It helps users discover and understand the data available in Databricks and any other source, including the business context around who owns it, what it's used for, and the quality of the data, to determine if it's the right data for the use case. Once the right data is identified, the user can request access directly in Collibra, and where required, Collibra will trigger the business workflows to secure the appropriate approvals. Once access is approved, Collibra pushes the request to Databricks Unity Catalog, which provides the policy enforcement layer and the technical capabilities to ensure that policies are executed efficiently.

Question: Can you talk more about how Collibra can help streamline access management and requests on my Databricks Unity Catalog?
Answer: There are a couple of different routes.
The most obvious one is that most organizations have an agreed-upon process for provisioning data in source systems, including Databricks. Collibra allows you to search for the right data, understand the context, and request access to it. It's much like shopping on a popular shopping site: you find the product you want, add it to your shopping basket, and check out. Collibra can then integrate with your internal ticketing system to put the request through the approval process, providing an audit history of the approval. Further, we can extend these capabilities with Collibra Protect by pushing the policy down to Databricks Unity Catalog, translating it from natural language into a row-filtering or column-masking policy that Databricks Unity Catalog can use. In other words, if you want to give marketing access to sensitive data like customers' first names, last names, and email addresses, you can define a policy in natural language that hashes those values out. The policy is pushed down to Databricks Unity Catalog, which then does the heavy lifting.

Question: Are there any plans to get source tagging in Collibra from the JDBC connection to Databricks? Also, are there any plans for allowing profiling and sampling with Collibra's Unity Catalog integration?
Answer: Collibra has worked with some of our customers to successfully push additional tagging and context from Collibra to Databricks through a custom integration, so if you have a near-term need, we have an accelerator that can help. Currently, metadata exchange between Collibra and Databricks is one-way, with Collibra ingesting metadata from Unity Catalog. Bidirectional metadata exchange between Databricks Unity Catalog and Collibra is on our roadmap for this year. The exception is Collibra Protect, available today, which allows policies defined in Collibra to be enforced within the Databricks environment.
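The effect of the hash-based column masking described above can be sketched in Python. This is an illustrative assumption only: Collibra Protect expresses the policy in natural language and Databricks Unity Catalog enforces it natively; the column names, the `pii_approved` group, and the SHA-256 choice are all hypothetical.

```python
import hashlib

# Assumed sensitive columns for illustration; a real policy would be defined
# in Collibra Protect and enforced by Databricks Unity Catalog, not in Python.
SENSITIVE_COLUMNS = {"first_name", "last_name", "email"}

def mask_value(value: str) -> str:
    """Replace a sensitive value with a deterministic SHA-256 digest."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()

def apply_masking(row: dict, user_groups: set) -> dict:
    """Hash sensitive columns unless the user belongs to an approved group."""
    if "pii_approved" in user_groups:  # hypothetical approved group name
        return dict(row)
    return {
        col: (mask_value(val) if col in SENSITIVE_COLUMNS else val)
        for col, val in row.items()
    }

row = {"first_name": "Ada", "email": "ada@example.com", "region": "EMEA"}
masked = apply_masking(row, {"marketing"})
print(masked["region"])  # non-sensitive column passes through unchanged
```

The point of the sketch is that the masking is deterministic and group-aware: marketing still sees non-sensitive columns and can join on the hashed values, without ever seeing the raw PII.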
Question: Will the "bidirectional metadata transfer" include the ability for a change made in Collibra to push updates into Databricks Unity Catalog? If so, can we tell Collibra which pieces of metadata should not be altered in this way (for example, table schema metadata)?
Answer: As part of the integration and synchronization between Databricks Unity Catalog and Collibra, not only would metadata and lineage information come from Databricks Unity Catalog into Collibra, but Collibra metadata would also be able to be ingested into Databricks Unity Catalog. We certainly understand that customers have a lot of flexibility in capturing and managing metadata within Collibra, so we would have to provide functionality to specify which pieces of metadata you want to push back to Databricks Unity Catalog. We're developing that capability now, so I can't get too far into the details.

Question: Are there any plans to integrate Collibra business glossary terms so that certified glossary terms can be auto-assigned to Delta Lake columns, instead of using the Databricks Assistant AI to generate the column descriptions?
Answer: From a Collibra perspective, you should be able to leverage the bidirectional metadata exchange on our roadmap to push metadata, including business glossary terms, back into Databricks. While we can't speak to all the specifics of the solution, we have planned the capability to push tags from Collibra into Databricks as a way to curate different objects in Databricks Unity Catalog, and I think that will help share some business context between the two, for example for auto-categorization and for column names. It's a bit of an open question as to where you're going to want to do that curation, but having much of it done in your enterprise business glossary makes sense. It is our understanding that a future release will give you the ability to get SQL-based lineage.
In later releases, you will see incremental updates that support wider lineage capabilities, such as capturing Python transformations, volumes, notebooks, and so on. With these future capabilities, you will be able to push metadata, including business glossary terms, back into Databricks. This capability may require further enhancement if it is not part of our initial integration launch.

Question: We currently have to auto-stitch Databricks metadata back to legacy source systems. Is there anything on Collibra's roadmap to support this automation?
Answer: Besides the metadata ingestion from Databricks Unity Catalog, Collibra integrates with Databricks Unity Catalog to bring in the technical lineage that Databricks captures. You can stitch it together with your Collibra lineage, whether with Power BI, Tableau, or ETL sources. On Collibra's roadmap, we plan to further enhance our technical lineage integrations with Databricks. For example, volumes, notebooks, and SQL transformations happening within Databricks are among the items on our roadmap for technical lineage integration between Collibra and Databricks. If your organization wants to do more in this area, Collibra would love to have a follow-up conversation to better understand your situation.

Question: Will the Collibra Databricks connection support lineage for indirect dependencies in the lineage?
Answer: Collibra leverages the lineage information available within Databricks and its system tables, and if you are experiencing a gap, both organizations would be interested in understanding your situation in more detail. Please reach out to your Databricks account team. Once we have a better understanding, Databricks can work with Collibra to see how that gap could be filled, as you can never be too ambitious about what you incorporate in lineage.
Question: We are struggling to visualize how Collibra and Databricks can work together as part of a data engineer's natural process/journey to seamlessly allow an engineer to explore the data catalog and then transition to using those data assets in their pipeline.
Answer: It comes down to the personas that will be using that Databricks data. Collibra's Data Marketplace lets you answer any persona's questions up front, allowing them to channel into Databricks correctly. For example, maybe a data engineer is looking for data that's curated or looked after by another line of business and wants to understand how they are calculating KPIs in that particular data. Collibra provides a variety of business and operational context, offering important details not available in Unity Catalog.

Question: Is there any plan to add support for Microsoft Entra ID to connect to Unity Catalog instead of using only Databricks service principals?
Answer: From Databricks: This is like a single sign-on. We're always looking at additional ways to enhance our partnerships, including our hyperscaler partners, to ensure we're working the right way with their product lines. I don't have information about this specific one, but if this is of interest to our joint customers with Microsoft, please reach out to your Databricks account team.

Question: Can Collibra pull DLT and traditional notebook plus Delta table transformation lineage from Databricks?
Answer: When it comes to DLT, certain capabilities are supported today with Collibra, and some are on the roadmap. This is based on what's available to Collibra from Databricks. For notebooks, that is on the Collibra roadmap for the second half of this year, after volumes. In Databricks, I believe that lineage is captured today if you are using materialized views or streaming tables that use DLT under the hood, but there may be a gap if you're directly creating DLT pipelines.
Don't hesitate to get in touch with your Databricks account team and let them know the specific item you're looking for. We'll check with the lineage team on timelines if there is a gap. If it's available in Databricks, Collibra will certainly pull that information.

Question: We're currently migrating old workloads from our on-prem enterprise data warehouse. Can Collibra help accelerate this?
Answer: We've seen a number of customers leverage Collibra to support their journey to the cloud or migration from one data store to another. Migration is an interesting challenge, and you can use Collibra to accelerate your journey in a couple of ways. Data quality is top of mind. For example, suppose you're moving from a legacy technology, such as operational SQL Server, Oracle, or PostgreSQL databases, to Databricks. In that case, you can use Collibra Data Quality to help measure the underlying quality of the system and understand quality issues before the data is moved. Another example is assigning ownership for review. At Collibra, we have a very flexible operating model that enables you to assign ownership and responsibility to the right individuals. This is important as you prepare your data to move, and for ongoing management and quality monitoring after the move. From a metadata and lineage perspective, it's absolutely critical to understand data before it's migrated, to see how it flows and transforms between systems. This will allow you to address any potential dependencies or challenges as you move critical workloads.

Question: How does Collibra help with governance in AI applications like AI/BI Genie?
Answer: The great thing about working with Databricks and Collibra together is that Databricks allows you to have your data platform, your data warehouse, and your AI governance layer all in one place. The great thing about working with Collibra is that you can expose those technical capabilities to a much wider user group.
I think a lot of compliance and legal folks in these organizations know they have to get involved in AI projects early, but they may not know how to do it or which platform to use to approach their data science team. Collibra's AI governance capability allows more parts of the organization to be involved in the AI story, so their input on requirements and expected outcomes is documented and considered at the beginning of an organization's AI journey rather than toward the end.

Question: How can Collibra elevate experimental capabilities like Genie?
Answer: Genie, an AI assistant capability, relies heavily on Unity Catalog metadata. The richer the metadata, the richer the responses you will get from the Genie product from Databricks. That's where Collibra can help, because Collibra has richer metadata across the enterprise that can be used as part of the response when you ask a question.

Question: Can we connect lineage from tools like Power BI to Databricks, and how does that work?
Answer: The Power BI connector definitely works back into Databricks Unity Catalog, so we can stitch together Databricks lineage with Power BI lineage. This is great because when people are fetching data from the warehouse into a Power BI data model, or just directly querying into reports, we not only fetch those reports in the lineage but also interpret the DAX expressions using AI. This gives you full visibility into what data is being used for which mission-critical reports.

Question: We have metadata and lineage in Unity Catalog, and then we have it in Collibra. What's the benefit of having both?
Answer: It's not that metadata and lineage in Collibra and metadata and lineage in Unity Catalog are two separate entities doing two separate things. The idea is that metadata and lineage form the linchpin that holds the systems together.
Collibra pulls in the metadata and the lineage (and hopefully soon will also have that full bidirectional sync). In addition, data quality rules get created in Collibra Data Quality & Observability, and the processing gets pushed down to Databricks. So there's already a lot of traversing between the two systems that relies on metadata. The value is, of course, the metadata, and the lineage is a product of the fantastic things you're doing with Databricks, including delivering analytics and AI at scale and speed. What you're doing in Collibra is taking that metadata and then enriching it. It's kind of like an inverted V funnel, where everything has a center of gravity around metadata and lineage, where the two systems stick together. But the value of Collibra is that we're extending with all of the metadata that you don't have in Unity Catalog: the business processes, the terminology, the KPI definitions, the use cases, the DQ metrics. All of these things come together to make one holistic journey: find the data product and work your way down, understanding everything as you go, to ultimately drive fully informed, compliant access to data. So it's not one or the other. The two together produce something more significant than the sum of the parts.

Question: How does AI governance work with Databricks MLflow?
Answer: From a Collibra perspective, our partnership with Databricks is about making it easier for organizations to ingest the metadata and lineage information so they don't have to build and maintain an API integration. Today, we support AI governance and Unity Catalog models. We are working on supporting MLflow as well, but today it would be the customer's responsibility to extract the metadata in MLflow and push it into Collibra in support of AI governance. From Databricks: This actually ties really nicely to the previous question about where you keep your metadata, or how you have multiple copies of metadata.
The model metadata and the table metadata in Unity Catalog are the foundations of your technical data governance solution. You can then use that information in multiple places, including your business data catalog, like Collibra. This is just another example of that: instead of syncing table metadata to be curated in Collibra, you're syncing model metadata. AI use cases in Collibra are another way to collaborate and bring what you have in Databricks to a broader user group.

______________________________________________________________________________________________________________

Thank you for participating in the AMA | Why Databricks and Collibra are better together. If you have more questions you would like answered, comment below, and we will get you in touch with one of our experts.

Domain structure is a critical aspect of a well-designed operating model. The recommendations here will help you design your domain structure to achieve these objectives.

Impact
Following these best practices will support platform adoption by improving:
- Platform usability.
- Clarity of governance responsibilities.
- Ease of system maintenance.

Best practice recommendations
Domains are collections of assets with similar characteristics, attributes, responsibilities or roles. When deciding how to group assets into domains, think about which assets will be acted on similarly within the system, for example, assets that have the same governance process or are assigned similar responsibilities. Since an asset can only be a member of a single domain, you may run into situations where it is hard to decide between two or more possible domain assignments, for example, an asset that might have stewards in both a geographically defined domain, such as the North America Group, and a business-function-defined domain, like Finance. In these cases, you should choose the domain of the ultimate "owner" of the asset.
These decisions will sometimes drive the need to create new communities or even shared communities. For more information, go to the Community structure best practice article.

Only create domains when you have a set of assets that will belong to them. Empty domains can cause clutter and confuse your users.

Domain names must be unique within a single community but can be duplicated across communities. However, it is best to name each domain uniquely. For instance, instead of having a Glossary domain within multiple communities, give them specific names such as Finance Glossary and Marketing Glossary.

Add a description to your domain. It is a best practice to provide a clear and useful description for users who may not otherwise be familiar with the content.

While all domains are visible to everyone by default, it is possible to hide domains from view based on users or groups. However, it is a best practice to use this feature sparingly and only where it is clearly required. For example, a domain of sensitive reference data, like salary scales, should have restricted visibility.

Enabling automatic hyperlinking is not recommended. If you do use it, you must enable automatic hyperlinking not only at the system level but also specifically for the domains whose assets you want to be hyperlinked. Because the links are dynamically maintained, apply this judiciously to smaller, business asset domains rather than broadly, to avoid performance issues.

Validation criteria
The Operating Model Diagnostic workflow, which is available from your Customer Success representative, will help identify empty domains and domains without descriptions, as well as domains without stewards and domains where automatic hyperlinking is enabled. The Operating Model Reverse Engineering (OMRE) workflow, available in the Marketplace, can make it easier to find duplicate-named domains across communities.
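The duplicate-name check described above can be sketched as a short script over an exported list of (community, domain) pairs. The pair format is an assumed export shape for illustration, not the OMRE's actual input or output.

```python
from collections import defaultdict

def duplicate_domain_names(pairs):
    """Return {domain_name: sorted communities} for names used in 2+ communities.

    pairs: iterable of (community, domain) tuples, e.g. from an export of the
    operating model (assumed shape for this sketch).
    """
    by_name = defaultdict(set)
    for community, domain in pairs:
        by_name[domain].add(community)
    return {
        name: sorted(communities)
        for name, communities in by_name.items()
        if len(communities) > 1
    }

pairs = [
    ("Finance", "Glossary"),
    ("Marketing", "Glossary"),        # same generic name in two communities
    ("Finance", "Finance Glossary"),  # uniquely named, as recommended
]
print(duplicate_domain_names(pairs))  # flags "Glossary" only
```

A report like this makes it easy to see which generic names (such as "Glossary") should be renamed to community-specific ones like Finance Glossary and Marketing Glossary.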
Additional information
For more information, go to the following resources:
- See the Collibra Documentation Center for more information on any of the above elements.
- You can run the OMRE to help identify duplicate domain names across communities.
- Domain structure and community structure work together, so our Community structure best practice is another useful resource.

Limit the number of custom asset types and statuses to optimize both operating model maintenance and user experience.

Impact
Use out-of-the-box (OOTB) asset types as much as possible to ensure maximum compatibility with future product features. Custom asset types should be used to meet specific business requirements, but it is best to avoid using too many. If there are a lot of custom asset types, review them for duplicates, as well as overlapping or unused asset types, and confirm that the custom asset types cannot be replaced by OOTB asset types.

Ensure that you have reviewed the available asset types before creating a custom asset type. This avoids creating unused custom asset types, which can complicate the governance of the operating model. Introduce a Data Office governance process for the creation of custom asset types. This process should provide guidance on when a custom asset type is absolutely necessary and reduce the risk of unused asset types. You should also review the number of users with permission to create custom asset types, as too many users can result in unused, duplicated or unnecessary custom asset types.

Custom statuses are encouraged to support the asset life cycle. However, keep the number of possible statuses for a given asset type as small as possible to avoid confusing users. If there are more than 30, consider consolidating to fewer statuses.
Topic area
- Operating Model → Metamodel → Asset Model → Asset types
- Operating Model → Execution and Monitoring Concepts → Status types

Monitoring this practice
For customers with established production models:
- Run the Operating Model Reverse Engineering (OMRE) on a regular basis to identify the elements in this article.
- Contact your Collibra representative to run the Operating Model Diagnostic.

Additional information
For more information, go to the following resources:
- Asset types
- Overview of packaged asset types
- Create an asset type

Collibra maintains a 30-day archive of log files by default, but in special circumstances Collibra Support can temporarily extend this period.

Impact
- Comply with industry-specific log retention requirements.
- Keep your environment performant, to avoid potential risks such as issues with restores and issues with import jobs.

Recommendations
When setting up JDBC logs, set parameters to follow the same folder structure as Jobserver logs, to ensure they are written to easy-to-find locations with the appropriate permissions.

In general operation, the default logging level is usually sufficient. You should only set logging levels to a higher level of detail when you are troubleshooting issues with Collibra Support or Engineering teams, and you should only maintain this change until the issue has been duplicated and captured in the logs. Then return the logging levels to their defaults, because higher levels of logging can exhaust the available disk space faster than monitoring tools can detect.

By default, the system retains 30 days of logs in the archive. If there is a business or regulatory requirement to retain log files for longer than 30 days, we recommend external scripting that calls the Collibra Support REST API (<base console url>/docs/rest/index.html#/support) to move archived files to a non-Collibra storage space, such as an S3 bucket, every 2 weeks.
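The external scripting recommendation above can be sketched as follows. The 30-day window is the documented default, but the file listing shape is an assumption, and the actual download and upload steps are deliberately left out: consult the Collibra Support REST API documentation for the real endpoints, and use your own tooling (for example, the AWS CLI or SDK) for the S3 side.

```python
import datetime

RETENTION_DAYS = 30  # Collibra's default log archive window

def files_to_offload(archive_listing, today, retention_days=RETENTION_DAYS):
    """Select archived log files older than the retention window.

    archive_listing: iterable of (filename, archive_date) pairs. This shape is
    an assumption for the sketch; the real listing comes from the Collibra
    Support REST API, whose response format is defined in its documentation.
    """
    cutoff = today - datetime.timedelta(days=retention_days)
    return [name for name, archived_on in archive_listing if archived_on < cutoff]

# A scheduled job (e.g. every 2 weeks, per the recommendation) would download
# each selected file via the Support REST API and upload it to external storage
# such as an S3 bucket. Both steps are omitted here because the exact endpoints
# and storage tooling are deployment-specific.
today = datetime.date(2025, 3, 31)
listing = [
    ("dgc-2025-02-10.log.gz", datetime.date(2025, 2, 10)),
    ("dgc-2025-03-20.log.gz", datetime.date(2025, 3, 20)),
]
print(files_to_offload(listing, today))
```

Running the selection logic separately from the transfer steps also makes it easy to dry-run the job and verify which files would be moved before anything leaves the Collibra environment.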
Validation criteria
Use Collibra Console → Settings → Logs to monitor these recommendations.

Additional information
For more information, go to the following resources:
- Logging
- Environment log settings for DGC services
- Environment log settings for Repository services

By setting and observing benchmark maximums for key model elements, you can improve your user experience in areas such as navigation and readability, as well as avoid overtaxing your system resources.

Impact
Follow these recommendations to maximize the scalability and adoption of your implementation by improving performance and reducing the size of backup/restore files and the duration of import and export queries. Equally, user interface and experience are improved through greater readability and navigation, and by supporting governance processes that are more practical and easier to sustain.

Best practices
Domain-level recommendations
- Keep the number of domains within a single community below 1,000 to aid navigability for users. Use ownership, stewardship, or governance councils as a basis for dividing communities with more domains into multiple communities.
- Try to keep the total number of domains in your model below 10,000, as any more may make the model difficult to manage and navigate. Use any business dimension, such as line of business, geographic region or data domain, as a logical basis to consolidate domains. For example, consolidate all customer schemas or product schemas into a single domain.

Asset-level recommendations
- The number of attributes per asset should not exceed 500. Beyond this limit, it becomes extremely difficult for users to read or navigate what distinguishes one asset from another. As with the benchmarks above, consider a logical or business dimension that will allow you to consolidate attributes.
- Automatic hyperlinking of assets is turned off by default, but it can be turned on.
However, if you allow the number of automatically hyperlinked assets to exceed one million, it can slow performance and negatively impact user experience and adoption.
- If the number of responsibilities per asset, direct or inherited, exceeds 100, governance of the asset becomes difficult to sustain and navigate, and the risk of inadvertent conflicts among the asset's responsibilities increases. A process with tens of roles involved often represents over-engineering of the governance process.

User recommendations
- Exceeding 20,000 users per user group can lead to performance degradation; therefore, use any business dimension, such as line of business, geographic region or data domain, as a logical basis to split large user groups into smaller groups.

Validation criteria
Review the elements above periodically to ensure you are not exceeding the suggested maximums. You can develop a custom workflow to capture volumes of the above and/or use Insights reporting.

Additional information
Go to the diagnostics section in Collibra's Documentation Center for more information on any of the above elements.

Start by understanding the permissions model: responsibilities are used to assign a resource role to one or more users and/or user groups. Based on their responsibilities, users can act on the permissions conveyed to them via the resource role.

Impact
Changes in permissions on global roles can affect users' access to the designated product features, as well as impact your consumption of Standard licenses. Follow these best practices to:
- Clarify your users' experience.
- Reduce confusion and operating model complexity.

Recommendations
Global roles
Global roles grant permissions on product capabilities globally, rather than just on specific resources. Therefore, global roles as defined out-of-the-box (OOTB) should meet most needs and only be changed in special circumstances. Generally, you should use resource roles to develop particular use cases; these are described below.
Resource roles
When creating resource roles, it's good practice to start with a list describing all of the roles you envision, outlining their responsibilities and permissions. These definitions should be public within your organization and shared with all users.

Specific resource role names are better than generic ones. For example, "Steward" doesn't necessarily distinguish between data stewards, business stewards and privacy stewards. Each of these more detailed steward role definitions should then carry a differing set of responsibilities and permissions. The names of roles should be self-explanatory and unique, to avoid multiple roles with the same name. However, do not create too many roles with minor distinctions between them, as this can lead to confusion. It is best to retain the OOTB resource role names, as they are recognized by the workflows that call upon them.

Responsibilities should be assigned at the highest possible level, such as the domain or community level rather than the asset level, to make them easier to maintain and assign. All domains, communities and assets should have some responsibility assigned to them, whether it is ownership, stewardship or SME. There should always be someone responsible for each asset. This is particularly important where workflows are involved, as they cannot complete if the called-upon responsibilities have not been assigned. A governance best practice is to maintain a hierarchy of assigned roles that describes your escalation process.

Validation criteria
Review Read-only vs Standard licenses in the User area of Settings to match against global roles. You can also run the Operating Model Diagnostic Report to see the types of roles and the number of people assigned to them. This workflow is available from your Customer Success representative.

Additional information
For more information, go to the following resources:
- Resource roles
- Users and/or user groups
- Permissions