Thiago Positeli

By Thiago Positeli de Arruda

In 1 July 2025

One thing is already clear: without solid data foundations, no AI will ever be sustainable, secure, or trustworthy.

Artificial Intelligence promises to transform the way we operate day to day and interact with our clients both internal and external. Yet, few organizations are ready to scale AI with confidence. The reason is simple: they lack a proper foundation.

Before diving into what data governance foundations enable AI for the future, it’s worth revisiting a key question:

What are the fundamental pillars of AI Governance?

Institutions such as the OECD, NIST, ISO, and the newly introduced EU AI Act broadly agree on the following pillars for trustworthy and responsible AI:

  • Transparency & Explainability – AI must be understandable and auditable, with traceability of both data and decisions.

  • Data Quality & Governance – Without consistent, reliable, and governed data, effective AI is impossible.

  • Ethics, Fairness & Bias Mitigation – Models must not replicate or amplify existing biases.

  • Security & Privacy – Privacy by design, access controls, and compliance with LGPD/GDPR are essential.

  • Accountability – Clearly defined roles and responsibilities for automated decisions.

  • Model Lifecycle Management – Ongoing monitoring, re-evaluation, version control, and governance of models.

  • Culture & Education on Responsible AI – Organizational awareness of risks, limitations, and best practices.

Why does this matter?

Without structured data governance, AI projects encounter the same recurring issues:

  • Bias in models

  • Lack of traceability and explainability

  • Outdated, low-quality data

  • Sensitive data exposure

And here’s the reality: even data governance programs that have been in place for 3+ years still struggle with foundational gaps and yet, they’re expected to support advanced AI initiatives.

So… Where should you start?
Based on real market experience, here are the true foundational pillars of Data Governance that enable AI with confidence beyond the theory of the DAMA-DMBOK.

1. Metadata Management – The invisible pillar of explainability

AI without context is a black box. That context comes from metadata: definitions, lineage, glossaries, classifications, and business rules. Without them, neither humans nor machines can interpret data and yet, a lot can be automated with good metadata.

A data catalog without active metadata management is just an interface. Tools like Databricks Unity Catalog are great accelerators, but without metadata stewardship, AI cannot be sustained. Especially when metadata becomes duplicated or inconsistently stored.

2. Access & Classification Management – Secure AI starts with control

AI models deal with sensitive data. Without proper data classification, access control, audit trails, training paths, and approval workflows, any AI product or project is exposed to risk.

Don’t be fooled: this must come before your LLMs, predictive models, or any other trendy innovation. It’s a foundational part of data engineering and provisioning.

3. Data Quality – Bad data makes AI worse

AI amplifies everything including errors. Start small, focusing on critical datasets. Pay attention to data entry and source systems. Governance ensures consistency and scalability.

MDM (Master Data Management), often seen as old school, is a secret weapon here. Combine it with analytical environments to ensure trustworthy data at scale.

4. Roles & Responsibilities – Data doesn’t govern itself

Without Data Owners, Data Stewards, Technical Stewards, etc., nothing gets done. Governance without accountable people is just PowerPoint.

5. Data Culture & Decentralization – Without this, AI creates bottlenecks

Want to scale AI? Decentralize decision-making but with clear policies, standards, and automation where possible. Without it, bottlenecks pile up in IT or the data team. Business users end up waiting in endless queues while data teams are seen as order-takers a “data fast-food counter”.

What about DAMA-DMBOK and other frameworks?

The DAMA-DMBOK remains one of the most respected references to guide this journey. It outlines 11 core knowledge areas that support data governance and management and it clearly frames data governance as a business objective, not just a technical one.

When governance is built within the business, it’s much easier to get stakeholder buy-in and justify investment.

But let’s be real: most organizations today still place governance under the CDO or data office, rarely connecting it directly to business operations.

Is that wrong? It depends. The best structure is the one that fits your organization’s reality. But the warning is clear: if governance isn’t linked to business objectives, it loses power, visibility, and prioritization.

And what about the Modern Data Stack?

Despite all the evolving architectures, concepts, and tools Data Mesh, Lakehouse, AI/MLOps the foundations remain the same. Tools change. Principles don’t.

Modern Data Stacks solve for scale and speed.
Data Governance solves for trust, ethics, quality, and accountability.

In Conclusion

This is the foundation of scalable, responsible AI. But let’s be honest: selling that idea internally is tough. Why? Because the value of governance isn’t immediately visible.

If your company already has data products in production, now is the time to embed governance across the board. It will bring calm, trust, and real velocity for what lies ahead.

The secret? Prioritization. Identify the current “pain point” in your company and build governance around that.

“Don’t wait until your environment has 40,000 tables in your data warehouse, lake, or lakehouse or whatever you call it before starting to discuss data access control based on classification.
If possible, address this from the start. But if you’re already there, don’t panic: face the facts, use new insights, and build a solid action plan for the legacy.”

Want to learn more about Data Strategy, Governance & Responsible AI? APGAR designs and delivers innovative data and AI solutions and supports clients with expert advisory services to ensure successful adoption and longterm value. With a team of over 230 data and AI experts, APGAR combines product development, integration, and advisory capabilities to help companies turn data into a strategic advantage.

Would you like to get in touch with our experts?
If you agree, disagree or have something to add to these views on corporate strategy, please contact us.