Bridging Identity and Access Management and Data Governance With One Policy Engine

Identity and access management tells you who authenticated and what systems they can reach. It doesn't tell you what personal data they touched, whether that access had a lawful basis, or whether the data subject consented. Privacy regulations don't ask about system access — they ask about data access. Closing that gap requires a policy engine that understands identity context and data context simultaneously, enforced at the data layer, not the application layer.

Authors

Ethyca Team

Topic

Infrastructure

Published

May 15, 2026

Bridging IAM and Data Governance With One Policy Engine

Key Takeaways

Identity and access management (IAM) tells you who a user is and what systems they can reach. It does not tell you what personal data they accessed, whether that access was purposeful, or whether it aligned with a consent preference or regulatory obligation.
The gap between identity controls and data governance is an infrastructure gap. Workflow improvements and manual reconciliation processes cannot close it because the two systems operate on different enforcement planes.
Most IAM implementations make access decisions at the system or application level. Privacy regulations require decisions at the data level: what category of personal data, under what legal basis, for what processing purpose.
Bridging the two requires a policy engine that understands both identity context and data context simultaneously, enforcing rules at the point where data is actually processed.
Organizations that build this infrastructure gain the ability to honor consent in real time, fulfill data subject rights at machine speed, and adapt to new regulatory requirements through policy updates rather than system re-architecture.

According to the Verizon 2023 Data Breach Investigations Report, stolen or compromised credentials are at the root of more than 50% of all breaches, and 86% of basic web application attacks use stolen credentials for initial access. The identity and access management market is projected to reach $34.52 billion by 2028, according to Fortune Business Insights, growing at 14.5% annually. Investment in identity infrastructure is accelerating, and the security case for it is well established.

The governance case is less well understood. Identity and access management systems and data governance systems operate on separate enforcement planes, which means that even well-authenticated users routinely access personal data in ways that no governance layer tracks or controls. IAM tells you who a user is and what systems they can reach. It does not tell you what data they actually touched, whether that data is personal, or whether the access aligned with a privacy policy, a consent preference, or a regulatory obligation.

Most enterprises have deployed single sign-on, enforced multi-factor authentication, and built role-based access control matrices. Yet when a privacy engineer asks a direct question, "Which systems contain this customer's personal data, and which identities have accessed it in the last 90 days?"

The answer typically requires a manual investigation spanning weeks. The identity layer and the data layer operate as separate worlds, governed by separate teams, enforced by separate infrastructure. That separation is where governance breaks down.

What is identity and access management in cybersecurity?

Within cyber security, identity and access management serves as the perimeter logic for system-level access. It governs authentication protocols, session management, privilege escalation controls, and credential hygiene. IAM in cyber security is fundamentally concerned with preventing unauthorized entry to systems and applications, answering the question of whether a given identity should be allowed through a given door.

What it does not address is what happens once that identity is inside. A user authenticated through an IAM system and authorized to access a database can query, copy, export, or transform personal data in ways that no identity and access management tool currently tracks at the data-object level. The security perimeter is intact. The data governance perimeter does not exist.

This distinction matters because modern privacy regulations do not care about system access in isolation. They care about data access: what personal data was processed, for what purpose, under what legal basis, and whether the data subject consented. IAM infrastructure was never designed to answer those questions.

Why adding process does not close the gap

The instinct, when confronted with the gap between IAM and data governance, is to layer on process. Add a review step. Create a new approval workflow. Build a dashboard that cross-references IAM logs with data classification tags. Hire a team to reconcile the two.

This instinct is understandable, but it addresses the symptom rather than the cause.

The disconnect between identity controls and data governance is not a process gap but an infrastructure gap. No amount of workflow orchestration will compensate for the fact that the systems managing identity and the systems managing data operate on fundamentally different models, different schemas, and different enforcement planes.

Consider the mechanics. A typical enterprise runs identity and access management services across multiple providers: one for workforce identity, another for customer identity and access management, a third for cloud infrastructure permissions. Each maintains its own policy model. Each defines "access" differently. None of them has a native concept of "personal data" or "consent status" or "data processing purpose."

Meanwhile, data governance lives in a parallel universe. Data catalogs classify columns and tables. Privacy teams maintain records of processing activities. Consent management platforms store user preferences. These systems have no native awareness of who is accessing the data they govern, or whether that access conforms to the policies they define.

The result is two complete, internally consistent systems that share no common enforcement layer. Policies exist in both. Enforcement happens in neither, at least not in a unified way.

The Difference Between Identity Management and Access Management

Identity management and access management are often collapsed into a single acronym, but they address distinct concerns. Identity management is the discipline of establishing, maintaining, and eventually retiring digital identities. It covers provisioning, lifecycle management, directory services, and identity verification.

Access management governs what those identities are permitted to do: which applications they can enter, which API endpoints they can call, which database roles they inherit.

The difference is important because most organizations have invested heavily in identity management, including strong authentication, directory synchronization, and automated provisioning, while access management remains coarse-grained. Access decisions are typically made at the application or system level, not the data level. A user is granted access to a CRM platform, not to a specific subset of personal data within that platform based on their role, the data subject's consent, and the applicable regulatory jurisdiction.

This is where the infrastructure argument becomes concrete. Bridging identity management and access management at the data layer requires a policy engine that understands both the identity context and the data context simultaneously. That engine does not exist in traditional IAM architectures.

Three ways IAM fragments at scale

Three specific mechanics cause identity and access management to fragment as organizations grow. Each is structural, not incidental.

Identity source proliferation

A mid-stage SaaS company might manage workforce identities in Okta, customer identities in Auth0, machine identities in HashiCorp Vault, and cloud permissions in AWS IAM. Each system enforces its own policy model. Each defines roles and permissions differently. There is no single source of truth for "who can access what data" because "data" is not a first-class object in any of these systems.

Policy inconsistency across enforcement points

Access policies defined in an IAM system rarely match the data handling policies defined by a privacy team. An engineer might have IAM-level access to a production database but no legitimate purpose for querying the personal data it contains. The IAM policy says "allowed." The privacy policy says "not without a lawful basis." These two policies are enforced by different systems, written in different languages, and maintained by different teams.

Absence of data context in access decisions

Traditional identity and access management tools make binary decisions: this identity has this role, this role grants access to this resource. They do not evaluate whether the resource contains personal data, whether the data subject has consented to this type of processing, or whether the access request aligns with a documented processing purpose. The access decision is identity-aware but data-blind.

These are not edge cases. They are the default operating condition for any organization running more than a handful of systems with personal data.

Organizations consistently discover, through automated data inventory and classification, that their IAM coverage maps poorly onto their actual data footprint. Systems that IAM considers out of scope contain significant volumes of personal data. Roles that appear narrowly scoped at the application level grant broad, undifferentiated access at the data level.

Why IAM needs to understand data

The answer is operational, not theoretical. Consider a data subject access request. Under GDPR, CCPA, and similar regulations, an organization must locate all personal data associated with a requesting individual, determine who has accessed it, and produce a complete record. This requires traversing both the identity layer and the data layer simultaneously.

If those layers are disconnected, fulfillment becomes a manual, cross-team exercise. Privacy teams query data catalogs. Security teams pull IAM logs. Engineers write custom scripts to correlate the two. Organizations that lack unified infrastructure spend significantly more time and resources on each request, because fulfillment requires manual traversal of two disconnected systems.

The same logic applies to consent enforcement. A customer withdraws consent for marketing data processing. The consent management platform records the preference. Unless that preference is connected to the access controls governing the marketing analytics database, the data continues to be processed, because the policy is defined in one system while enforcement lives in another.

The infrastructure-first model: One policy engine for IAM and data governance

The alternative is not another tool layered on top of existing IAM infrastructure. It is a policy engine that operates at the intersection of identity and data, enforcing rules at the data layer itself.

This model has three defining characteristics.

Policy as code

Access and data governance policies are defined in machine-readable, version-controlled code rather than in spreadsheets, wikis, or application-specific configuration files. This means policies can be tested, reviewed, deployed, and audited using the same engineering workflows that govern application code.

An open-source privacy engineering framework built on this principle encodes privacy and access policies as code that is enforceable at the data layer, version-controlled, testable, and deployable through the same engineering workflows that govern application code.

Data-aware access decisions

Instead of making access decisions based solely on identity and role, the policy engine evaluates the data context: what type of data is being accessed, whether it is classified as personal, what consent preferences apply, and what regulatory jurisdiction governs the processing. This transforms access control from a binary gate into a contextual evaluation. The same identity might be granted access to aggregated analytics data but denied access to the underlying personal records, based on the same policy, enforced at the same layer.

Dynamic enforcement

Static role assignments cannot keep pace with the velocity of data creation, system proliferation, and regulatory change in a modern SaaS environment.

AI-driven policy enforcement enables dynamic access decisions based on real-time data context. When a new data store is provisioned, when a customer updates consent preferences, or when a new regulatory requirement takes effect, the enforcement layer adapts without manual intervention.

How does identity and access management work in this model?

Traditional IAM continues to handle what it does well: authentication, session management, credential lifecycle, and coarse-grained authorization at the application level. The policy engine does not replace IAM. It extends IAM's enforcement boundary to the data layer.

In practice, this means the policy engine ingests identity context from existing IAM providers and combines it with data context from automated classification and consent management systems. The resulting access decision reflects both who the user is and what the data is, which is the bridge that current identity and access management solutions lack.

The integration is bidirectional. When the policy engine determines that a particular data category requires elevated access controls, it can push that requirement back to the IAM layer as a policy constraint. When IAM detects a new identity or role, the policy engine evaluates what data-level permissions that identity should inherit based on the current policy set.

This composability is critical. Organizations do not need to rip out their existing IAM infrastructure. They need a layer that connects identity decisions to data decisions, enforced consistently, defined as code, and auditable end to end.

Benefits of unified policy infrastructure

When identity and data governance share a single policy engine, several capabilities that were previously manual or impossible become automatic.

Privacy rights fulfillment at machine speed

Data subject access requests, deletion requests, and portability requests require traversing the identity-data boundary. With a unified policy engine, the system already knows which identities map to which data, which systems hold that data, and which policies govern its processing. Fulfillment becomes an infrastructure operation, not a cross-team project.

Consent-aware access control

When a customer updates a consent preference, the change propagates through the policy engine to every system that processes their data. Access controls adjust in real time. Marketing systems lose access to data for which consent has been withdrawn. Analytics pipelines automatically exclude non-consented records. The policy is defined once and enforced everywhere.

Developer velocity without governance drag

Engineers building new features or integrating new data sources operate within policy guardrails that are technically enforced, not documented in a wiki they may never read. The policy engine evaluates new data flows against existing policies at deployment time. Teams can move quickly because the boundaries are clear and automatic.

Regulatory resilience across jurisdictions

When a new regulation takes effect or an existing one is amended, the change is expressed as a policy update in code. The engine evaluates the entire data estate against the new policy and identifies where enforcement must change. This is not a quarterly audit exercise but a continuous, automated evaluation.

Identity and access management needs a data layer

IAM solves the authentication and authorization problem well. What it does not solve is the governance problem: who accessed what personal data, under what legal basis, and whether that access aligned with the data subject's consent preferences and applicable regulatory requirements. As privacy regulations grow more specific and enforcement more frequent, that gap compounds with every new data system, every new jurisdiction, and every new processing purpose an organization adds.

The organizations that close this gap will do so at the infrastructure level, by connecting identity decisions to data decisions through a single, policy-driven enforcement layer. Those that continue to address it through process improvements and manual reconciliation will find the gap widens faster than any workflow can close it.

Ethyca addresses the policy enforcement gap this article describes through two products. Fides, the world's most-used open-source privacy engineering toolkit, encodes privacy and access policies as machine-readable, version-controlled code, enforced consistently across all infrastructure, vendors, and AI workflows. Astralis, Ethyca's AI policy enforcement product, scales data access intelligently, from policy definition to enforcement, making privacy rules executable by design across training, inference, and analytics pipelines.

Across more than 200 global brands, including The New York Times, Ramp, and SurveyMonkey, Ethyca manages more than 744 million privacy preferences and has delivered over $74 million in operational savings. To learn how Ethyca's unified policy engine bridges identity and access management with data governance, get in touch with our team.

FAQs

What is identity and access management and why is it insufficient for privacy?

Identity and access management governs who can reach which systems and under what conditions. It handles authentication, authorization, and credential lifecycle well. It does not evaluate what personal data exists within those systems, what legal basis governs it, or whether a specific user's access aligns with the data subject's consent preferences or applicable regulatory obligations.

What is the difference between identity management and access management?

Identity management establishes, maintains, and retires digital identities: provisioning, directory services, and lifecycle management. Access management governs what those identities are permitted to do. Most organizations have invested heavily in identity management while access management remains coarse-grained, operating at the system or application level rather than the data category level.

Why does role-based access control fail at the data governance layer?

Role-based access control makes binary decisions: this role grants access to this system. Privacy governance requires contextual decisions: this identity can access this data category, for this purpose, under this legal basis, given this consent status. RBAC was not designed to evaluate those conditions, so it grants system-level access that does not reflect the data-level permissions privacy regulations actually require.

What is a policy engine in the context of IAM and data governance?

A policy engine is infrastructure that evaluates access decisions against both identity context and data context simultaneously. Rather than asking only whether a user has the right role, it asks whether the data being accessed is personal, what processing purpose applies, and whether the data subject's consent covers that use. It enforces the answer at the data layer rather than the application layer.

How does unified IAM and data governance policy enforcement support regulatory compliance?

When identity and data governance share a single policy engine, consent changes propagate automatically to every system processing the relevant data, data subject rights requests can be fulfilled by traversing the identity-data boundary programmatically, and access decisions automatically align with the jurisdiction-specific requirements applicable to each data subject. Compliance becomes a continuous property of the infrastructure rather than a periodic audit exercise.

[X Twitter][Linkedin]

[4 articles]