AI Risk Management Auditor Core

AI Governance Framework

The AI Risk Management
Auditor Core

A comprehensive audit framework covering seven critical risk domains — from bias and hallucination to security and governance — for organisations deploying AI responsibly in 2025 and beyond.

May 2026 7 Risk Domains · 25+ Controls NIST AI RMF · EU AI Act · ISO 42001

Artificial intelligence spending is projected to surpass $2 trillion globally by 2026, yet the governance frameworks to manage that investment safely remain immature inside most organisations. The EU AI Act began enforcing high-risk system requirements in August 2026, while regulators in banking, healthcare, and insurance are increasingly demanding structured evidence that AI systems are fair, explainable, secure, and controlled.

The AI Risk Management Auditor Core is a structured, domain-by-domain blueprint covering every major category of AI risk. It gives auditors, risk officers, compliance teams, and AI engineers a shared language and a testable set of controls — grounded in NIST AI RMF, ISO/IEC 42001, and real-world incident taxonomy — to evaluate and harden any AI deployment.

88%

of businesses now use AI in at least one function

average GenAI apps running per enterprise, 10% classified high-risk

$670K

additional breach cost from Shadow AI per IBM 2025 data

Seven Domains. One Audit Standard.

Each domain below represents a distinct class of AI risk. Together they form a complete audit surface — from the data that trains the model to the policies that govern its deployment. Within each domain you will find the specific controls, testing methods, and assessment criteria that a mature AI audit programme must address.

🎯

Domain 01

Bias Risk

Critical

Bias in AI systems produces discriminatory outcomes in hiring, lending, healthcare triage, insurance pricing, and more. It arises from skewed training data, flawed labelling, or model architecture choices that encode historical prejudice. Regulatory frameworks — including the EU AI Act and US state-level AI laws — now mandate bias testing and fairness documentation for high-risk AI systems. An undetected bias problem is both an ethical and a legal liability.

Audit Controls

Bias Testing

Systematically test model outputs across demographic subgroups (gender, race, age, disability) using labelled evaluation sets. Document results and regression-test after every model update.
Dataset Diversity Audit

Profile training and evaluation data for representation gaps. Under-represented groups produce unreliable model behaviour at inference time and should trigger data sourcing or balancing actions.
Fairness Metrics — Demographic Parity

Measure whether positive prediction rates are equal across protected groups. A statistically significant gap triggers remediation or use-restriction review.
Fairness Metrics — Equal Opportunity

Ensure the true-positive rate (recall) is consistent across groups. Disparities in recall directly translate to inequitable access to beneficial outcomes — a primary concern in credit and clinical decision support.

🌀

Domain 02

Hallucination Risk

Critical

Generative AI models confidently produce inaccurate or entirely fabricated outputs — a behaviour known as hallucination. This creates operational exposure (wrong decisions), legal exposure (defamatory content, fabricated citations), and reputational damage. NIST flags hallucinations as a defining risk of generative models that must be evaluated from the earliest stages of system design. Effective control requires a layered approach: validate before deployment, filter at runtime, and ground responses in verified sources.

Audit Controls

Output Validation

Define accuracy and factual-consistency benchmarks. Apply automated scoring (ROUGE, BERTScore, LLM-as-judge) on representative test sets. Track hallucination rates across model versions.
Guardrails & Content Filters

Deploy input and output filters to detect and block unsafe, inaccurate, or out-of-scope content. Tools include NVIDIA NeMo Guardrails, LlamaGuard, and custom classifier layers. Test guardrails adversarially.
RAG Validation (Retrieval-Augmented Generation)

Audit the retrieval component: verify source quality, freshness, and relevance scoring. Test end-to-end faithfulness — does the generated answer accurately reflect retrieved documents? Monitor for “context poisoning” where injected documents skew output.

⚠️

Auditor Note

Hallucination rates are not static. They vary with temperature settings, prompt construction, and context length. Audit should include stress-testing at boundary conditions, not just average-case benchmarks.

📉

Domain 03

Model Drift Risk

High

A model that performs well at deployment can silently degrade as the real world changes. Model drift is the divergence between the statistical properties of production data and the data on which the model was trained. A 2026 Risk Management Magazine analysis highlights that credit-scoring models systematically misjudge applicants from emerging demographic segments when training data fails to stay representative. Regulators now increasingly expect continuous oversight — not one-off validation at launch.

Audit Controls

Performance Monitoring

Define key performance indicators (accuracy, precision, recall, AUC) and alert thresholds. Log model predictions in production and compare against ground-truth labels on a defined cadence.
Data Drift Detection

Monitor statistical distributions of input features (using PSI, KS-test, or Jensen-Shannon divergence). Significant shifts in input distributions are early warning signs before output quality degrades.
Concept Drift Detection

Detect when the underlying relationship between inputs and outputs has shifted — even if input distributions appear stable. Requires labelled ground-truth data and tracking error rates over time or using drift-detection algorithms (ADWIN, Page-Hinkley).
Retraining & Revalidation Policy

Document triggers for retraining (drift thresholds, scheduled reviews, incident events). Revalidation must include bias re-testing, performance benchmarking, and documented approval before re-deployment.

🔍

Domain 04

Explainability Risk

High

Advanced AI models — particularly deep neural networks and ensemble methods — operate as opaque black boxes. In regulated domains, this opacity directly undermines compliance: GDPR’s Article 22 grants individuals a “right to explanation” for automated decisions. ISACA’s 2025 audit guidance notes that explainability tools like SHAP and LIME have become the primary mechanism for making AI logic accessible to non-technical stakeholders, auditors, and regulators. Without them, accountability is impossible and legal risk is unquantifiable.

Audit Controls

Explainability Tools — SHAP

SHapley Additive Explanations assigns each feature a contribution score for individual predictions. Use SHAP for both global model behaviour analysis and local per-prediction auditing. Particularly effective for tabular models; computationally intensive at scale.
Explainability Tools — LIME

Local Interpretable Model-agnostic Explanations generates locally faithful approximations around individual predictions. Effective for instance-level audit of specific decisions — e.g., why a specific loan application was declined.
Model Documentation & Model Cards

Require and review standardised model cards documenting intended use, performance across subgroups, limitations, and training data provenance. Model cards are now a compliance artefact under the EU AI Act for high-risk systems.
Feature Importance Review

Audit which input features drive model predictions. Flag legally protected features (or their proxies) appearing as high-importance predictors — a key mechanism for detecting hidden discriminatory logic.

🔒

Domain 05

Data Privacy Risk

Critical

AI systems consume vast personal datasets during training and inference, creating multi-layered privacy exposure. Data may be inadvertently memorised by large language models, leaked through model outputs, or exfiltrated via prompt injection. Post-hoc explainability methods (SHAP, LIME) can themselves act as privacy leakage vectors by revealing sensitive behavioural patterns. The convergence of data governance and AI governance — now an industry trend — demands unified oversight covering both data processing and model-level privacy controls.

Audit Controls

Data Minimisation

Verify that only the minimum personal data necessary for the AI task is collected and used. Audit data pipelines for scope creep and review legal basis documentation for every data category processed.
Purpose Limitation

Confirm that data collected for one stated purpose is not being repurposed for AI model training without consent or a valid legal basis. Particularly critical when third-party data pipelines feed AI systems.
Data Masking & Anonymisation

Audit the anonymisation pipeline: confirm techniques meet regulatory standards (k-anonymity, differential privacy, synthetic data generation). Test for re-identification risk before data is used in training or evaluation.
Access Control & Encryption

Review role-based access controls on training datasets, model artefacts, and inference logs. Verify encryption at rest and in transit. Audit access logs for unusual query patterns that may indicate data extraction.

🛡️

Domain 06

Security Risk

Critical

AI systems introduce a novel attack surface that goes far beyond traditional application security. Adversaries can manipulate model behaviour through carefully crafted inputs, corrupt training pipelines, steal proprietary models, or infer private training data — all without ever accessing source code or infrastructure directly. The 2026 OWASP Top 10 for Agentic Applications specifically catalogues these AI-specific attack vectors as primary risk categories requiring dedicated security controls.

Audit Controls

Penetration Testing & Red Teaming

Commission AI-specific red team exercises that simulate realistic adversarial actors. Document scope, methodology, findings, and remediation evidence. Repeat annually and after major model changes.
Prompt Injection Testing

Test whether malicious inputs embedded in user queries, documents, or retrieved context can hijack model instructions. Prompt injection is the dominant attack vector against LLM-powered applications and requires dedicated testing libraries and human red-teamers.
Adversarial Input Testing

Generate and test adversarial examples — inputs crafted to cause misclassification or unsafe outputs. Use techniques such as FGSM, PGD, or AutoAttack. Measure robustness and document acceptable degradation bounds.
Data Poisoning Testing

Validate the integrity of the training data pipeline. Simulate poisoning attacks where adversarial samples are injected into training data to create backdoors or systematic errors. Audit data provenance and ingestion controls.
Model Extraction Attack Testing

Assess whether an adversary can reconstruct a functional copy of the model through API queries alone. Evaluate rate limiting, output obfuscation, and API design controls that limit information leakage from model responses.
Membership Inference Attack Testing

Determine whether an attacker can infer whether a specific individual’s data was used in training. High membership inference risk signals insufficient privacy controls in the training pipeline and may violate GDPR data processing obligations.

🚨

Emerging Threat: Agentic AI

Autonomous AI agents that plan and execute multi-step tasks introduce goal hijacking and cascading multi-agent failure risks that traditional security testing does not cover. Red teaming must include agentic threat scenarios for any autonomous AI deployment.

🏛️

Domain 07

Governance Risk

High

All technical controls exist within an organisational governance structure. Without clear policies, defined accountability, a maintained risk register, and a living AI inventory, controls are ad hoc and unauditable. The NIST AI RMF’s Govern function directly maps to EU AI Act Article 9 risk management requirements, meaning that governance maturity is simultaneously a compliance requirement and a prerequisite for all other risk domains to function effectively.

Audit Controls

AI Policies & Standards

Review the existence, currency, and enforceability of AI-specific policies covering acceptable use, prohibited use cases, third-party model procurement, and incident response. Policies must be approved at Board or senior leadership level.
Roles & Responsibilities

Confirm that ownership of AI risk is clearly assigned: who is accountable for model performance, who approves deployments, who manages incidents. Absence of clear accountability is a critical governance finding.
AI Risk Register

Assess the quality and maintenance of the AI risk register: is every known risk documented with likelihood, impact, owner, and status? Is it reviewed regularly? Does it feed into enterprise risk management processes?
AI Inventory

Verify that a comprehensive, up-to-date inventory of all AI systems exists — including Shadow AI. The average enterprise runs 66 GenAI applications; without an inventory, risk management is blind. IBM data shows Shadow AI adds $670K to breach costs and 10 additional days to containment.
Risk Management Process

Evaluate whether a documented, repeatable process exists for assessing AI risk at procurement, development, deployment, and ongoing monitoring stages. The process should include risk appetite thresholds, escalation paths, and independent review gates.

Quick Reference: Domain Summary

Use this matrix to rapidly orient auditors, steering committees, or compliance reviews to domain scope, severity, and primary standards alignment.

Risk Domain	Severity	Controls Count	Primary Standards	Key Technique
Bias Risk	Critical	4	EU AI Act Art.10, IEEE 7003	Demographic Parity, Equal Opportunity
Hallucination Risk	Critical	3	NIST AI RMF, ISO 42001	RAG Validation, Guardrails
Model Drift Risk	High	4	EU AI Act Art.72, NIST Map	PSI, KS-Test, ADWIN
Explainability Risk	High	4	GDPR Art.22, EU AI Act Art.15	SHAP, LIME, Model Cards
Data Privacy Risk	Critical	4	GDPR, ISO 27701, EU AI Act	Differential Privacy, k-Anonymity
Security Risk	Critical	6	OWASP LLM Top 10, NIST CSF	Red Teaming, Adversarial Testing
Governance Risk	High	5	NIST Govern, ISO 42001 Cl.6	AI Inventory, Risk Register

Actionable Insights for Audit Leaders

💡

Start With the Inventory

You cannot audit what you cannot see. The AI inventory is the foundation of every other governance control. Before running any technical test, confirm that a complete, maintained list of all AI systems — including third-party tools and Shadow AI deployments — exists and is owned by a named responsible party.

💡

Treat Technical and Governance Risks as Inseparable

Bias testing without a retraining policy is a point-in-time snapshot, not a control. Security penetration testing without a documented incident response process yields findings with no remediation path. Effective AI audit always closes the loop between technical findings and governance accountability.

💡

Map Once, Comply Across Jurisdictions

The NIST AI RMF, EU AI Act, and ISO 42001 share significant structural overlap. A governance investment that satisfies the NIST Govern function simultaneously addresses EU AI Act Article 9. Organisations operating across jurisdictions should build a single control framework with explicit cross-walk mapping rather than duplicating compliance programmes.

⚠️

Don’t Overlook Third-Party Model Risk

Most organisations inherit risk embedded in external or open-source AI components. The March 2025 NIST AI RMF update specifically emphasises model provenance and third-party model assessment. Apply the full auditor core framework to vendor-supplied models — not just internally built ones.

Conclusion: The Audit Standard AI Demands

The era of AI adoption without structured risk oversight is ending. Regulatory enforcement is accelerating — the EU AI Act’s high-risk provisions are now active, US state-level AI laws are proliferating, and sector regulators in banking, insurance, and healthcare are demanding documented evidence of fairness, security, and governance maturity.

The AI Risk Management Auditor Core provides a tested, comprehensive architecture for that evidence. Its seven domains — Bias, Hallucination, Model Drift, Explainability, Data Privacy, Security, and Governance — map to every meaningful axis of AI risk that organisations face today. Together, they enable audit teams to move from ad hoc reviews to systematic, repeatable, and defensible assessments.

The organisations that embed this framework now — before incidents force their hand — will not only reduce risk exposure; they will build the stakeholder trust and regulatory capital that become enduring competitive advantages as AI becomes ever more central to every industry.

Referenced frameworks: NIST AI RMF 1.0 (2023, updated 2025) · EU AI Act (Enforcement 2025–2026) · ISO/IEC 42001:2023 · OWASP Top 10 for LLM Applications · ISACA AI Audit Guidance 2025 · GDPR Article 22