AI Governance

The Finance Team’s 6-Step Guide to AI Model Risk Management (SR 11-7)

SR 11-7 has governed model risk in US banking since 2011. Its enduring principles apply to AI, but AI strains the framework’s assumptions. Here is a six-step guide to managing AI model risk under SR 11-7 in 2026.

Kognitos 13 min read
The finance team's six-step guide to AI model risk management under SR 11-7 in 2026: govern and inventory, tier by risk, validate with effective challenge, document the lifecycle, monitor continuously, and address the AI-specific strains, with how AI strains the framework's static-model assumption. By Kognitos.

TL;DR

SR 11-7 is the Federal Reserve and OCC’s Supervisory Guidance on Model Risk Management (April 2011, with companion guidance OCC 2011-12), the definitive US standard for managing model risk in banking. It is built on three pillars — sound model development, rigorous independent validation, and effective governance — and the concept of effective challenge: critical analysis of a model by competent, independent parties. Its principles were written broadly enough to apply to any quantitative model, including AI and machine learning, so AI/ML models used in banking (credit decisioning, fraud detection, AML, pricing) fall under SR 11-7.

The challenge in 2026 is that SR 11-7 assumed models are simplified, relatively static representations, while modern AI is dynamic, probabilistic, and increasingly autonomous — which strains the framework’s assumptions even as it remains the stable reference point for model governance. Managing AI model risk under SR 11-7 therefore means applying its enduring principles while adapting validation and monitoring to AI’s nature.

Six steps:

  1. Govern and inventory: Establish governance, policies, and a complete inventory of AI/ML models, treating them as models subject to MRM.
  2. Tier by risk: Classify models by materiality and complexity so oversight is proportionate, focusing effort on high-risk models.
  3. Validate with effective challenge: Independently validate conceptual soundness, testing, and outcomes, adapting validation to AI with explainability testing, robustness checks, and stress testing.
  4. Document the full lifecycle: Maintain documentation sufficient for an independent party to understand the model’s purpose, design, limitations, and use, providing the audit trail.
  5. Monitor continuously: AI models drift and degrade, so set-and-forget validation fails — ongoing monitoring of performance and drift is essential.
  6. Address the AI-specific strains: Explainability of opaque models, the static-model assumption breaking for adaptive systems, and the autonomy of agentic AI — with extra controls where the framework strains.

A cross-cutting theme: AI that is explainable, stable, and auditable is far easier to validate and govern under SR 11-7 than opaque, probabilistic, drifting AI, which is why the architecture of the AI affects how hard model risk management is. For the related governance work, see Deterministic AI vs Generative AI for Finance Controls.

What SR 11-7 is and why AI falls under it

SR 11-7, formally the Federal Reserve’s Supervisory Guidance on Model Risk Management issued April 2011 (with the OCC’s companion guidance OCC 2011-12), is the definitive supervisory standard for managing model risk in US banking. It arose from the recognition that quantitative models had become central to bank decision-making — credit underwriting, valuation, risk management, reserve adequacy — and that models carry risk: the risk of adverse consequences from decisions based on models that are incorrect or misused.

The framework rests on three pillars. Sound model development: models should be developed with rigor, conceptual soundness, and appropriate data. Rigorous validation: models should be independently validated to confirm they work as intended, through the principle of effective challenge — critical analysis by informed, competent, independent parties who can identify limitations and assumptions. And effective governance: policies, controls, roles, documentation, and accountability across the model lifecycle. Model risk is to be managed like other risks, identifying sources and magnitude, considering both individual and aggregate model risk.

Crucially for AI, SR 11-7’s principles were written broadly enough to apply to any quantitative model regardless of methodology. So when a bank deploys an AI or machine learning model — for credit decisioning, fraud detection, AML surveillance, pricing — the model falls under SR 11-7, and the bank must manage it as a model: develop it soundly, validate it independently, and govern it properly. AI does not get a pass from MRM; it is subject to it. This is why AI model risk management is, for banks, largely a question of applying SR 11-7 to AI — which is what the six steps below address.

The 2026 challenge: SR 11-7 was written for static models

Before the steps, the central tension worth understanding. SR 11-7 was written in 2011, before AI/ML were mainstream in finance, and it was grounded in a particular assumption: that models are simplified, relatively static representations of real-world relationships. A traditional model — a credit scorecard, a valuation model — is built, validated, and then used in a stable form until it is revised, so validation at a point in time and periodic revalidation fit it well.

Modern AI strains this assumption in three ways. AI models are often dynamic rather than static — they can learn and change, so a model validated at one point may behave differently later without an explicit revision. Many are probabilistic rather than deterministic, producing outputs with inherent variability rather than fixed relationships, which complicates validation that assumes consistent behavior. And agentic AI systems are increasingly autonomous, making decisions and choosing actions rather than computing a fixed output, which tests whether the very definition of “model” in SR 11-7 still fits.

The result is a framework whose enduring principles — sound development, independent validation, effective governance — remain sound and necessary, but whose specific validation and monitoring practices have to be adapted to AI’s dynamic, probabilistic, sometimes autonomous nature. SR 11-7 remains one of the few stable reference points for model governance, which makes clarity about its scope and limitations more important, not less. The practical approach is to apply SR 11-7’s principles rigorously while adapting the techniques — adding explainability testing, robustness checks, continuous monitoring — and to recognize where the framework strains and add controls accordingly.

The six steps

Step 1: Govern and inventory

The foundation is governance and a complete model inventory. Establish a model risk governance framework — policies defining MRM activities, procedures implementing them, allocation of resources, roles and responsibilities, and accountability — with independent oversight (often a dedicated MRM function or committee that can escalate to senior leadership). And build and maintain a complete inventory of models, including AI/ML models, because you cannot manage the risk of models you have not identified, and AI models deployed across the institution (sometimes embedded in third-party tools) are easy to miss.

For AI specifically, this step requires recognizing AI/ML systems as models subject to MRM in the first place, which is sometimes the gap: an AI tool adopted by a business unit may not be flagged as a model requiring validation and governance. The inventory should capture the AI/ML models, their purpose, their use, and their risk classification (the next step), and the governance framework should explicitly address AI, defining who is accountable for AI model risk and how AI models are approved, changed, and retired. See AI Audit Trail Requirements: A 2026 Checklist for how auditability connects to model governance.

Step 2: Tier by risk

Not all models warrant the same oversight, so tier them by materiality and complexity to make MRM proportionate and focus effort on the highest-risk models. SR 11-7 calls for considering both individual and aggregate model risk, and leading firms use tiering frameworks that classify models by their materiality (the magnitude of the decisions and exposures they drive) and complexity (how hard they are to understand and validate), concentrating the most rigorous validation and monitoring on the high-risk models while applying proportionate oversight to lower-risk ones.

For AI, tiering matters even more, because AI models vary enormously in risk — an AI model driving credit decisions or AML surveillance carries high materiality and regulatory consequence, while a low-stakes internal AI tool carries little — and the complexity (and opacity) of AI models affects how hard they are to validate. Tiering directs the heaviest MRM effort to the AI models where the risk is greatest, which is both efficient and aligned with SR 11-7’s risk-based intent.

Step 3: Validate with effective challenge

Validation is the core of SR 11-7, and the principle of effective challenge — critical analysis by informed, competent, independent parties who can identify the model’s limitations and assumptions — is central. Validation should confirm the model is conceptually sound, test it (including outcomes analysis comparing model outputs to actual results), and assess its limitations, performed independently of the model’s developers so the challenge is genuine.

For AI, validation has to be adapted to the model’s nature, and this is where 2026 practice is evolving. Leading firms are incorporating explainability testing (can the model’s decisions be understood and explained?), robustness checks against adversarial inputs (does the model behave reliably under unusual or hostile inputs?), and scenario-based stress testing for AI and GenAI outputs (how does the model behave across a range of conditions?). For probabilistic models, validation may use proxies like prompt-variance, stability, and human-alignment to evidence reliability where traditional deterministic validation does not directly apply. The effective-challenge principle endures; the techniques expand to address what makes AI models harder to validate than static ones. See When Confidence Scores Lie: Why “94% Confident” Is Not an Audit Trail for why probabilistic outputs require extra scrutiny.

Step 4: Document the full lifecycle

SR 11-7 requires thorough documentation across the model lifecycle, sufficient for an independent party unfamiliar with the model to understand its purpose, design, capabilities, limitations, and use. Documentation provides the audit trail, enables the model to be understood and challenged, and demonstrates control and compliance to examiners. It should span development, validation, approval, use, changes, and monitoring.

For AI, documentation is both more important and harder. More important because AI models are often less intuitively understandable than traditional models, so the documentation carries more of the burden of making the model comprehensible and its limitations clear. Harder because the model’s behavior may be less transparent — the documentation has to capture not just the design but the validation evidence, the monitoring approach, the data lineage, and the limitations specific to the AI, including, for GenAI and agentic systems, how the model’s decisions can be reconstructed and explained. See 5 SOX Compliance Risks When Using Generative AI in Finance Controls for how documentation gaps create compliance exposure.

Step 5: Monitor continuously

SR 11-7 emphasizes understanding how models behave in practice and implementing controls to manage their risk over time — and for AI this becomes continuous monitoring, because AI models drift and degrade in ways traditional static models do not. A model validated as sound at one point can deteriorate as the data it encounters shifts, its performance decays, or (for adaptive models) its behavior changes, so set-and-forget validation is inadequate. Ongoing monitoring of model performance, output quality, and drift is essential, with thresholds that trigger investigation or revalidation when performance degrades.

This is one of the sharpest adaptations AI requires of SR 11-7 practice. The traditional model of validate-then-periodically-revalidate assumes relative stability between reviews, which AI models violate. Continuous monitoring — watching the model’s behavior in production and catching drift or degradation as it happens rather than at the next scheduled review — is what keeps AI model risk managed between formal validations. The monitoring burden is greater for probabilistic, adaptive models than for stable, deterministic ones, which is part of why the architecture of the AI affects the cost of compliance.

Step 6: Address the AI-specific strains

The sixth step is to recognize and add controls where SR 11-7’s assumptions strain against AI, rather than pretending the framework fits AI perfectly. Three strains matter most. Explainability: where SR 11-7 assumes a model can be understood and its limitations identified, opaque AI models resist this, so additional explainability controls and techniques are needed to meet the framework’s intent. The static-model assumption: where SR 11-7 assumes relative stability, adaptive AI models change, so additional change-detection and continuous-monitoring controls are needed. And autonomy: where SR 11-7 assumes a model produces an output for human use, agentic AI takes autonomous action, which tests whether MRM alone is sufficient or whether additional operational and governance controls around the AI’s autonomy are required.

The practical response is to apply SR 11-7’s principles fully and add controls where the framework strains: stronger explainability requirements for opaque models, robust change-detection and continuous monitoring for adaptive ones, and defined autonomy boundaries and oversight for agentic systems. This is also where the architecture of the AI becomes a model-risk consideration in itself — because some architectures strain the framework far less than others. See RPA vs Agentic AI in Finance: 6 Key Differences for how agentic autonomy differs from prior automation and why it adds governance complexity.

The cross-cutting theme: architecture affects how hard MRM is

Running through all six steps is a theme worth making explicit: how hard it is to manage an AI model’s risk under SR 11-7 depends heavily on the architecture of the AI. Models that are explainable, stable, and auditable are far easier to validate (step three), document (step four), monitor (step five), and reconcile with SR 11-7’s assumptions (step six) than models that are opaque, drifting, and probabilistic.

An explainable model can be understood and challenged, satisfying validation and the effective-challenge principle; an opaque one resists it. A stable model behaves consistently, fitting the framework’s static-model assumption; a drifting one strains it and demands heavier monitoring. An auditable model provides the documentation and reconstructability SR 11-7 requires; an unauditable one cannot. So the choice of AI architecture is, in part, a model-risk-management decision: the more explainable, stable, and auditable the AI, the lower the MRM burden and the better the model fares under supervisory scrutiny.

This is where deterministic, explainable AI is relevant to model risk management, honestly framed. A deterministic system that produces the same output from the same input, executes explainable logic, and is auditable by design strains SR 11-7’s assumptions far less than a probabilistic, opaque, drifting model: it is easier to validate (its logic can be examined and challenged), easier to document (its decisions are reconstructable), easier to monitor (it does not drift the way probabilistic models do), and more consistent with the framework’s static-model assumption. This is the relevance of a platform like Kognitos — which is deterministic, explainable, and auditable by design — in finance processes touching SR 11-7-relevant decisions: not that it replaces model risk management, which remains a required discipline, but that its architecture makes the model risk easier to manage and the model easier to validate and govern under the framework. SR 11-7 compliance is a discipline the institution performs, and architectures that are explainable and auditable make that discipline meaningfully easier than opaque ones do. This connects to the broader architectural argument in Deterministic AI vs Generative AI for Finance Controls: 5 Things CFOs Must Understand.

For related reading: The Top AI Automation Tools for Banking Back-Office Operations, The Hidden Cost of Manual Cash Application, What is Neurosymbolic AI?, and What is English as Code?

Book a working session with a Kognitos solutions engineer  •  Try Kognitos free

Putting it together

SR 11-7 is the definitive US standard for managing model risk in banking, built on sound development, rigorous independent validation through effective challenge, and effective governance — and its principles apply to AI and machine learning models just as to traditional ones. The 2026 challenge is that SR 11-7 assumed models are simplified and relatively static, while AI is dynamic, probabilistic, and increasingly autonomous, which strains the framework even as it remains the stable reference point for model governance. Managing AI model risk under SR 11-7 means applying its enduring principles while adapting the techniques: govern and inventory the AI models, tier them by risk, validate them with effective challenge adapted through explainability and robustness testing, document the full lifecycle, monitor continuously for drift, and add controls where the framework strains against AI’s explainability, adaptivity, and autonomy. The cross-cutting reality is that the architecture of the AI affects how hard this is: explainable, stable, auditable AI is far easier to validate, document, monitor, and govern under SR 11-7 than opaque, drifting, probabilistic AI, which makes the choice of AI architecture itself a model-risk-management consideration.

Last updated: June 2026. Information reflects publicly available sources as of mid-2026, including the Federal Reserve and OCC SR 11-7 / OCC 2011-12 guidance and industry analysis of its application to AI and machine learning. SR 11-7 compliance requirements should be validated with qualified counsel and the institution’s regulators. This article is informational and does not constitute legal, regulatory, or compliance advice.

Frequently asked questions

SR 11-7 is the Federal Reserve’s Supervisory Guidance on Model Risk Management, issued in April 2011 along with the OCC’s companion guidance (OCC 2011-12), and it is the definitive supervisory standard for managing model risk in US banking. It arose from the recognition that quantitative models had become central to bank decision-making — in credit underwriting, valuation, risk management, and reserve adequacy — and that models carry risk: the risk of adverse consequences from decisions based on incorrect or misused models. The framework rests on three pillars: sound model development (rigor, conceptual soundness, appropriate data), rigorous independent validation (through “effective challenge,” critical analysis by competent, independent parties who can identify limitations), and effective governance (policies, controls, roles, documentation, and accountability across the model lifecycle). SR 11-7 directs banks to manage model risk like other risks, considering both individual and aggregate model risk. Its principles were written broadly enough to apply to any quantitative model regardless of methodology, which is why they apply to AI and machine learning models, not just traditional statistical ones.
Yes. SR 11-7’s principles were written broadly enough to apply to any quantitative model regardless of the underlying methodology, so AI and machine learning models used in banking fall under it. When a bank deploys an AI/ML model for credit decisioning, fraud detection, AML surveillance, pricing, or similar functions, that model is subject to SR 11-7’s requirements: it must be developed soundly, validated independently through effective challenge, and governed properly across its lifecycle. AI does not get an exemption from model risk management; it is subject to the same supervisory expectations as traditional models. The complication is that SR 11-7 was written in 2011, before AI/ML were mainstream in finance, and it assumed models are simplified, relatively static representations of real-world relationships. Modern AI, being dynamic, probabilistic, and increasingly autonomous, strains those assumptions, so while the framework clearly applies, its specific validation and monitoring techniques have to be adapted to AI’s nature. Banks must therefore apply SR 11-7’s enduring principles to AI while adapting the practices — adding explainability testing, robustness checks, and continuous monitoring — to address what makes AI models different from the static models the framework originally envisioned.
Validating an AI model under SR 11-7 applies the framework’s core principle of effective challenge — independent critical analysis confirming the model is conceptually sound, testing it (including outcomes analysis comparing outputs to actual results), and identifying its limitations — while adapting the techniques to AI’s nature. Because AI models are often less transparent and more dynamic than traditional models, leading firms supplement traditional validation with AI-specific techniques: explainability testing (can the model’s decisions be understood and explained?), robustness checks against adversarial or unusual inputs, and scenario-based stress testing across a range of conditions. For probabilistic models like GenAI, validation may use quantitative proxies such as prompt-variance, stability, and human-alignment to evidence reasoning reliability where traditional deterministic validation does not directly apply. The validation must be performed independently of the model’s developers to constitute genuine effective challenge. A key practical point is that explainable, stable models are far easier to validate than opaque, drifting ones, because effective challenge requires being able to understand and critically analyze the model, which opaque models resist. The validation should be thoroughly documented to provide the audit trail and demonstrate compliance to examiners.
AI strains SR 11-7 because the framework was written in 2011 around the assumption that models are simplified, relatively static representations of real-world relationships, while modern AI is dynamic, probabilistic, and increasingly autonomous. Three specific strains arise. First, AI models can be dynamic rather than static — they may learn and change, so a model validated at one point can behave differently later without an explicit revision, which the validate-then-periodically-revalidate approach does not anticipate. Second, many AI models are probabilistic rather than deterministic, producing variable outputs rather than fixed relationships, which complicates validation that assumes consistent behavior. Third, agentic AI systems are increasingly autonomous, taking actions and making decisions rather than producing a fixed output for human use, which tests whether SR 11-7’s very definition of a “model” still fits. Despite these strains, SR 11-7 remains one of the few stable reference points for model governance, so the practical approach is not to abandon it but to apply its enduring principles — sound development, independent validation, effective governance — while adapting the techniques and adding controls where the framework strains. Recognizing the strains is what lets institutions manage AI model risk realistically rather than pretending the framework fits AI perfectly.
Model drift — the degradation or change in a model’s behavior over time — is one of the most important AI-specific challenges in model risk management, because it breaks the assumption underlying traditional validation. SR 11-7’s traditional approach of validating a model and then periodically revalidating it assumes relative stability between reviews, which holds for static models but not for AI models, which can drift as the data they encounter shifts, their performance decays, or (for adaptive models) their behavior changes. A model validated as sound can deteriorate before the next scheduled review without anyone realizing, leaving it operating outside its validated performance. This is why continuous monitoring is essential for AI model risk management: ongoing tracking of model performance, output quality, and drift in production, with thresholds that trigger investigation or revalidation when performance degrades, rather than relying on periodic point-in-time validation alone. The monitoring burden is greater for probabilistic and adaptive models, which drift more, than for stable, deterministic ones, which is part of why the architecture of the AI affects the cost and difficulty of model risk management. Managing drift well is central to keeping AI model risk controlled between formal validations.
Explainability is central to AI model risk management under SR 11-7 because the framework’s core requirements — validation through effective challenge, documentation sufficient for an independent party to understand the model, and identification of model limitations — all depend on being able to understand how the model works. An explainable model can be validated (its logic can be examined and critically challenged), documented (its decisions and limitations can be clearly described), and governed (its behavior can be understood and overseen), satisfying SR 11-7’s intent. An opaque model resists all of this: it is hard to validate because effective challenge requires understanding the model, hard to document comprehensibly, and hard to govern because its behavior is not transparent. This is why leading firms incorporate explainability testing into AI model validation, and why explainability is increasingly treated as a requirement rather than a nice-to-have for AI models in regulated finance. It also means the architecture of the AI matters for model risk: explainable architectures (including deterministic, transparent systems) are substantially easier to validate and govern under SR 11-7 than opaque, black-box models, making explainability a practical determinant of how difficult and costly model risk management is for a given AI system.
Deterministic AI — which produces the same output from the same input through explainable logic — makes SR 11-7 compliance meaningfully easier than opaque, probabilistic AI does, though it does not eliminate the model risk management discipline, which remains required. The advantages map directly to SR 11-7’s requirements. Deterministic, explainable systems are easier to validate, because their logic can be examined and subjected to effective challenge, where opaque models resist this. They are easier to document, because their decisions are reconstructable, providing the audit trail SR 11-7 requires. They are easier to monitor, because they do not drift the way probabilistic, adaptive models do — the same input produces the same output over time, so the monitoring focuses on intended logic and data changes rather than unpredictable behavioral degradation. And they fit SR 11-7’s static-model assumption better than dynamic, learning models that the framework strains to accommodate. So while a bank using deterministic AI in model-relevant decisions still performs model risk management (governance, validation, documentation, monitoring), the architecture makes that discipline substantially less burdensome and the model easier to validate and govern under supervisory scrutiny. This is why the choice of AI architecture is itself a model-risk consideration, with explainable, deterministic, auditable systems straining the framework far less than opaque, probabilistic ones.
SR 11-7 rests on three pillars. The first is sound model development: models should be developed with rigor and conceptual soundness, using appropriate data, with developers clearly articulating the theory behind the model, the logic, and the key assumptions, and checking that they are met in practice. The second is rigorous independent validation: models should be validated independently from developers through “effective challenge” — critical analysis by informed, competent, independent parties who can identify the model’s limitations, assumptions, and potential weaknesses. Validation includes evaluating conceptual soundness, the quality of testing and data, and the ongoing monitoring of model performance. The third is effective governance: policies, controls, roles, documentation, and accountability across the full model lifecycle — from development through use, change, and retirement. This includes maintaining a complete model inventory, ensuring senior management accountability, and providing board-level oversight of model risk as a category of risk to be managed. Together, the three pillars ensure that models are built soundly, tested independently, and governed rigorously — the combination that SR 11-7 holds necessary for prudent model risk management.
K
Kognitos
Kognitos

AI model risk is easier to manage when the AI is auditable by design

Kognitos is deterministic, explainable, and auditable — the architecture that strains SR 11-7’s assumptions far less than opaque, probabilistic AI. See how it applies to your finance processes.

Book a Working Session
Or try it free →