AI in Defense: Governance & Risk

Summary

Governance is the gate that lets defense AI reach the field. The DoD Responsible AI framework rests on five principles: responsible, equitable, traceable, reliable, and governable. Programs must also clear rigorous test and evaluation, earn an Authority to Operate through the Risk Management Framework, respect ITAR and export controls, and keep a human in the loop for any lethality decision. DoD Directive 3000.09 governs autonomy in weapon systems. Strong governance is not friction; it converts a promising model into an accredited, auditable capability a commander can trust and employ under fire.

Context

Governance is the license to field

In defense, an AI model with no accreditation is a lab curiosity. The Department of Defense adopted five Responsible AI principles in 2020 (responsible, equitable, traceable, reliable, and governable), and the CDAO published a Responsible AI Toolkit and a strategy to operationalize them across the acquisition lifecycle. These are not aspirational statements; they map to gates a program must clear before a system touches a mission network, and each gate has an owner who can stop the program if the evidence is not there.

The stakes are concrete. DoD Directive 3000.09, updated in 2023, governs autonomy in weapon systems and requires senior-level review before development and again before fielding of systems that select and engage targets. Meanwhile the Risk Management Framework (RMF) under NIST 800-37 controls whether any system earns an Authority to Operate (ATO) on a classified or controlled network. A model that cannot be explained, tested, or audited will not pass, no matter how accurate it looked on a benchmark, because the authorizing official is accountable for the risk it introduces to the network and the mission.

Export control adds a second axis that commercial teams routinely underestimate. Model weights, training data, and technical documentation for defense systems can fall under the International Traffic in Arms Regulations, meaning a transfer to a foreign national, a coalition partner, or even the wrong cloud region can constitute an unlicensed export with serious exposure. Governance therefore spans the technical, the legal, and the operational at once: a program that nails test and evaluation but skips the ITAR review can still be blocked from the very partner transfer it was built to enable. The practical answer is to treat governance as an integrated set of gates, each with an owner and an artifact, that run alongside development rather than at the end of it.

The framework

Five gates every defense AI capability must clear

Treat governance as a sequence of gates owned by named authorities, each with an artifact the program must produce and defend. A gate without an owner and an artifact is a slogan, and slogans do not earn an Authority to Operate.

Governance gate	What it enforces	Owner and artifact
Responsible AI principles	Traceable reasoning, reliability bounds, human governability	Program office, RAI assessment
Test and evaluation	Performance under operational and adversarial conditions	T and E authority, evaluation report
RMF and ATO	Security controls, monitoring, incident response	Authorizing official, ATO package
Export control (ITAR)	Control of model weights, training data, and technical data	Compliance office, ITAR review
Human-in-the-loop	Human approval for lethality and targeting	Commander, documented checkpoint

Recommended actions

Wire governance into the program from day one

Map each Responsible AI principle to a concrete artifact and a named owner, so traceability and governability are testable claims rather than slogans in a briefing.
Bring the test and evaluation community and the authorizing official into the design phase, and treat the ATO as a schedule-driving milestone with dependencies, not a rubber stamp at the end.
Run an ITAR and export-control review on model weights, training data, and technical data before any transfer, including to coalition partners and cloud regions outside the accredited boundary.
Document the human-in-the-loop checkpoint for lethality decisions as a hard requirement, with the approval authority, the audit trail, and the override mechanism all defined in writing.
Stand up continuous monitoring for model drift and adversarial behavior, because an ATO is a point-in-time judgment on a system that keeps changing as data and threats evolve.

Common pitfalls

How governance gets bolted on too late

Building the model first and discovering in test and evaluation that it cannot be explained or bounded, forcing a costly rebuild before it can earn an ATO.
Treating ITAR as a legal afterthought and blocking a coalition or partner transfer that the whole program depended on for its operational value.
Confusing a benchmark score with operational reliability, so the system fails under adversarial or degraded conditions the enemy will deliberately create and exploit.
Documenting human-in-the-loop on paper but designing a workflow so fast or opaque that the human cannot meaningfully review, question, or intervene in the decision.

Metrics that matter

Measure whether governance actually holds

Time from capability build to accredited ATO, and number of ATO conditions outstanding.
Share of models with a complete, auditable provenance and reasoning trail.
Adversarial and degraded-condition test pass rate versus benign benchmark.
Number of export-control or human-in-the-loop findings caught before fielding versus after.

FAQ

Frequently asked questions

What are the DoD Responsible AI principles?

Five principles adopted in 2020: responsible, equitable, traceable, reliable, and governable. The CDAO Responsible AI Toolkit maps each to concrete assessment steps across the acquisition lifecycle.

Does a defense AI model need an ATO?

Yes. Any system on a classified or controlled network must earn an Authority to Operate through the Risk Management Framework. No ATO means no fielding, regardless of model accuracy.

What governs AI in weapon systems?

DoD Directive 3000.09, updated in 2023, governs autonomy in weapon systems and requires senior-level review before development and before fielding of systems that select and engage targets, with a human retaining appropriate judgment.

AI in Defense: Governance & Risk

Governance is the license to field

Five gates every defense AI capability must clear

Wire governance into the program from day one

How governance gets bolted on too late

Measure whether governance actually holds

Frequently asked questions

What are the DoD Responsible AI principles?

Does a defense AI model need an ATO?

What governs AI in weapon systems?

Related reading

This is a taste. The full library goes deeper.

Stratenity is the AI Operating System for Strategic Execution.

AI in Defense: Governance & Risk

Governance is the license to field

Five gates every defense AI capability must clear

Wire governance into the program from day one

How governance gets bolted on too late

Measure whether governance actually holds

Frequently asked questions

What are the DoD Responsible AI principles?

Does a defense AI model need an ATO?

What governs AI in weapon systems?

Related reading

Found this useful? Pass it on.

This is a taste. The full library goes deeper.

Stratenity is the AI Operating System for Strategic Execution.