Observability Scorecard

Governance tells you what's allowed. Observability tells you what's actually happening. Six monitors for live AI workflows.

The Problem This Solves

Problems surface at the review meeting — when it's too late to course-correct cheaply.

Governance frameworks tell you what rules exist, but they don't tell you whether a live workflow is performing. Without real-time monitoring between formal cadence reviews, problems accumulate silently — costs escalate, quality degrades, adoption drops.

The Observability Scorecard is what the workflow owner watches between reviews. No scorecard data, no credible review.

How It Works

Six monitors, three frequencies

Cost per Run (Weekly)
Tracks: API and compute cost per execution
Threshold: Alert if 7-day avg exceeds 120% of baseline

Output Quality (Weekly)
Tracks: Accuracy, completeness, rubric score
Threshold: Alert if sample audit drops below minimum

Human Escalation Rate (Weekly)
Tracks: % requiring human correction
Threshold: Alert if rate exceeds 25%

Adoption Rate (Fortnightly)
Tracks: % of eligible staff using workflow
Threshold: Alert if drops below 60% after 30 days

Edge Case Accumulation (Fortnightly)
Tracks: Unhandled input types or scenarios
Threshold: Flag when 3+ unhandled in a period

Drift Detection (Monthly)
Tracks: Output character changes without model changes
Threshold: Flag if distribution shifts vs. baseline
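
The thresholds above can be read as simple checks over data the workflow owner already collects. The sketch below is one possible reading, not a prescribed implementation. The function names, the input shapes, the reading of "after 30 days" as days since go-live, and the drift measure (total variation distance with a 0.10 tolerance) are all assumptions made for illustration. Output Quality is left out because its minimum is not specified here.

```python
from collections import Counter
from statistics import mean


def cost_alert(daily_costs, baseline, ratio=1.20):
    """Cost per Run: True if the average of the last 7 values exceeds 120% of baseline."""
    if not daily_costs:
        return False
    return mean(daily_costs[-7:]) > ratio * baseline


def escalation_alert(total_runs, corrected_runs, limit=0.25):
    """Human Escalation Rate: True if the share needing human correction exceeds 25%."""
    if total_runs == 0:
        return False
    return corrected_runs / total_runs > limit


def adoption_alert(active_users, eligible_users, days_live, floor=0.60, grace_days=30):
    """Adoption Rate: True if usage is below 60% once 30 days have passed.

    Assumption: "after 30 days" means 30 days since go-live.
    """
    if days_live < grace_days or eligible_users == 0:
        return False
    return active_users / eligible_users < floor


def edge_case_flag(unhandled_count, limit=3):
    """Edge Case Accumulation: True at 3 or more unhandled cases in the period."""
    return unhandled_count >= limit


def drift_flag(baseline_labels, current_labels, tolerance=0.10):
    """Drift Detection: True if the output category mix shifts versus baseline.

    Assumption: outputs are bucketed into categories, and the shift is measured
    as total variation distance. The 0.10 tolerance is a placeholder.
    """
    if not baseline_labels or not current_labels:
        return False
    base, cur = Counter(baseline_labels), Counter(current_labels)
    n_base, n_cur = len(baseline_labels), len(current_labels)
    categories = set(base) | set(cur)
    distance = 0.5 * sum(abs(base[c] / n_base - cur[c] / n_cur) for c in categories)
    return distance > tolerance
```

For example, `cost_alert([1.30] * 7, baseline=1.00)` returns `True`, because the 7-day average of 1.30 is above 120% of the 1.00 baseline. The comparisons are strict, so a rate of exactly 25% does not alert.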

The Running System Rule

You don't get to break the business while fixing it. Every AI deployment must maintain operational continuity, communicate changes early, and have a rollback plan before go-live. If the live workflow degrades existing performance, it triggers an automatic pause.
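
As a rough illustration of the rule, the sketch below blocks go-live without a rollback plan and pauses a live workflow whose score falls below the pre-deployment level. The `Workflow` class, the status strings, and the single "higher is better" score are hypothetical. How degradation is actually measured is not specified here.

```python
from dataclasses import dataclass


@dataclass
class Workflow:
    """Hypothetical record for one AI workflow."""
    name: str
    has_rollback_plan: bool
    status: str = "draft"


def go_live(workflow):
    """Refuse to go live unless a rollback plan exists."""
    if not workflow.has_rollback_plan:
        raise ValueError("A rollback plan is required before go-live")
    workflow.status = "live"


def enforce_running_system_rule(workflow, pre_deployment_score, live_score):
    """Pause a live workflow if it performs worse than before deployment.

    Assumption: performance is summarized as one score where higher is better.
    """
    if workflow.status == "live" and live_score < pre_deployment_score:
        workflow.status = "paused"
    return workflow.status
```

A paused workflow stays paused until someone changes its status deliberately. The check never resumes it on its own.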

Where This Fits

DELIVER phase — between cadence reviews

The Observability Scorecard is part of the Board-Ready Pack delivered in the Decision Discipline Program. Named workflow owners use it between 90-Day Cadence™ reviews to track performance.

At each 30/60/90-day review, the scorecard data feeds directly into the evidence the owner presents. The data either confirms the workflow is delivering — or triggers a Fund/Park/Kill reassessment via the Investable Bet Gate™.
