Playbook kit

Metric trust register.

A worksheet for cataloging which metrics are trusted, which are derived twice, and which can't be reproduced. The first conversation that fixes a quarter.

§ Sample artifact · what the install produces

Sample output — Trust register · v1.

Sample · Metric trust register

A worksheet for cataloging which metrics are trusted, which are derived twice, and which can't be reproduced. The first conversation that fixes a quarter.

Final output · Trust register · v1

§ Install order · five steps

How it actually goes in.

STEP · 01

List every metric in active use.

Dashboards, weekly reports, board packages, quarterly reviews. Typical mid-market operations have 40-80. If your list is under 30, you're missing categories.

Day 1

STEP · 02

Identify the source system for each metric.

Where does the underlying data live? ERP, CRM, marketing automation, spreadsheet maintained by hand. Metrics whose source is 'a spreadsheet' are usually category 2 or 3.

Day 2

STEP · 03

Document calculation logic in plain language.

Not the formula — the logic, in two sentences. If you can't, the metric is probably more complicated than its users realize.

Day 3

STEP · 04

Classify each as trusted, derived-twice, or unreproducible.

Trusted: reproducible from source, named owner, validated within 90 days. Derived-twice: two sources, different versions, not reconciled. Unreproducible: logic undocumented, source not retrievable.

Day 4

STEP · 05

Rank the remediation queue by usage weight.

Top of queue: unreproducible metrics referenced weekly in operating decisions. Bottom: derived-twice metrics quoted quarterly. Anything beyond a year of remediation effort is a retire candidate.

Day 5

§ Outcomes scorecard

What good looks like, ninety days in.

Metrics catalogued

40–80

Typical mid-market operations have this many metrics in active use across all dashboards, reports, and decision contexts.

Typically derived-twice or unreproducible

30–50%

Most operations discover that a third to a half of metrics they reference weekly can't be defended under scrutiny.

Top remediations identifiable

5–10

Ranked by usage weight, sequenced for the next quarter of data-layer rebuild.

Install effort

1 week

Five days of structured collection plus classification. Foundation for almost every downstream data investment.

§ The operator narrative

Why this kit is worth installing.

The Most Uncomfortable Operating Audit

Most operating audits surface things the team had not yet noticed. The Metric Trust Register is structurally different. It surfaces things the team has been noticing and not naming — for months, in some cases years.

The register catalogs every operating metric in the business, classifies each as trusted, derived twice, or unreproducible, and produces a ranked remediation queue. The output is uncomfortable because most operations discover that 30-50% of the metrics they reference weekly fall in the second or third category. The metrics the leadership team trusts are partly fiction. The team has been managing around the fiction by checking with the analytics lead before quoting any number that matters.

This essay is about what the register surfaces, why running it is the prerequisite for almost every downstream data investment, and what makes the install land vs. produce another spreadsheet nobody updates. The kit guide covers the structural mechanics; this is the operator narrative.

The Pattern That Makes the Register Necessary

The data layer of most mid-market operations got built incrementally over years. Each system migration added metrics. Each personnel change took some metric lineage with it. Each new initiative produced its own KPIs. The accumulated result is a measurement layer that nobody architected — it accreted.

The accretion produces three pathologies in predictable proportions.

Trusted metrics. Reproducible from source data by someone other than the named owner. Owned by a specific person. Validated against source within the last 90 days. The metric's definition has not changed materially in the last year. These are the metrics the leadership team can reference with confidence. In typical mid-market operations, about a third of the metrics qualify.

Derived twice metrics. Two or more sources produce slightly different versions of the metric. The difference is "explained" rather than reconciled. The metric is often quoted with a caveat ("depending on whose number you use"). About a third of metrics fall here. The leadership team has learned to work around the inconsistency by picking whichever source aligns with the conversation they're having.

Unreproducible metrics. The calculation logic is not documented, the source data is not retrievable, the named owner is no longer in the role, or the metric requires a manual step that has not been re-verified. The metric is referenced in operating decisions but cannot be defended under scrutiny. About a third of metrics fall here too.

The proportions are remarkably consistent across operations. The register's value is not in surfacing the pathologies — most teams know the pathologies exist. The value is in cataloging them comprehensively, classifying each metric specifically, and producing a remediation queue ranked by usage weight.

Why This Matters More Than It Sounds

Operators who hear the register description sometimes underweight its strategic importance. "Sure, our data has some quality issues, but we're operating fine without fixing them."

This is structurally wrong. Three reasons.

Compounding decision quality. Every operating decision made on a derived-twice metric is a decision made on an unstable foundation. Most of those decisions survive because operators apply judgment over the data; some don't, and the operations team rationalizes the misses as bad luck rather than diagnosing them as decisions made on unreliable input. The cumulative cost over a year is substantial and invisible.

AI deployment gate. The AI Readiness Index is structurally derived from the Data & Metrics score, with floor-driven penalties when that category scores low. Operations with unaddressed register pathology cannot deploy AI productively — the workflows will amplify the data problems at higher speed. The register is the prerequisite for AI investment that compounds.

Board and capital trust. Any future capital event will involve diligence on the data layer. Operations whose key metrics turn out to be derived twice or unreproducible take a discount in the diligence process. The discount can be material. The register, done in advance, surfaces the issues so they can be addressed before the diligence team finds them.

The operations that defer the register pay these costs invisibly. The operations that run the register pay the one-week install cost and eliminate the invisible compounding.

The Three Categories Are Not Equally Expensive

Operators sometimes treat the three categories as equivalent failure modes. They are not.

Trusted is the safe state. No action needed beyond maintenance — periodic validation, lineage documentation, named-owner continuity. The trusted column is the baseline; the goal is to grow it over time.

Derived twice is the dangerous middle. The metric appears trustworthy because it has a clear source and a clear name; the trustworthiness is undermined by the second-source disagreement that nobody has reconciled. Decisions made on derived-twice metrics tend to be wrong in a specific direction — they look defensible at the time and turn out to be wrong after the fact. The remediation is reconciliation: pick the canonical version, document the methodology, retire the alternative versions.

Unreproducible is the most expensive. The metric is being quoted in operating decisions without anyone able to defend the methodology. The first time the metric matters in a high-stakes context (board diligence, audit, strategic decision), the unreproducibility becomes a credibility crisis. The remediation is either rebuilding the metric from source (which often reveals that the historical numbers were wrong by 5-15%) or retiring the metric entirely.

The register's remediation queue should prioritize unreproducible metrics first, derived-twice metrics second, by usage weight. The metrics that drive the most operating decisions get fixed first; the metrics that are referenced occasionally can be addressed in a slower cycle or retired.

The Retire Column Is the Most Underused

The register's most underused output is the retire column. Operators often want to keep every metric in some form because each one was added at some point for a reason. The reason has usually been forgotten.

Three signals indicate a metric belongs on the retire list:

Unreproducible AND used in fewer than two operating decisions per quarter. The cost of rebuilding lineage exceeds the cost of removing the metric from the conversation entirely. Retire the metric; remove the references; reduce the dashboard surface area.

Derived twice AND "explained" rather than reconciled for more than six months. Six months of attention without reconciliation means the reconciliation isn't going to happen. Either retire the metric or commit serious effort to one canonical version. The middle state is the worst — operators continue to reference a metric whose meaning is contested.

Trusted AND nobody can articulate what decision it informs. A trusted metric with no operational use is overhead. Retire it; reduce dashboard surface area; protect the team's attention for the metrics that drive actual decisions.

The retire decisions are usually the most contested in the working session that surfaces them. Operators have emotional attachment to metrics they personally added. The discipline is to retire on operating criteria rather than on attachment. Operations that maintain this discipline keep their dashboards focused; operations that don't keep accumulating metrics until the dashboards become unscannable.

The Three Failure Modes the Register Catches

Three operating patterns surface from the register that are otherwise hard to see.

The phantom dashboard. Comprehensive dashboards with dozens of metrics that nobody actually operates against. The leadership team reviews the dashboard weekly; the actual operating decisions get made on a smaller set of metrics that the CFO trusts. The register reveals that 60-70% of the dashboard is decorative and the operating reality runs on a much smaller set.

The conflicting truth. Two metrics that should be the same number are different by 5-15%. Sales reports one revenue figure; finance reports a slightly different one. Marketing reports one conversion rate; analytics reports another. The difference is "explained" rather than reconciled. The team has learned which number to use depending on which executive is asking. The register makes the conflict explicit and forces the reconciliation.

The orphaned metric. A metric that drives a compensation calculation or a board KPI is owned by someone who left the company 18 months ago. The calculation logic lives in a spreadsheet nobody fully understands. Every quarter, the metric is reproduced — sometimes by the analyst who used to support the previous owner, sometimes by someone new. The number is in the board package. The number is wrong. Nobody knows yet. The register surfaces the orphan before the next quarterly close.

Each pattern has a specific remediation. The register's value is in surfacing which pattern is dominant in which functions, so the remediation effort can be targeted.

The Cultural Move That Makes the Install Land

The register is, in part, a critique of the existing data culture. How you frame it determines whether the team participates or defends.

Frame as inventory, not audit. "We are cataloging every metric we use so we know what we have" is true and produces collaboration. "We are going to find out which metrics are wrong" is also true and produces defensiveness. The framing matters because the team's participation determines the register's completeness, and incomplete registers produce false confidence in the cleanup.

Acknowledge the historical pattern. Derived-twice and unreproducible metrics are the historical default for mid-market operations. Nobody set out to build a fragile data layer. The fragility is the accumulated result of years of operating decisions, system migrations, and personnel changes. The register is the first time the operation has surfaced the accumulation in one place. Most teams are relieved when this is named explicitly.

Show the remediation queue with the first three commitments. A register without commitments is a complaint. A register with three specific remediations sequenced for the next quarter is a plan. Walk into the team conversation with the plan, not just the diagnostic.

What to Do This Week

If the Ops Health Check has surfaced Data & Metrics as a top-three risk, the register is the next install. One week of structured work; the foundation for almost every downstream data investment.

Day 1. List every metric. Be exhaustive. Aim for 40-80 metrics; if you're under 30, you're missing categories.

Day 2. Identify source system for each metric. The fragmented sources usually indicate the derived-twice and unreproducible categories.

Day 3. Document calculation logic in plain language. Not the formula — the logic. If you can't describe it in two sentences, the metric is probably in the second or third category.

Day 4. Assign trust classification per metric. Be honest. Bias toward the lower category when uncertain.

Day 5. Rank the remediation queue. Top of the queue should be addressable in the next quarter; bottom should be addressable in the next year. Anything beyond a year is a retire candidate.

The kit guide at /playbooks/metric-trust-register covers the install mechanics in detail. This essay is the operator narrative for why the register matters more than its description suggests. If Data & Metrics is dragging your composite or your AI Readiness Index, the register is the prerequisite install.

One week of inventory work. The first conversation that distinguishes signal from noise in your operating data.

Sibling kits in the Diagnostic bundle

01 · AssessDiagnostic

Ops Health Check kit.

01 · AssessTemplate

Twenty-decision audit.

Member tier · free account

Members get the deeper templates.

Member tier is a free account. Adds the deeper templates, install guides, and access to member office hours. Engagement-tier kits stay engagement-only.

Not sure where to start?

The Ops Health Check is the front door.

Twelve minutes. Personalized phase-by-phase output. Then come back and pick the kit that matches what came out.

Take the Ops Health Check →