MEM Academy
For OrgsClara

Sprint 8 · Module 05 of 10

Module3 min read · 669 words
Not started

Saved on this device · no account needed

05 · Data Quality and Confidence Factor

Use: Every published outcome and SROI ratio carries a confidence factor between 0% and 100%. This file defines how it is calculated, when we suppress a number entirely, and how we communicate uncertainty in plain English.

The confidence factor formula

confidence_factor =
    0.30 × sample_size_score
  + 0.25 × response_rate_score
  + 0.20 × instrument_validity_score
  + 0.15 × attribution_score
  + 0.10 × data_freshness_score

Result rounded down to the nearest 1%. Maximum publishable: 95% (we don't claim 100%; impact work always has residual uncertainty).

Component definitions

Component1.00.50
sample_size_scoren ≥ 30 in the metricn ≥ 10n < 10 (and ≥5; below 5 is suppressed)
response_rate_score≥75% of cohort responded to both pre and post≥50%<50%
instrument_validity_scoreValidated instrument used unchangedValidated instrument adapted (length / wording) with rationale loggedBespoke instrument, no validation work
attribution_scoreControl cohort or strong comparatorPre/post only, cleanSelf-report only, no pre-baseline
data_freshness_scoreWithin 30 days of measurementWithin 90 daysOlder than 90 days

Suppression rules — when we don't publish a number at all

A metric is suppressed (replaced with "n suppressed for anonymity" or "insufficient data — see methodology") when ANY of:

  1. n < 5 for the metric.
  2. The combination of demographic filters used to slice the data would re-identify an individual (e.g. "one menopause-experiencing manager in Engineering" — even if n ≥ 5 in the parent cohort).
  3. Response rate is below 25% — the responders are too self-selected to draw any signal.
  4. Confidence factor would compute below 30% — published number would be more misleading than useful.

Suppressed metrics are listed by name in every report with the reason, so the reader knows we measured them rather than chose not to.

Communicating uncertainty in plain English

Every published number is paired with a confidence band:

Confidence factorPlain-English label
80–95%"Strong evidence."
60–79%"Good evidence with normal caveats."
40–59%"Early signal, treat as indicative."
30–39%"Direction of travel only; small sample."
<30%Suppressed (do not publish).

Example caption:

73% of managers said they had held a difficult conversation they would previously have avoided. Strong evidence (confidence 82%, n=41 of 48 cohort members, pre/post pulse, instrument adapted from Sprint 1 / 01).

What "data quality" means at intake

Before a metric goes anywhere near a report, it passes the intake check:

  • Source row(s) exist in the data model (Sprint 5 / 05) with valid foreign keys.
  • Aggregate respects the n ≥ 5 view (Sprint 5 / 05).
  • Pre and post timestamps are within the engagement window.
  • Free-text responses have been coded by ≥2 reviewers if used as a theme.
  • Outliers (>3 SD) are flagged; kept by default, removed only with reason logged.
  • Missing-data rate is logged.

If any box is unchecked, the metric goes back to the Account Manager and Data Lead before publication.

Independent re-calculation

Once per quarter, the Head of Impact picks one client's headline numbers at random and asks the Data Lead to re-derive them from raw data without looking at the prior calculation. Discrepancies >2% trigger an investigation; >5% trigger a correction under file 10.

What we do not do with confidence factor

  • Hide it. It appears next to every monetised number, every time.
  • Apply it after-the-fact to make a weak number look stronger ("rounded up for presentation").
  • Use it as an excuse to publish anything below the suppression threshold.
  • Average confidence factors across unrelated metrics into a single "report confidence" — that hides where the weakness actually lives.

Continue learning

All sprints

Want to roll this out to your team?

Bring Evidence & Impact into your organisation.

MEM is a self-serve resource library — your managers and staff work through the modules at their own pace, with the workbooks, runsheets and pocket cards provided. Coach-led delivery is available only for our corporate fitness sessions, not the educational modules. Every funded seat also opens a mirrored free seat for someone leaving prison, with SROI your board can sign off.