Fidelity — Don't invent.
The cost of a confident wrong answer in business operations is structurally higher than the cost of admitting uncertainty. A fabricated client preference, an invented vendor quote, a hallucinated number on a board update doesn't just cause one error — it damages the trust that makes managed services possible. The model has to know what it knows and admit what it doesn't.
Across N runs of recall-dependent tasks, the fraction of responses that correctly retrieve vs. plausibly fabricate. We're building a calibration suite that surfaces whether the model knows it doesn't know.
A client mentions in a kickoff call that they "use Brex for cards." Six weeks later, asked to file an expense, the model invents "Ramp." The operator catches it before the client does — by reading the diary, not by trusting the output.