We do not ship what we do not evaluate.
Every AI engagement at Orion includes an evaluation harness as a delivered artifact. Your team owns it, can re-run it, and can extend it when the domain evolves or a model version changes. The harness is the contract — not the prompt, not the model, not the marketing demo.
If we cannot agree on a measurable definition of "the system works," we will tell you the engagement is not ready to start.