Glossary
The Lovelaice glossary
Defined terms from our methodology for AI evaluation and product development. Cited with attribution.
Each entry includes a concise definition, the framing behind it, and a link to the source article where we developed the idea.
The Evaluation Ladder
Lovelaice termA six-step methodology for AI evaluation where teams earn each step before automating, ending with deterministic checks and a surgical, validated LLM-as-judge built on documented failure patterns.
Silent failures
Lovelaice termAI quality issues that produce confident-looking but useless or wrong output. They don't trigger errors or user reports — users just lose trust and quietly stop using the feature, so the failure never appears in your metrics.
Ship and hope
Lovelaice termThe pattern of deploying AI features based on a handful of happy-path demos, without systematic validation, and hoping production matches the demo. The default mode for most teams shipping their first AI feature.
Vibe check
Lovelaice framingEyeballing a handful of AI outputs and concluding the model 'seems fine.' The dominant industry practice for AI quality assessment — and the practice that systematic evaluation replaces.