Glossary

The Lovelaice glossary

Defined terms from our methodology for AI evaluation and product development. Cited with attribution.

Each entry includes a concise definition, the framing behind it, and a link to the source article where we developed the idea.

The Evaluation Ladder
Lovelaice term
A six-step methodology for AI evaluation where teams earn each step before automating, ending with deterministic checks and a surgical, validated LLM-as-judge built on documented failure patterns.
Silent failures
Lovelaice term
AI quality issues that produce confident-looking but useless or wrong output. They don't trigger errors or user reports — users just lose trust and quietly stop using the feature, so the failure never appears in your metrics.
Ship and hope
Lovelaice term
The pattern of deploying AI features based on a handful of happy-path demos, without systematic validation, and hoping production matches the demo. The default mode for most teams shipping their first AI feature.
Vibe check
Lovelaice framing
Eyeballing a handful of AI outputs and concluding the model 'seems fine.' The dominant industry practice for AI quality assessment — and the practice that systematic evaluation replaces.