mlstories

Evaluation & Observability

⚖️

evaluation · week 1

The Model on Trial

Three witnesses (BLEU, RAGAS, and Judge) put one model answer on trial and cannot agree on whether it is actually good.

⏱ 4 min read