Eval Academy

View Original

Benchmark

A benchmark (a.k.a. baseline) is the standard level we measure against. We need to compare our results with a benchmark to tell if something got better, worse, or stayed the same.

 

Return to the Evaluation Dictionary