SourceScore VERITAS · verified claim · 100% confidence

AlpacaEval, an LLM-as-judge evaluation benchmark, was introduced in Li et al. 2023.

Subject: AlpacaEval
Predicate: introduced_in
Object: Li et al. 2023 — LLM-as-judge evaluation benchmark
Primary source · GitHub release · 2023-05-25
AlpacaEval: automatic evaluator for instruction-following models · Tatsu Lab / Stanford
Last verified 2026-05-16 · 2 sources · 2f14f3078741c0ad