AlpacaEval introduced in: Li et al. 2023 — LLM-as-judge evaluation benchmark.
Object
Li et al. 2023 — LLM-as-judge evaluation benchmark
Primary source · github release · 2023-05-25
AlpacaEval — automatic evaluator for instruction-following models — Tatsu Lab / Stanford