Tag
1 verified claim carrying this tag. Each has 2+ primary sources and an HMAC-SHA256 signature.
AlpacaEval introduced in: Li et al. 2023 — LLM-as-judge evaluation benchmark.
2f14f3078741c0ad · 2 sources · 100% confidence