Tag
stanford
5 verified claims carrying this tag. Each has 2+ primary sources and an HMAC-SHA256 signature.
Direct Preference Optimization (DPO) introduced in paper: Direct Preference Optimization: Your Language Model is Secretly a Reward Model (Rafailov et al., 2023).
a3e691683a4577af · 2 sources · 100% confidence
FlashAttention introduced in paper: FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (Dao et al., 2022).
e120182d1e01ea2b · 2 sources · 100% confidence
AlpacaEval introduced in: Li et al. 2023 — LLM-as-judge evaluation benchmark.
2f14f3078741c0ad · 2 sources · 100% confidence
Stanford Alpaca publicly released on: 2023-03-13 — instruction-tuned LLaMA 7B from Stanford CRFM.
a1cbe9c4e3a5c8d3 · 2 sources · 100% confidence
ColBERT introduced in: Khattab & Zaharia 2020 — late-interaction retrieval.
2335984b07f28cac · 2 sources · 100% confidence