Tag
instructgpt
2 verified claims carrying this tag. Each has 2+ primary sources and an HMAC-SHA256 signature.
InstructGPT methodology introduced in paper: Training language models to follow instructions with human feedback (Ouyang et al., 2022).
5da8f8dffc038b8e · 2 sources · 100% confidence
InstructGPT introduced in: Ouyang et al. 2022 — RLHF-tuned GPT-3, direct ancestor of ChatGPT.
590b9de765b8126e · 2 sources · 100% confidence