Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant. — SourceScore VERITAS embed · SourceScore

SourceScore VERITAS · verified claim100% confidence

Anthropic Constitutional AI Harmlessness introduced in paper: Bai et al. 2022 — training a helpful and harmless assistant.

Anthropic Constitutional AI Harmlessness

introduced_in_paper

Bai et al. 2022 — training a helpful and harmless assistant

Primary source · preprint · 2022-12-15

Constitutional AI: Harmlessness from AI Feedback — arXiv (Bai, Kadavath, Kundu, Askell, Kernion, Jones, Chen, et al. / Anthropic)

Last verified 2026-05-16 · 2 sources · 6fa575eb9df5ac32View full claim →