DeepSeek-R1 was released on 2025-01-20 with chain-of-thought reasoning capabilities.
Object
2025-01-20, with chain-of-thought reasoning capabilities
Primary source · preprint · 2025-01-22
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning — arXiv (DeepSeek-AI)