← 返回论文库
OpenAI o1 / o3
OpenAI · 2024
L4.3 · Foundation Model Tech Stack
Product
#reasoning-model
CORE IDEA
Inference-time hidden CoT + RL,test-time compute scaling 范式开创者。
L-ANCHOR · 为什么在这一层重要
test-time compute paradigm
相关论文
DeepSeek-R1: Incentivizing Reasoning in LLMs via RL
L4.2
2025
Sparks of Artificial General Intelligence: Early Experiments with GPT-4
L4.3
2023