OpenAI o1 / o3

OpenAI · 2024

L4.3 · Foundation Model Tech StackProduct#reasoning-model

CORE IDEA

Inference-time hidden CoT + RL，test-time compute scaling 范式开创者。

L-ANCHOR · 为什么在这一层重要

test-time compute paradigm

相关论文