← 返回论文库

Playing Atari with Deep Reinforcement Learning (DQN)

Mnih, et al. (DeepMind) · 2013

L5.1 · Algorithmic FoundationsNature 518 (2015)#rl

CORE IDEA

Deep Q-Network：CNN + Q-learning，第一次 deep RL 大规模成功。

L-ANCHOR · 为什么在这一层重要

deep RL 起点

arXiv:1312.5602 ↗

相关论文

QuantFactor REINFORCE

DeepSeek-R1: Incentivizing Reasoning in LLMs via RL

Mastering the Game of Go with Deep Neural Networks and Tree Search (AlphaGo)