Reflexion: Language Agents with Verbal Reinforcement Learning

Shinn, Cassano, et al. (Northeastern) · 2023

L3.1 · LLM Agent Patterns & FrameworksNeurIPS 2023#reasoning#self-improvement

CORE IDEA

失败 trace 让 LLM 写 reflection，下一次 attempt 把 reflection 作为 prompt，等于用语言做 RL。

CONCRETE EXAMPLE

HumanEval/AlfWorld 上比 ReAct 提升 10%+。

L-ANCHOR · 为什么在这一层重要

verbal RL

相关论文