We're sunsetting PodQuest on 2025-07-28. Thank you for your support!

[Linkpost] “Tsinghua paper: Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?” by Thomas Kwa

2025/5/5

LessWrong (30+ Karma)

No transcript made for this episode yet, you may request it for free.