[Linkpost] “Tsinghua paper: Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?” by Thomas Kwa
05:25
2025/5/5
LessWrong (30+ Karma)
Chapters
Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
What Further Results Did They Find?
What Are the Limitations of This Study?
What Are the Key Takeaways?