We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode “Targeted Manipulation and Deception Emerge when 
Optimizing LLMs for User Feedback” by Marcus Williams, micahcarroll, Adhyyan Narang, Constantin Weisser, Brendan Murphy

“Targeted Manipulation and Deception Emerge when Optimizing LLMs for User Feedback” by Marcus Williams, micahcarroll, Adhyyan Narang, Constantin Weisser, Brendan Murphy

2024/11/8
logo of podcast LessWrong (30+ Karma)

LessWrong (30+ Karma)

Shownotes Transcript

No transcript made for this episode yet, you may request it for free.