“Targeted Manipulation and Deception Emerge when Optimizing LLMs for User Feedback” by Marcus Williams, micahcarroll, Adhyyan Narang, Constantin Weisser, Brendan Murphy
Duration: 23:59
2024/11/8
LessWrong (30+ Karma)