We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
back
“Will alignment-faking Claude accept a deal to reveal its misalignment?” by ryan_greenblatt
43:19
Share
2025/1/31
LessWrong (30+ Karma)
AI Chapters
Transcribe
Chapters
What Were the Results of the Experiment?
What Are the Models' Objections and How Do They Spend the Money?
Why Did Ryan Undertake This Research?
What Are the Complications Related to Commitments?
What Are the More Detailed Results?
What More Can We Learn About Reviewing Model Objections and Follow-Up Conversations?
Shownotes
Transcript
No transcript made for this episode yet, you may request it for free.