We're sunsetting PodQuest on 2025-07-28. Thank you for your support!

“Interpretability Will Not Reliably Find Deceptive AI” by Neel Nanda

2025/5/4

LessWrong (30+ Karma)

No transcript made for this episode yet, you may request it for free.