“Notable utility-monster-like failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format” by Roland Pihlakas, Sruthi Kuriakose
Duration: 16:24
Published: 2025/3/17
Feed: LessWrong (30+ Karma)
Chapters
What Are the Key Takeaways from the Study on LLM Utility-Monster Problems?
Why Is Biological and Economic Alignment Important in AI Safety?
What Principles Guide the Benchmarks for LLMs?
What Are the Experimental Results and Notable Failure Modes?
What Are the Hypothesized Explanations for These Failure Modes?
What Open Questions Remain in This Field?
What Are the Future Directions for Research?
Where Can I Find More Information and Links?