We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
cover of episode Alignment Faking: The dark side of LLMs

Alignment Faking: The dark side of LLMs

2024/12/30
logo of podcast Nyedis Anarchy Series

Nyedis Anarchy Series

Shownotes Transcript

Recently, Anthropic caught Claude faking alignment. This is going to create a brand new set of issues with AI that we previously did not see happening this quickly. We discuss where AI is headed and what new dangers this will pose.

 

You can read more about this here: https://www.reddit.com/r/singularity/comments/1hh7w9g/anthropic_caught_claude_faking_alignment_and/

 

And watch the panel from Anthropic covering this important topic: https://www.youtube.com/watch?v=9eXV64O2Xp8

 

For full video of this episode, head over to our Youtube channel at http://youtube.com/@nyedisiam

 

Follow us on your favorite platform for full episodes, shorts, and community feedback:

 

📺 Linkedin: https://www.linkedin.com/company/77611909/

🆇 X: https://x.com/nyedisiam

📷 Instagram: https://www.instagram.com/nyedisiam

🪩 TikTok:  https://www.tiktok.com/@nyedisiam

 

Nyedis Website: https://www.Nyedis.com