One of my takeaways from EA Global this year was that most alignment people aren't explicitly focused on LLM-based agents (LMAs)[1] as a route to takeover-capable AGI. I want to better understand this position, since I estimate this path to AGI as likely enough (maybe around 60%) to be worth specific focus and concern. Two reasons people might not care about aligning LMAs in particular:
1. Thinking this route to AGI is quite possible, but that aligning LLMs mostly covers aligning LLM agents.
2. Thinking LLM-based agents are unlikely to be the first takeover-capable AGI.
I'm aware of arguments/questions like "Have LLMs Generated Novel Insights?", "LLM Generality is a Timeline Crux", and LLMs' weakness on what Steve Byrnes calls discernment: the ability to tell their better ideas/outputs from their worse ones.[2] I'm curious if these or other ideas play a major role in your thinking. I'm even more curious about [...]
The original text contained 3 footnotes which were omitted from this narration.
First published: March 2nd, 2025
---
Narrated by TYPE III AUDIO.