Audio narrations of LessWrong posts. Includes all curated posts and all posts with 125+ karma. If you
I've been trying to avoid the terms "good faith" and "bad faith". I'm
The Carving of Reality, the third volume of the Best of LessWrong books, is now available on Amazon (US).
LLMs can do many incredible things. They can generate unique creative content, carry on long convers
Intro: I am a psychotherapist, and I help people working on AI safety. I noticed patterns of mental
This is a linkpost for the article "Ten Thousand Years of Solitude", written by Jared Diam
I gave a talk about the different risk models, followed by an interpretability presentation, then I
I've been workshopping a new rationality training paradigm. (By "rationality training para
Inflection.ai (co-founded by DeepMind co-founder Mustafa Suleyman) should be perceived as a frontier
TL;DR: This document lays out the case for research on “model organisms of misalignment” – in vitro
In "Towards understanding-based safety evaluations," I discussed why I think evaluating sp
Blogpost version. Paper. We have just released our first public report. It introduces methodology for as
Summary of Argument: The public debate among AI experts is confusing because there are, to a first a
So this morning I thought to myself, "Okay, now I will actually try to study the LK99 question,
I believe that sharing information about the capabilities and limits of existing ML systems, and esp
In the early 2010s, a popular idea was to provide coworking spaces and shared living to people who w
This month I lost a bunch of bets. Back in early 2016 I bet at even odds that self-driving ride shari
Some early biologist, equipped with knowledge of evolution but not much else, might see all these cr
The Lightspeed application asks: “What impact will [your project] have on the world? What is your p
Previously Jacob Cannell wrote the post "Brain Efficiency" which makes several radical cla
I think "Rationality is winning" is a bit of a trap. (The original phrase is notably "