LessWrong (Curated & Popular)

"Assume Bad Faith" by Zack_M_Davis

2023/8/28

I've been trying to avoid the terms "good faith" and "bad faith". I'm

"Book Launch: "The Carving of Reality," Best of LessWrong vol. III" by Raemon

2023/8/28

The Carving of Reality, third volume of the Best of LessWrong books is now available on Amazon (US).

"Large Language Models will be Great for Censorship" by Ethan Edwards

2023/8/23

LLMs can do many incredible things. They can generate unique creative content, carry on long convers

"6 non-obvious mental health issues specific to AI safety" by Igor Ivanov

2023/8/22

Intro: I am a psychotherapist, and I help people working on AI safety. I noticed patterns of mental

"Ten Thousand Years of Solitude" by agp

2023/8/22

This is a linkpost for the article "Ten Thousand Years of Solitude", written by Jared Diam

"Against Almost Every Theory of Impact of Interpretability" by Charbel-Raphaël

2023/8/21

I gave a talk about the different risk models, followed by an interpretability presentation, then I

"Feedbackloop-first Rationality" by Raemon

2023/8/15

I've been workshopping a new rationality training paradigm. (By "rationality training para

"Inflection.ai is a major AGI lab" by Nikola

2023/8/15

Inflection.ai (co-founded by DeepMind co-founder Mustafa Suleyman) should be perceived as a frontier

"Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research" by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez

2023/8/9

TL;DR: This document lays out the case for research on “model organisms of misalignment” – in vitro

"When can we trust model evaluations?" bu evhub

2023/8/9

In "Towards understanding-based safety evaluations," I discussed why I think evaluating sp

"ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks" by Beth Barnes

2023/8/4

Blogpost versionPaperWe have just released our first public report. It introduces methodology for as

"The "public debate" about AI is confusing for the general public and for policymakers because it is a three-sided debate" by Adam David Long

2023/8/4

Summary of Argument: The public debate among AI experts is confusing because there are, to a first a

Episodes

"Assume Bad Faith" by Zack_M_Davis

"Book Launch: "The Carving of Reality," Best of LessWrong vol. III" by Raemon

"Large Language Models will be Great for Censorship" by Ethan Edwards

"6 non-obvious mental health issues specific to AI safety" by Igor Ivanov

"Ten Thousand Years of Solitude" by agp

"Against Almost Every Theory of Impact of Interpretability" by Charbel-Raphaël

"Feedbackloop-first Rationality" by Raemon

"Inflection.ai is a major AGI lab" by Nikola

"Model Organisms of Misalignment: The Case for a New Pillar of Alignment Research" by evhub, Nicholas Schiefer, Carson Denison, Ethan Perez

"When can we trust model evaluations?" bu evhub

"ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks" by Beth Barnes

"The "public debate" about AI is confusing for the general public and for policymakers because it is a three-sided debate" by Adam David Long

"My current LK99 questions" by Eliezer Yudkowsky

"Thoughts on sharing information about language model capabilities" by paulfchristiano

"Cultivating a state of mind where new ideas are born" by Henrik Karlsson

"Self-driving car bets" by paulfchristiano

"Yes, It's Subjective, But Why All The Crabs?" by johnswentworth

"Grant applications and grand narratives" by Elizabeth

"Brain Efficiency Cannell Prize Contest Award Ceremony" by Alexander Gietelink Oldenziel

"Rationality !== Winning" by Raemon