As an employee of the European AI Office, it's important for me to emphasize this point: The views and opinions of the author expressed herein are personal and do not necessarily reflect those of the European Commission or other EU institutions.
Also, to stave off a common confusion: I worked at ARC Theory, which is now simply called ARC, on Paul Christiano's theoretical alignment agenda. The more famous ARC Evals was a different group working on evaluations. Their work was completely separate from ARC Theory, and they were only housed under the same organization out of convenience, until ARC Evals spun off under the name METR. Nothing I write here has any implication for the work of ARC Evals/METR.
**Low Probability Estimation**
This is my third post in a sequence on ARC's agenda. You should definitely read the first post before this one for [...]
Outline:
(00:56) Low Probability Estimation
(02:42) LPE on real distributions
(04:41) LPE as training signal
(07:55) Does LPE work at all?
The original text contained 11 footnotes which were omitted from this narration.
First published: May 2nd, 2025
---
Narrated by TYPE III AUDIO.