We're sunsetting PodQuest on 2025-07-28. Thank you for your support!
Export Podcast Subscriptions
back
“Negative Results on Group SAEs” by Josh Engels
18:55
Share
2025/5/7
LessWrong (30+ Karma)
AI Chapters
Transcribe
Chapters
What led to the exploration of Group SAEs?
Understanding Group SAEs
Synthetic Circles Experiments: What Did They Reveal?
Training Group SAEs on GPT-2: Initial Metrics
Do Group SAEs Capture Known Circular Subspaces?
What Other Approaches Did We Try?
Exploring Learned Group Space
Conclusion: What Can We Learn from These Negative Results?
Shownotes
Transcript
No transcript made for this episode yet, you may request it for free.