timeline
Paper & Poster
24 Apr 2025: Sparse Autoencoders Do Not Find Canonical Units of Analysis
Conference poster
15 Dec 2024: BatchTopK Sparse Autoencoders
Conference poster
15 Dec 2024: Stitching Sparse Autoencoders of Different Sizes
Workshop
6 May 2024: AI Forensics IRL Meeting
Preprint
1 Nov 2023: CoinRun: Solving Goal Misgeneralisation