GPT2

Calendar feature geometry in GPT-2 layer 8 residual stream SAEs

LLMs SAEs GPT2 Geometry

Patrick Leask, Bart Bussmann, and Neel Nanda take a close look at GPT-2’s SAE feature geometry on the AI Alignment Forum