Mechanistic interpretability

Mechanistic interpretability is the practice of interpreting neural networks on the basis of their parameters, that is, their learned weights and biases.
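
As a minimal sketch of what interpreting a network from its parameters can look like, the toy example below reads off the role of each hidden unit directly from its weights and bias, then checks that reading against the network's behavior. Everything in it is an assumption made for illustration: the 2-2-1 ReLU network, its hand-picked parameters (`W1`, `B1`, `W2`, `B2`), and the helper names `forward` and `firing_inputs` are hypothetical and do not come from any trained model or library.

```python
# Hypothetical, hand-picked parameters for a tiny 2-2-1 ReLU network.
# They are chosen for illustration only and are not taken from a trained model.
W1 = [[1.0, 1.0],   # hidden unit 0: weights on (x1, x2)
      [1.0, 1.0]]   # hidden unit 1: weights on (x1, x2)
B1 = [0.0, -1.0]    # hidden biases
W2 = [1.0, -2.0]    # output weights on (hidden unit 0, hidden unit 1)
B2 = 0.0            # output bias

INPUTS = [(0, 0), (0, 1), (1, 0), (1, 1)]


def relu(z):
    return max(0.0, z)


def pre_activation(unit, x):
    """Weighted sum plus bias for one hidden unit."""
    return sum(w * xi for w, xi in zip(W1[unit], x)) + B1[unit]


def forward(x):
    """Run the network on a single input pair."""
    h = [relu(pre_activation(u, x)) for u in range(len(W1))]
    return sum(w * hi for w, hi in zip(W2, h)) + B2


def firing_inputs(unit):
    """Binary inputs on which a hidden unit is active, derived from its parameters."""
    return [x for x in INPUTS if pre_activation(unit, x) > 0]


# Reading the parameters:
#   unit 0 has bias 0, so it is active whenever at least one input is on
#   unit 1 has bias -1, so it is active only when both inputs are on
#   the output adds unit 0 and subtracts twice unit 1
for unit in range(len(W1)):
    print(f"hidden unit {unit} fires on {firing_inputs(unit)}")

# Checking the reading against behavior: the network computes XOR on binary inputs.
print([forward(x) for x in INPUTS])
```

In this sketch the explanation ("sum of the inputs minus twice their AND") is obtained from the parameters first and only afterwards confirmed by running the network. Real networks are far larger and rarely decompose this cleanly.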