The Linear Representation Hypothesis and the Geometry of LLMs
We formalize several notions of linear representation in LLMs (e.g., linear probes and steering vectors) and unify them by identifying an inner product that encodes semantic structure. Published at ICML 2024.
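
As a rough illustration of how a choice of inner product can tie a probe to a steering vector, here is a toy sketch on synthetic data. Everything in it is an assumption made for illustration: the dimension `d`, the random positive-definite matrix `M`, and the names `steer`, `probe`, and `inner` are hypothetical and are not taken from the paper. The only fact it relies on is standard linear algebra: under the inner product <u, v>_M = u^T M v, the functional x -> <steer, x>_M is the Euclidean dot product with `M @ steer`.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 4  # toy representation dimension (assumption)

# Hypothetical symmetric positive-definite matrix defining the inner product.
A = rng.normal(size=(d, d))
M = A @ A.T + d * np.eye(d)

def inner(u, v, M):
    """Inner product <u, v>_M = u^T M v."""
    return float(u @ M @ v)

# Hypothetical steering vector: a direction added to a representation.
steer = rng.normal(size=d)

# Matching probe: the linear functional x -> <steer, x>_M, expressed as an
# ordinary dot product with the vector M @ steer.
probe = M @ steer

# A synthetic representation to read from and to steer.
x = rng.normal(size=d)
probe_reading = float(probe @ x)
inner_reading = inner(steer, x, M)

# Adding alpha * steer to x shifts the probe reading by alpha * <steer, steer>_M.
alpha = 0.5
shift = float(probe @ (x + alpha * steer)) - probe_reading

print(np.isclose(probe_reading, inner_reading))
print(np.isclose(shift, alpha * inner(steer, steer, M)))
```

In this toy setting the probe and the steering vector are two views of the same direction, related through `M`; with `M` equal to the identity they would coincide. Which inner product is the appropriate one for a real model is the question the paper addresses, and this sketch does not attempt to answer it.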