Continuing on the
@AnthropicAI's Transformer Circuit series and as a part of daily paper discussions on the
@ykilcher discord server, I will be volunteering to lead the analysis of the following mechanistic interpretability work 🧮 🔍
📜 Toy Models of Superposition authored by Nelson Elhage,
@trishume,
@catherineols,
@nschiefer, et al.
🌐
transformer-circuits.pub/202…
🕰 Friday, Sep 19, 2024 12:30 AM UTC // Friday, Sep 19, 2024 6.00 AM IST // Thursday, Sep 18, 2024 5:30 PM PT
Previous Mechanistic Interpretability papers in this series that we talked about:
🔬 Softmax Linear Units @
transformer-circuits.pub/202…
🔬 In-context Learning and Induction Heads @
transformer-circuits.pub/202…
🔬 A Mathematical Framework for Transformer Circuits @
transformer-circuits.pub/202…
Join in for the fun ~
ykilcher.com/discord
#DailyPaperDiscussions #TransformerCircuits #MechanisticInterpretability