Mechanistic Interpretability (Mathematical Framework for Transformer Circuits & Monosemanticity)

Jinen Setpal
Jinen Setpal
ML Research Intern @ Apple

Deep Learning Optimization Theory & Intrinsic Interpretability