Mechanistic Interpretability (Mathematical Framework for Transformer Circuits & Monosemanticity)

Jinen Setpal
Jinen Setpal
ECE PhD @ Purdue

Deep Learning Optimization Theory & Intrinsic Interpretability