In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

S. S. Wilks Memorial Seminar in Statistics

Zhuoran Yang *22, Yale University

September 15

12:15 pm

Sherrerd Hall, 101

See event website for additional details and how to view or participate.

Sponsors

Sherrerd Hall

Operations Research and Financial Engineering

Developing mathematical and computational tools for making decisions under uncertainty