He, J., Pan, X., Chen, S., & Yang, Z. (2025). In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention.
Chicago Style (17th ed.) CitationHe, Jianliang, Xintian Pan, Siyu Chen, and Zhuoran Yang. In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention. 2025.
MLA (9th ed.) CitationHe, Jianliang, et al. In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention. 2025.
Warning: These citations may not always be 100% accurate.