Yang, H., Kailkhura, B., Wang, Z., & Liang, Y. (2024). Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis.
Chicago citation style (17th ed.): Yang, Hongru, Bhavya Kailkhura, Zhangyang Wang, and Yingbin Liang. Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis. 2024.
MLA citation style (9th ed.): Yang, Hongru, et al. Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis. 2024.
Warning: these citations may not be 100% accurate.