Labbi, S., Tiapkin, D., Mangold, P., & Moulines, E. (2026). Beyond Softmax and Entropy: Convergence Rates of Policy Gradients with f-SoftArgmax Parameterization & Coupled Regularization.
Chicago Style (17th ed.) CitationLabbi, Safwan, Daniil Tiapkin, Paul Mangold, and Eric Moulines. Beyond Softmax and Entropy: Convergence Rates of Policy Gradients with F-SoftArgmax Parameterization & Coupled Regularization. 2026.
MLA (9th ed.) CitationLabbi, Safwan, et al. Beyond Softmax and Entropy: Convergence Rates of Policy Gradients with F-SoftArgmax Parameterization & Coupled Regularization. 2026.
Warning: These citations may not always be 100% accurate.