Zhang, L., Jia, T., Zhai, Y., Xie, Z., Duan, C., He, M., . . . Li, Y. (2026). From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery.
Cita Chicago Style (17a ed.)Zhang, Lingzhe, Tong Jia, Yunpeng Zhai, Zixuan Xie, Chiming Duan, Minghua He, Philip S. Yu, y Ying Li. From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery. 2026.
Cita MLA (9a ed.)Zhang, Lingzhe, et al. From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery. 2026.
Precaución: Estas citas no son 100% exactas.