Li, X., Fang, X., Ding, S., Li, Y., Li, L., Duan, H., . . . Chen, K. (2026). Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs.
Chicago Style (17th ed.) CitationLi, Xiaozhe, Xinyu Fang, Shengyuan Ding, Yang Li, Linyang Li, Haodong Duan, Qingwen Liu, and Kai Chen. Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs. 2026.
MLA (9th ed.) CitationLi, Xiaozhe, et al. Forge: Quality-Aware Reinforcement Learning for NP-Hard Optimization in LLMs. 2026.
Warning: These citations may not always be 100% accurate.