Liu, J., He, C., Lin, Y., Yang, M., Shen, F., & Liu, S. (2025). ETTRL: Balancing Exploration and Exploitation in LLM Test-Time Reinforcement Learning Via Entropy Mechanism.
Chicago Style (17th ed.) Citation: Liu, Jia, ChangYi He, YingQiao Lin, MingMin Yang, FeiYang Shen, and ShaoGuo Liu. ETTRL: Balancing Exploration and Exploitation in LLM Test-Time Reinforcement Learning Via Entropy Mechanism. 2025.
MLA (9th ed.) Citation: Liu, Jia, et al. ETTRL: Balancing Exploration and Exploitation in LLM Test-Time Reinforcement Learning Via Entropy Mechanism. 2025.