Yin, Z., Wang, S., Wang, X., Ma, X., & Wang, Y. (2025). Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints.
Chicago Style (17th ed.) CitationYin, Zhenyun, Shujie Wang, Xuhong Wang, Xingjun Ma, and Yinchun Wang. Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with Constraints. 2025.
MLA (9th ed.) CitationYin, Zhenyun, et al. Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with Constraints. 2025.
Warning: These citations may not always be 100% accurate.