Liu, Y., Zhang, W., Cao, C., Lu, W., Yuan, F., Guo, D., . . . Ma, Z. (2026). PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering.
Chicago Style (17th ed.) Citation: Liu, Yu, et al. PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering. 2026.
MLA (9th ed.) Citation: Liu, Yu, et al. PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering. 2026.