APA Citation: Ouyang, Y., Huang, X., Liu, B., Zheng, Z., Gu, Y., & Zhang, X. (2026). Benchmarks are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems.
Chicago Style (17th ed.) Citation: Ouyang, Yipeng, Xin Huang, Bingjie Liu, Zhongchun Zheng, Yuhao Gu, and Xianwei Zhang. Benchmarks Are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems. 2026.
MLA (9th ed.) Citation: Ouyang, Yipeng, et al. Benchmarks Are Not Enough: RAMP for Runtime Assessing of Agentic Models in Production Systems. 2026.