Zhang, Z., Guo, Y., Liang, Y., Zhao, D., & Duan, N. (2024). PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion.
Chicago Style (17th ed.) CitationZhang, Zekai, Yiduo Guo, Yaobo Liang, Dongyan Zhao, and Nan Duan. PPTC-R Benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion. 2024.
MLA (9th ed.) CitationZhang, Zekai, et al. PPTC-R Benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion. 2024.
Warning: These citations may not always be 100% accurate.