Lin, Y., Kwon, W., Pineda, R., & Paravecino, F. N. (2024). APEX: An Extensible and Dynamism-Aware Simulator for Automated Parallel Execution in LLM Serving.
Chicago Style (17th ed.) CitationLin, Yi-Chien, Woosuk Kwon, Ronald Pineda, and Fanny Nina Paravecino. APEX: An Extensible and Dynamism-Aware Simulator for Automated Parallel Execution in LLM Serving. 2024.
MLA (9th ed.) CitationLin, Yi-Chien, et al. APEX: An Extensible and Dynamism-Aware Simulator for Automated Parallel Execution in LLM Serving. 2024.
Warning: These citations may not always be 100% accurate.