Bournias, I., Cavigelli, L., & Zacharopoulos, G. (2024). AcceLLM: Accelerating LLM Inference using Redundancy for Load Balancing and Data Locality.
Chicago Style (17th ed.) CitationBournias, Ilias, Lukas Cavigelli, and Georgios Zacharopoulos. AcceLLM: Accelerating LLM Inference Using Redundancy for Load Balancing and Data Locality. 2024.
MLA (9th ed.) CitationBournias, Ilias, et al. AcceLLM: Accelerating LLM Inference Using Redundancy for Load Balancing and Data Locality. 2024.
Warning: These citations may not always be 100% accurate.