Zhao, Y., Qin, Y., Wang, Y., Yang, X., Han, H., Wei, S., . . . Yin, S. (2025). MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts.
Chicago Style (17th ed.) CitationZhao, Yushu, Yubin Qin, Yang Wang, Xiaolong Yang, Huiming Han, Shaojun Wei, Yang Hu, and Shouyi Yin. MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts. 2025.
MLA (9th ed.) CitationZhao, Yushu, et al. MoBiLE: Efficient Mixture-of-Experts Inference on Consumer GPU with Mixture of Big Little Experts. 2025.
Warning: These citations may not always be 100% accurate.