Yang, L., Yu, Z., Meng, C., Xu, M., Ermon, S., & Cui, B. (2024). Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs.
Chicago Style (17th ed.) CitationYang, Ling, Zhaochen Yu, Chenlin Meng, Minkai Xu, Stefano Ermon, and Bin Cui. Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs. 2024.
MLA (9th ed.) CitationYang, Ling, et al. Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs. 2024.
Warning: These citations may not always be 100% accurate.