Agarwal, L., & Verma, B. (2025). Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation.
Chicago Style (17th ed.) CitationAgarwal, Lakshita, and Bindu Verma. Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation. 2025.
MLA (9th ed.) CitationAgarwal, Lakshita, and Bindu Verma. Towards Explainable AI: Multi-Modal Transformer for Video-based Image Description Generation. 2025.
Warning: These citations may not always be 100% accurate.