Saved in:
Bibliographic Details
Main Authors: Sharma, Ujjwal, Rudinac, Stevan, Mićković, Ana, van Dolen, Willemijn, Worring, Marcel
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.01550
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914132665565184
author Sharma, Ujjwal
Rudinac, Stevan
Mićković, Ana
van Dolen, Willemijn
Worring, Marcel
author_facet Sharma, Ujjwal
Rudinac, Stevan
Mićković, Ana
van Dolen, Willemijn
Worring, Marcel
contents In this work, we introduce a multimodal analysis pipeline that leverages large foundation models in vision and language to analyze corporate social media content, with a focus on sustainability-related communication. Addressing the challenges of evolving, multimodal, and often ambiguous corporate messaging on platforms such as X (formerly Twitter), we employ an ensemble of large language models (LLMs) to annotate a large corpus of corporate tweets on their topical alignment with the 17 Sustainable Development Goals (SDGs). This approach avoids the need for costly, task-specific annotations and explores the potential of such models as ad-hoc annotators for social media data that can efficiently capture both explicit and implicit references to sustainability themes in a scalable manner. Complementing this textual analysis, we utilize vision-language models (VLMs), within a visual understanding framework that uses semantic clusters to uncover patterns in visual sustainability communication. This integrated approach reveals sectoral differences in SDG engagement, temporal trends, and associations between corporate messaging, environmental, social, governance (ESG) risks, and consumer engagement. Our methods-automatic label generation and semantic visual clustering-are broadly applicable to other domains and offer a flexible framework for large-scale social media analysis.
format Preprint
id arxiv_https___arxiv_org_abs_2511_01550
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Analyzing Sustainability Messaging in Large-Scale Corporate Social Media
Sharma, Ujjwal
Rudinac, Stevan
Mićković, Ana
van Dolen, Willemijn
Worring, Marcel
Artificial Intelligence
In this work, we introduce a multimodal analysis pipeline that leverages large foundation models in vision and language to analyze corporate social media content, with a focus on sustainability-related communication. Addressing the challenges of evolving, multimodal, and often ambiguous corporate messaging on platforms such as X (formerly Twitter), we employ an ensemble of large language models (LLMs) to annotate a large corpus of corporate tweets on their topical alignment with the 17 Sustainable Development Goals (SDGs). This approach avoids the need for costly, task-specific annotations and explores the potential of such models as ad-hoc annotators for social media data that can efficiently capture both explicit and implicit references to sustainability themes in a scalable manner. Complementing this textual analysis, we utilize vision-language models (VLMs), within a visual understanding framework that uses semantic clusters to uncover patterns in visual sustainability communication. This integrated approach reveals sectoral differences in SDG engagement, temporal trends, and associations between corporate messaging, environmental, social, governance (ESG) risks, and consumer engagement. Our methods-automatic label generation and semantic visual clustering-are broadly applicable to other domains and offer a flexible framework for large-scale social media analysis.
title Analyzing Sustainability Messaging in Large-Scale Corporate Social Media
topic Artificial Intelligence
url https://arxiv.org/abs/2511.01550