Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Padhi, Trilok, Kursuncu, Ugur, Kumar, Yaman, Shalin, Valerie L., Fronczek, Lane Peterson
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition Computers and Society Human-Computer Interaction I.2.7; I.2.10; I.2.4; I.2.1
Online Access:	https://arxiv.org/abs/2402.03607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910946531737600
author	Padhi, Trilok Kursuncu, Ugur Kumar, Yaman Shalin, Valerie L. Fronczek, Lane Peterson
author_facet	Padhi, Trilok Kursuncu, Ugur Kumar, Yaman Shalin, Valerie L. Fronczek, Lane Peterson
contents	The digital landscape continually evolves with multimodality, enriching the online experience for users. Creators and marketers aim to weave subtle contextual cues from various modalities into congruent content to engage users with a harmonious message. This interplay of multimodal cues is often a crucial factor in attracting users' attention. However, this richness of multimodality presents a challenge to computational modeling, as the semantic contextual cues spanning across modalities need to be unified to capture the true holistic meaning of the multimodal content. This contextual meaning is critical in attracting user engagement as it conveys the intended message of the brand or the organization. In this work, we incorporate external commonsense knowledge from knowledge graphs to enhance the representation of multimodal data using compact Visual Language Models (VLMs) and predict the success of multi-modal crowdfunding campaigns. Our results show that external knowledge commonsense bridges the semantic gap between text and image modalities, and the enhanced knowledge-infused representations improve the predictive performance of models for campaign success upon the baselines without knowledge. Our findings highlight the significance of contextual congruence in online multimodal content for engaging and successful crowdfunding campaigns.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_03607
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Enhancing Cross-Modal Contextual Congruence for Crowdfunding Success using Knowledge-infused Learning Padhi, Trilok Kursuncu, Ugur Kumar, Yaman Shalin, Valerie L. Fronczek, Lane Peterson Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition Computers and Society Human-Computer Interaction I.2.7; I.2.10; I.2.4; I.2.1 The digital landscape continually evolves with multimodality, enriching the online experience for users. Creators and marketers aim to weave subtle contextual cues from various modalities into congruent content to engage users with a harmonious message. This interplay of multimodal cues is often a crucial factor in attracting users' attention. However, this richness of multimodality presents a challenge to computational modeling, as the semantic contextual cues spanning across modalities need to be unified to capture the true holistic meaning of the multimodal content. This contextual meaning is critical in attracting user engagement as it conveys the intended message of the brand or the organization. In this work, we incorporate external commonsense knowledge from knowledge graphs to enhance the representation of multimodal data using compact Visual Language Models (VLMs) and predict the success of multi-modal crowdfunding campaigns. Our results show that external knowledge commonsense bridges the semantic gap between text and image modalities, and the enhanced knowledge-infused representations improve the predictive performance of models for campaign success upon the baselines without knowledge. Our findings highlight the significance of contextual congruence in online multimodal content for engaging and successful crowdfunding campaigns.
title	Enhancing Cross-Modal Contextual Congruence for Crowdfunding Success using Knowledge-infused Learning
topic	Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition Computers and Society Human-Computer Interaction I.2.7; I.2.10; I.2.4; I.2.1
url	https://arxiv.org/abs/2402.03607

Similar Items