Saved in:
Bibliographic Details
Main Authors: Łazęcka, Małgorzata, Szczurek, Ewa
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.18914
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910920260714496
author Łazęcka, Małgorzata
Szczurek, Ewa
author_facet Łazęcka, Małgorzata
Szczurek, Ewa
contents Integrating various data modalities brings valuable insights into underlying phenomena. Multimodal factor analysis (FA) uncovers shared axes of variation underlying different simple data modalities, where each sample is represented by a vector of features. However, FA is not suited for structured data modalities, such as text or single cell sequencing data, where multiple data points are measured per each sample and exhibit a clustering structure. To overcome this challenge, we introduce FACTM, a novel, multi-view and multi-structure Bayesian model that combines FA with correlated topic modeling and is optimized using variational inference. Additionally, we introduce a method for rotating latent factors to enhance interpretability with respect to binary features. On text and video benchmarks as well as real-world music and COVID-19 datasets, we demonstrate that FACTM outperforms other methods in identifying clusters in structured data, and integrating them with simple modalities via the inference of shared, interpretable factors.
format Preprint
id arxiv_https___arxiv_org_abs_2504_18914
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Factor Analysis with Correlated Topic Model for Multi-Modal Data
Łazęcka, Małgorzata
Szczurek, Ewa
Machine Learning
Applications
Integrating various data modalities brings valuable insights into underlying phenomena. Multimodal factor analysis (FA) uncovers shared axes of variation underlying different simple data modalities, where each sample is represented by a vector of features. However, FA is not suited for structured data modalities, such as text or single cell sequencing data, where multiple data points are measured per each sample and exhibit a clustering structure. To overcome this challenge, we introduce FACTM, a novel, multi-view and multi-structure Bayesian model that combines FA with correlated topic modeling and is optimized using variational inference. Additionally, we introduce a method for rotating latent factors to enhance interpretability with respect to binary features. On text and video benchmarks as well as real-world music and COVID-19 datasets, we demonstrate that FACTM outperforms other methods in identifying clusters in structured data, and integrating them with simple modalities via the inference of shared, interpretable factors.
title Factor Analysis with Correlated Topic Model for Multi-Modal Data
topic Machine Learning
Applications
url https://arxiv.org/abs/2504.18914