Saved in:
Bibliographic Details
Main Authors: Policzer, Nico, Braunstein, Cameron, Toneva, Mariya
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.07988
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Recent studies on audio models show brain-tuning - fine-tuning models to better predict corresponding fMRI activity - improves brain alignment and increases performance on downstream semantic and audio tasks. We extend this approach to a multimodal audio-video model to enhance social cognition, targeting the Superior Temporal Sulcus (STS), a key region for social processing, while subjects watch Friends. We find significant increases in brain alignment to the STS and an adjacent ROI, as well as improvements to a social cognition task related to the training data - sarcasm detection in sitcoms. In summary, our study extends brain-tuning to the multi-modal domain, demonstrating improvements to a downstream task after tuning to a relevant functional region.