Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kawamura, Kazuki, Nakai, Kengo, Rekimoto, Jun
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Multimedia
Online Access:	https://arxiv.org/abs/2510.18014
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915565778501632
author	Kawamura, Kazuki Nakai, Kengo Rekimoto, Jun
author_facet	Kawamura, Kazuki Nakai, Kengo Rekimoto, Jun
contents	We present ManzaiSet, the first large scale multimodal dataset of viewer responses to Japanese manzai comedy, capturing facial videos and audio from 241 participants watching up to 10 professional performances in randomized order (94.6 percent watched >= 8; analyses focus on n=228). This addresses the Western centric bias in affective computing. Three key findings emerge: (1) k means clustering identified three distinct viewer types: High and Stable Appreciators (72.8 percent, n=166), Low and Variable Decliners (13.2 percent, n=30), and Variable Improvers (14.0 percent, n=32), with heterogeneity of variance (Brown Forsythe p < 0.001); (2) individual level analysis revealed a positive viewing order effect (mean slope = 0.488, t(227) = 5.42, p < 0.001, permutation p < 0.001), contradicting fatigue hypotheses; (3) automated humor classification (77 instances, 131 labels) plus viewer level response modeling found no type wise differences after FDR correction. The dataset enables culturally aware emotion AI development and personalized entertainment systems tailored to non Western contexts.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_18014
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	ManzaiSet: A Multimodal Dataset of Viewer Responses to Japanese Manzai Comedy Kawamura, Kazuki Nakai, Kengo Rekimoto, Jun Computer Vision and Pattern Recognition Multimedia We present ManzaiSet, the first large scale multimodal dataset of viewer responses to Japanese manzai comedy, capturing facial videos and audio from 241 participants watching up to 10 professional performances in randomized order (94.6 percent watched >= 8; analyses focus on n=228). This addresses the Western centric bias in affective computing. Three key findings emerge: (1) k means clustering identified three distinct viewer types: High and Stable Appreciators (72.8 percent, n=166), Low and Variable Decliners (13.2 percent, n=30), and Variable Improvers (14.0 percent, n=32), with heterogeneity of variance (Brown Forsythe p < 0.001); (2) individual level analysis revealed a positive viewing order effect (mean slope = 0.488, t(227) = 5.42, p < 0.001, permutation p < 0.001), contradicting fatigue hypotheses; (3) automated humor classification (77 instances, 131 labels) plus viewer level response modeling found no type wise differences after FDR correction. The dataset enables culturally aware emotion AI development and personalized entertainment systems tailored to non Western contexts.
title	ManzaiSet: A Multimodal Dataset of Viewer Responses to Japanese Manzai Comedy
topic	Computer Vision and Pattern Recognition Multimedia
url	https://arxiv.org/abs/2510.18014

Similar Items