Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Callejas, Sofia, Gomez, Nahuel, Pelachaud, Catherine, Ravenet, Brian, Barriere, Valentin
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2605.06309
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911679010308096
author	Callejas, Sofia Gomez, Nahuel Pelachaud, Catherine Ravenet, Brian Barriere, Valentin
author_facet	Callejas, Sofia Gomez, Nahuel Pelachaud, Catherine Ravenet, Brian Barriere, Valentin
contents	Laughter is a social non-vocalization that is universal across cultures and languages, and is crucial for human communication, including social bonding and communication signaling. However, detecting laughter in audio is a challenging task, and segmenting is even more difficult. Currently, Machine Learning methods generally rely on costly manual annotation, and their datasets are mostly based on English contexts. Thus, we propose an unsupervised multilingual method that sets up the laughter segmentation task as an anomaly detection of energy-based segmented audio sequences. Our method applies an Isolation Forest on audio representations learned from BYOL-A encoder. We compare our method with several state-of-the-art laughter detection algorithms on four datasets, including stand-up comedy, sitcoms, and general short audio from AudioSet. Our results show that state-of-the-art methods are not optimized for multilingual contexts, while our method outperforms them in non-English settings.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_06309
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	MultiLinguahah : A New Unsupervised Multilingual Acoustic Laughter Segmentation Method Callejas, Sofia Gomez, Nahuel Pelachaud, Catherine Ravenet, Brian Barriere, Valentin Computation and Language Laughter is a social non-vocalization that is universal across cultures and languages, and is crucial for human communication, including social bonding and communication signaling. However, detecting laughter in audio is a challenging task, and segmenting is even more difficult. Currently, Machine Learning methods generally rely on costly manual annotation, and their datasets are mostly based on English contexts. Thus, we propose an unsupervised multilingual method that sets up the laughter segmentation task as an anomaly detection of energy-based segmented audio sequences. Our method applies an Isolation Forest on audio representations learned from BYOL-A encoder. We compare our method with several state-of-the-art laughter detection algorithms on four datasets, including stand-up comedy, sitcoms, and general short audio from AudioSet. Our results show that state-of-the-art methods are not optimized for multilingual contexts, while our method outperforms them in non-English settings.
title	MultiLinguahah : A New Unsupervised Multilingual Acoustic Laughter Segmentation Method
topic	Computation and Language
url	https://arxiv.org/abs/2605.06309

Similar Items