Saved in:
Bibliographic Details
Main Authors: Callejas, Sofia, Gomez, Nahuel, Pelachaud, Catherine, Ravenet, Brian, Barriere, Valentin
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.06309
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911679010308096
author Callejas, Sofia
Gomez, Nahuel
Pelachaud, Catherine
Ravenet, Brian
Barriere, Valentin
author_facet Callejas, Sofia
Gomez, Nahuel
Pelachaud, Catherine
Ravenet, Brian
Barriere, Valentin
contents Laughter is a social non-vocalization that is universal across cultures and languages, and is crucial for human communication, including social bonding and communication signaling. However, detecting laughter in audio is a challenging task, and segmenting is even more difficult. Currently, Machine Learning methods generally rely on costly manual annotation, and their datasets are mostly based on English contexts. Thus, we propose an unsupervised multilingual method that sets up the laughter segmentation task as an anomaly detection of energy-based segmented audio sequences. Our method applies an Isolation Forest on audio representations learned from BYOL-A encoder. We compare our method with several state-of-the-art laughter detection algorithms on four datasets, including stand-up comedy, sitcoms, and general short audio from AudioSet. Our results show that state-of-the-art methods are not optimized for multilingual contexts, while our method outperforms them in non-English settings.
format Preprint
id arxiv_https___arxiv_org_abs_2605_06309
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle MultiLinguahah : A New Unsupervised Multilingual Acoustic Laughter Segmentation Method
Callejas, Sofia
Gomez, Nahuel
Pelachaud, Catherine
Ravenet, Brian
Barriere, Valentin
Computation and Language
Laughter is a social non-vocalization that is universal across cultures and languages, and is crucial for human communication, including social bonding and communication signaling. However, detecting laughter in audio is a challenging task, and segmenting is even more difficult. Currently, Machine Learning methods generally rely on costly manual annotation, and their datasets are mostly based on English contexts. Thus, we propose an unsupervised multilingual method that sets up the laughter segmentation task as an anomaly detection of energy-based segmented audio sequences. Our method applies an Isolation Forest on audio representations learned from BYOL-A encoder. We compare our method with several state-of-the-art laughter detection algorithms on four datasets, including stand-up comedy, sitcoms, and general short audio from AudioSet. Our results show that state-of-the-art methods are not optimized for multilingual contexts, while our method outperforms them in non-English settings.
title MultiLinguahah : A New Unsupervised Multilingual Acoustic Laughter Segmentation Method
topic Computation and Language
url https://arxiv.org/abs/2605.06309