Saved in:
Bibliographic Details
Main Authors: Rohanian, Morteza, Hüppi, Roya M., Nooralahzadeh, Farhad, Dannecker, Noemi, Pauli, Yves, Surbeck, Werner, Sommer, Iris, Hinzen, Wolfram, Langer, Nicolas, Krauthammer, Michael, Homan, Philipp
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.18285
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912245553823744
author Rohanian, Morteza
Hüppi, Roya M.
Nooralahzadeh, Farhad
Dannecker, Noemi
Pauli, Yves
Surbeck, Werner
Sommer, Iris
Hinzen, Wolfram
Langer, Nicolas
Krauthammer, Michael
Homan, Philipp
author_facet Rohanian, Morteza
Hüppi, Roya M.
Nooralahzadeh, Farhad
Dannecker, Noemi
Pauli, Yves
Surbeck, Werner
Sommer, Iris
Hinzen, Wolfram
Langer, Nicolas
Krauthammer, Michael
Homan, Philipp
contents Capturing subtle speech disruptions across the psychosis spectrum is challenging because of the inherent variability in speech patterns. This variability reflects individual differences and the fluctuating nature of symptoms in both clinical and non-clinical populations. Accounting for uncertainty in speech data is essential for predicting symptom severity and improving diagnostic precision. Speech disruptions characteristic of psychosis appear across the spectrum, including in non-clinical individuals. We develop an uncertainty-aware model integrating acoustic and linguistic features to predict symptom severity and psychosis-related traits. Quantifying uncertainty in specific modalities allows the model to address speech variability, improving prediction accuracy. We analyzed speech data from 114 participants, including 32 individuals with early psychosis and 82 with low or high schizotypy, collected through structured interviews, semi-structured autobiographical tasks, and narrative-driven interactions in German. The model improved prediction accuracy, reducing RMSE and achieving an F1-score of 83% with ECE = 4.5e-2, showing robust performance across different interaction contexts. Uncertainty estimation improved model interpretability by identifying reliability differences in speech markers such as pitch variability, fluency disruptions, and spectral instability. The model dynamically adjusted to task structures, weighting acoustic features more in structured settings and linguistic features in unstructured contexts. This approach strengthens early detection, personalized assessment, and clinical decision-making in psychosis-spectrum research.
format Preprint
id arxiv_https___arxiv_org_abs_2502_18285
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Uncertainty Modeling in Multimodal Speech Analysis Across the Psychosis Spectrum
Rohanian, Morteza
Hüppi, Roya M.
Nooralahzadeh, Farhad
Dannecker, Noemi
Pauli, Yves
Surbeck, Werner
Sommer, Iris
Hinzen, Wolfram
Langer, Nicolas
Krauthammer, Michael
Homan, Philipp
Computation and Language
Capturing subtle speech disruptions across the psychosis spectrum is challenging because of the inherent variability in speech patterns. This variability reflects individual differences and the fluctuating nature of symptoms in both clinical and non-clinical populations. Accounting for uncertainty in speech data is essential for predicting symptom severity and improving diagnostic precision. Speech disruptions characteristic of psychosis appear across the spectrum, including in non-clinical individuals. We develop an uncertainty-aware model integrating acoustic and linguistic features to predict symptom severity and psychosis-related traits. Quantifying uncertainty in specific modalities allows the model to address speech variability, improving prediction accuracy. We analyzed speech data from 114 participants, including 32 individuals with early psychosis and 82 with low or high schizotypy, collected through structured interviews, semi-structured autobiographical tasks, and narrative-driven interactions in German. The model improved prediction accuracy, reducing RMSE and achieving an F1-score of 83% with ECE = 4.5e-2, showing robust performance across different interaction contexts. Uncertainty estimation improved model interpretability by identifying reliability differences in speech markers such as pitch variability, fluency disruptions, and spectral instability. The model dynamically adjusted to task structures, weighting acoustic features more in structured settings and linguistic features in unstructured contexts. This approach strengthens early detection, personalized assessment, and clinical decision-making in psychosis-spectrum research.
title Uncertainty Modeling in Multimodal Speech Analysis Across the Psychosis Spectrum
topic Computation and Language
url https://arxiv.org/abs/2502.18285