Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Ani, Saja Al, Cleland, Joanne, Zoha, Ahmed
Formato:	Preprint
Publicado:	2024
Materias:	Sound Artificial Intelligence Computer Vision and Pattern Recognition Audio and Speech Processing
Acceso en línea:	https://arxiv.org/abs/2402.17482
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866917133157400576
author	Ani, Saja Al Cleland, Joanne Zoha, Ahmed
author_facet	Ani, Saja Al Cleland, Joanne Zoha, Ahmed
contents	Speech sound disorder (SSD) is defined as a persistent impairment in speech sound production leading to reduced speech intelligibility and hindered verbal communication. Early recognition and intervention of children with SSD and timely referral to speech and language therapists (SLTs) for treatment are crucial. Automated detection of speech impairment is regarded as an efficient method for examining and screening large populations. This study focuses on advancing the automatic diagnosis of SSD in early childhood by proposing a technical solution that integrates ultrasound tongue imaging (UTI) with deep-learning models. The introduced FusionNet model combines UTI data with the extracted texture features to classify UTI. The overarching aim is to elevate the accuracy and efficiency of UTI analysis, particularly for classifying speech sounds associated with SSD. This study compared the FusionNet approach with standard deep-learning methodologies, highlighting the excellent improvement results of the FusionNet model in UTI classification and the potential of multi-learning in improving UTI classification in speech therapy clinics.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_17482
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Automated Classification of Phonetic Segments in Child Speech Using Raw Ultrasound Imaging Ani, Saja Al Cleland, Joanne Zoha, Ahmed Sound Artificial Intelligence Computer Vision and Pattern Recognition Audio and Speech Processing Speech sound disorder (SSD) is defined as a persistent impairment in speech sound production leading to reduced speech intelligibility and hindered verbal communication. Early recognition and intervention of children with SSD and timely referral to speech and language therapists (SLTs) for treatment are crucial. Automated detection of speech impairment is regarded as an efficient method for examining and screening large populations. This study focuses on advancing the automatic diagnosis of SSD in early childhood by proposing a technical solution that integrates ultrasound tongue imaging (UTI) with deep-learning models. The introduced FusionNet model combines UTI data with the extracted texture features to classify UTI. The overarching aim is to elevate the accuracy and efficiency of UTI analysis, particularly for classifying speech sounds associated with SSD. This study compared the FusionNet approach with standard deep-learning methodologies, highlighting the excellent improvement results of the FusionNet model in UTI classification and the potential of multi-learning in improving UTI classification in speech therapy clinics.
title	Automated Classification of Phonetic Segments in Child Speech Using Raw Ultrasound Imaging
topic	Sound Artificial Intelligence Computer Vision and Pattern Recognition Audio and Speech Processing
url	https://arxiv.org/abs/2402.17482

Ejemplares similares