Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Simone Pinna, Francesca Maridina Malloci, Mirko Marras, Diego Reforgiato Recupero, Daniele Riboni, Giuseppe Scarpi
Format:	Recurso digital
Language:
Published:	Zenodo 2026
Online Access:	https://doi.org/10.5281/zenodo.19098231
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866901946319765504
author	Simone Pinna Francesca Maridina Malloci Mirko Marras Diego Reforgiato Recupero Daniele Riboni Giuseppe Scarpi
author_facet	Simone Pinna Francesca Maridina Malloci Mirko Marras Diego Reforgiato Recupero Daniele Riboni Giuseppe Scarpi
contents	<p>The following dataset is a collection of multilingual audio recordings in a controlled environment, divided according to the tone of urgency used in a hypothetical emergency scenario.<br>The dataset contains 200 audio files in 5 different languages: English, French, German, Italian, and Dutch. There are 40 audio files for each language, divided equally (20 and 20) between those labelled as urgent and those labelled as non-urgent. Urgent audio files are recordings of speech in simulated emergency situations in which the speaker adopts a tone defined as urgent. Non-urgent audio files are recordings of general content in which the speaker uses a neutral tone of voice. <br>The English audio files are produced using artificial intelligence-based text-to-speech software. <br>The remaining audio files for each language are recorded by the same person (i.e. 4 people), who are native speakers of the respective language. </p> <p> </p> <p>All audio files are provided in uncompressed WAV format with the following characteristics:</p> <ul> <li>Codec : pcm_s16le</li> <li>Sample rate: 48 kHZ</li> <li>Channels: 2 (stereo)</li> </ul> <p> </p> <p>Structure of the dataset:</p> <p><br>dataset:<br>    data:<br>         languages:<br>              urgent:<br>                   audio_xx_urgent.wav<br>                   transcription_language.txt<br>              not_urgent:<br>                   audio_xx_not_urgent.wav<br>                   transcription_language.txt</p>
format	Recurso digital
id	zenodo_https___doi_org_10_5281_zenodo_19098231
institution	Zenodo
language
publishDate	2026
publisher	Zenodo
record_format	zenodo
spellingShingle	Multilingual speech dataset with urgency tone annotation Simone Pinna Francesca Maridina Malloci Mirko Marras Diego Reforgiato Recupero Daniele Riboni Giuseppe Scarpi <p>The following dataset is a collection of multilingual audio recordings in a controlled environment, divided according to the tone of urgency used in a hypothetical emergency scenario.<br>The dataset contains 200 audio files in 5 different languages: English, French, German, Italian, and Dutch. There are 40 audio files for each language, divided equally (20 and 20) between those labelled as urgent and those labelled as non-urgent. Urgent audio files are recordings of speech in simulated emergency situations in which the speaker adopts a tone defined as urgent. Non-urgent audio files are recordings of general content in which the speaker uses a neutral tone of voice. <br>The English audio files are produced using artificial intelligence-based text-to-speech software. <br>The remaining audio files for each language are recorded by the same person (i.e. 4 people), who are native speakers of the respective language. </p> <p> </p> <p>All audio files are provided in uncompressed WAV format with the following characteristics:</p> <ul> <li>Codec : pcm_s16le</li> <li>Sample rate: 48 kHZ</li> <li>Channels: 2 (stereo)</li> </ul> <p> </p> <p>Structure of the dataset:</p> <p><br>dataset:<br>    data:<br>         languages:<br>              urgent:<br>                   audio_xx_urgent.wav<br>                   transcription_language.txt<br>              not_urgent:<br>                   audio_xx_not_urgent.wav<br>                   transcription_language.txt</p>
title	Multilingual speech dataset with urgency tone annotation
url	https://doi.org/10.5281/zenodo.19098231

Similar Items