Bibliographic Details
Main Authors: Simone Pinna, Francesca Maridina Malloci, Mirko Marras, Diego Reforgiato Recupero, Daniele Riboni, Giuseppe Scarpi
Format: Digital resource
Language:
Published: Zenodo 2026
Online Access: https://doi.org/10.5281/zenodo.19098231
_version_ 1866901946319765504
author Simone Pinna
Francesca Maridina Malloci
Mirko Marras
Diego Reforgiato Recupero
Daniele Riboni
Giuseppe Scarpi
contents <p>This dataset is a collection of multilingual audio recordings made in a controlled environment and labelled according to the tone of urgency used in a hypothetical emergency scenario.<br>The dataset contains 200 audio files in 5 languages: English, French, German, Italian, and Dutch. Each language has 40 audio files, split equally between urgent (20) and non-urgent (20). Urgent audio files are recordings of speech in simulated emergency situations in which the speaker adopts an urgent tone. Non-urgent audio files are recordings of general content in which the speaker uses a neutral tone of voice.<br>The English audio files were produced using artificial intelligence-based text-to-speech software.<br>The audio files for each of the other four languages were recorded by a single native speaker of that language (4 speakers in total).</p> <p>All audio files are provided in uncompressed WAV format with the following characteristics:</p> <ul> <li>Codec: pcm_s16le</li> <li>Sample rate: 48 kHz</li> <li>Channels: 2 (stereo)</li> </ul> <p>Structure of the dataset:</p> <p><br>dataset:<br>    data:<br>         languages:<br>              urgent:<br>                   audio_xx_urgent.wav<br>                   transcription_language.txt<br>              not_urgent:<br>                   audio_xx_not_urgent.wav<br>                   transcription_language.txt</p>
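The folder layout and audio characteristics described in this record can be checked after download with a short script. The following is a minimal sketch, not part of the dataset itself. It assumes that `root` points at the `data` folder, that `languages` in the structure above stands for one folder per language, and that the label folders are named `urgent` and `not_urgent`. The function names `index_dataset`, `wav_matches_spec`, and `count_by_label` are hypothetical helpers introduced here for illustration.

```python
import wave
from collections import Counter
from pathlib import Path

# Values taken from the record: pcm_s16le (2 bytes per sample), 48 kHz, stereo.
EXPECTED_CHANNELS = 2
EXPECTED_SAMPLE_WIDTH_BYTES = 2
EXPECTED_FRAME_RATE_HZ = 48000

# Assumed label folder names, based on the structure shown in the record.
LABELS = ("urgent", "not_urgent")


def index_dataset(root):
    """Return a list of (language, label, path) for WAV files found
    under <root>/<language>/<label>/ (assumed layout)."""
    rows = []
    for wav_path in sorted(Path(root).glob("*/*/*.wav")):
        label = wav_path.parent.name
        if label in LABELS:
            language = wav_path.parent.parent.name
            rows.append((language, label, wav_path))
    return rows


def wav_matches_spec(path):
    """Return True if the WAV header reports 16-bit, 48 kHz, stereo audio."""
    with wave.open(str(path), "rb") as wav_file:
        return (
            wav_file.getnchannels() == EXPECTED_CHANNELS
            and wav_file.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav_file.getframerate() == EXPECTED_FRAME_RATE_HZ
        )


def count_by_label(rows):
    """Count files per (language, label) pair."""
    return Counter((language, label) for language, label, _ in rows)
```

If the assumptions hold, `count_by_label(index_dataset("dataset/data"))` should report 20 files for each language and label pair, and any file for which `wav_matches_spec` returns `False` deviates from the characteristics listed in the record.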
format Digital resource
id zenodo_https___doi_org_10_5281_zenodo_19098231
institution Zenodo
language
publishDate 2026
publisher Zenodo
record_format zenodo
title Multilingual speech dataset with urgency tone annotation
url https://doi.org/10.5281/zenodo.19098231