
Bibliographic Details
Main Authors:	Simone Pinna, Francesca Maridina Malloci, Mirko Marras, Diego Reforgiato Recupero, Daniele Riboni, Giuseppe Scarpi
Format:	Digital resource
Language:
Published:	Zenodo 2026
Online Access:	https://doi.org/10.5281/zenodo.19098231

Table of Contents:

This dataset is a collection of multilingual audio recordings made in a controlled environment and divided according to the tone of urgency used in a hypothetical emergency scenario. It contains 200 audio files in 5 languages: English, French, German, Italian, and Dutch. There are 40 audio files per language, split equally into 20 labelled urgent and 20 labelled non-urgent. Urgent files are recordings of speech in simulated emergency situations in which the speaker adopts an urgent tone. Non-urgent files are recordings of general content spoken in a neutral tone of voice.

The English audio files were produced with artificial intelligence-based text-to-speech software. For each of the other four languages, all files were recorded by a single native speaker of that language (four speakers in total).

All audio files are provided in uncompressed WAV format with the following characteristics:

- Codec: pcm_s16le
- Sample rate: 48 kHz
- Channels: 2 (stereo)

Structure of the dataset:

dataset:
    data:
        languages:
            urgent:
                audio_xx_urgent.wav
                transcription_language.txt
            not_urgent:
                audio_xx_not_urgent.wav
                transcription_language.txt
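
The stated file counts and audio characteristics can be checked after download. The following is a minimal sketch in Python using only the standard library. It assumes that the data folder holds one subfolder per language, each with an urgent and a not_urgent subfolder containing the WAV files; this layout is a reading of the structure described above and has not been checked against the published archive. The function name check_dataset and the path passed to it are hypothetical.

```python
import wave
from collections import Counter
from pathlib import Path

# Expected properties, taken from the dataset description.
EXPECTED_RATE = 48_000      # 48 kHz
EXPECTED_CHANNELS = 2       # stereo
EXPECTED_SAMPLE_WIDTH = 2   # bytes per sample, i.e. 16-bit PCM (pcm_s16le)


def check_dataset(data_dir):
    """Count WAV files per (language, label) and list files with unexpected audio properties.

    Assumes the layout data_dir/<language>/<label>/<file>.wav, where <label>
    is "urgent" or "not_urgent" (an assumption, not verified against the archive).
    """
    expected = (EXPECTED_RATE, EXPECTED_CHANNELS, EXPECTED_SAMPLE_WIDTH)
    counts = Counter()
    problems = []
    for wav_path in sorted(Path(data_dir).glob("*/*/*.wav")):
        language = wav_path.parent.parent.name
        label = wav_path.parent.name
        counts[(language, label)] += 1
        # Read only the header fields; the audio frames are not loaded.
        with wave.open(str(wav_path), "rb") as wav:
            actual = (wav.getframerate(), wav.getnchannels(), wav.getsampwidth())
        if actual != expected:
            problems.append((wav_path, actual))
    return counts, problems
```

If the layout assumption holds, calling check_dataset on the data folder of the full dataset should yield ten (language, label) pairs with 20 files each and an empty list of problems, in line with the description above.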

Similar Items