Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Recurso digital |
| Language: | |
| Published: |
Zenodo
2026
|
| Online Access: | https://doi.org/10.5281/zenodo.19098231 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- <p>The following dataset is a collection of multilingual audio recordings in a controlled environment, divided according to the tone of urgency used in a hypothetical emergency scenario.<br>The dataset contains 200 audio files in 5 different languages: English, French, German, Italian, and Dutch. There are 40 audio files for each language, divided equally (20 and 20) between those labelled as urgent and those labelled as non-urgent. Urgent audio files are recordings of speech in simulated emergency situations in which the speaker adopts a tone defined as urgent. Non-urgent audio files are recordings of general content in which the speaker uses a neutral tone of voice. <br>The English audio files are produced using artificial intelligence-based text-to-speech software. <br>The remaining audio files for each language are recorded by the same person (i.e. 4 people), who are native speakers of the respective language. </p> <p> </p> <p>All audio files are provided in uncompressed WAV format with the following characteristics:</p> <ul> <li>Codec : pcm_s16le</li> <li>Sample rate: 48 kHZ</li> <li>Channels: 2 (stereo)</li> </ul> <p> </p> <p>Structure of the dataset:</p> <p><br>dataset:<br> data:<br> languages:<br> urgent:<br> audio_xx_urgent.wav<br> transcription_language.txt<br> not_urgent:<br> audio_xx_not_urgent.wav<br> transcription_language.txt</p>