Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Recurso digital |
| Language: | |
| Published: |
Zenodo
2026
|
| Online Access: | https://doi.org/10.5281/zenodo.19098231 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866901946319765504 |
|---|---|
| author | Simone Pinna Francesca Maridina Malloci Mirko Marras Diego Reforgiato Recupero Daniele Riboni Giuseppe Scarpi |
| author_facet | Simone Pinna Francesca Maridina Malloci Mirko Marras Diego Reforgiato Recupero Daniele Riboni Giuseppe Scarpi |
| contents | <p>The following dataset is a collection of multilingual audio recordings in a controlled environment, divided according to the tone of urgency used in a hypothetical emergency scenario.<br>The dataset contains 200 audio files in 5 different languages: English, French, German, Italian, and Dutch. There are 40 audio files for each language, divided equally (20 and 20) between those labelled as urgent and those labelled as non-urgent. Urgent audio files are recordings of speech in simulated emergency situations in which the speaker adopts a tone defined as urgent. Non-urgent audio files are recordings of general content in which the speaker uses a neutral tone of voice. <br>The English audio files are produced using artificial intelligence-based text-to-speech software. <br>The remaining audio files for each language are recorded by the same person (i.e. 4 people), who are native speakers of the respective language. </p> <p> </p> <p>All audio files are provided in uncompressed WAV format with the following characteristics:</p> <ul> <li>Codec : pcm_s16le</li> <li>Sample rate: 48 kHZ</li> <li>Channels: 2 (stereo)</li> </ul> <p> </p> <p>Structure of the dataset:</p> <p><br>dataset:<br> data:<br> languages:<br> urgent:<br> audio_xx_urgent.wav<br> transcription_language.txt<br> not_urgent:<br> audio_xx_not_urgent.wav<br> transcription_language.txt</p> |
| format | Recurso digital |
| id | zenodo_https___doi_org_10_5281_zenodo_19098231 |
| institution | Zenodo |
| language | |
| publishDate | 2026 |
| publisher | Zenodo |
| record_format | zenodo |
| spellingShingle | Multilingual speech dataset with urgency tone annotation Simone Pinna Francesca Maridina Malloci Mirko Marras Diego Reforgiato Recupero Daniele Riboni Giuseppe Scarpi <p>The following dataset is a collection of multilingual audio recordings in a controlled environment, divided according to the tone of urgency used in a hypothetical emergency scenario.<br>The dataset contains 200 audio files in 5 different languages: English, French, German, Italian, and Dutch. There are 40 audio files for each language, divided equally (20 and 20) between those labelled as urgent and those labelled as non-urgent. Urgent audio files are recordings of speech in simulated emergency situations in which the speaker adopts a tone defined as urgent. Non-urgent audio files are recordings of general content in which the speaker uses a neutral tone of voice. <br>The English audio files are produced using artificial intelligence-based text-to-speech software. <br>The remaining audio files for each language are recorded by the same person (i.e. 4 people), who are native speakers of the respective language. </p> <p> </p> <p>All audio files are provided in uncompressed WAV format with the following characteristics:</p> <ul> <li>Codec : pcm_s16le</li> <li>Sample rate: 48 kHZ</li> <li>Channels: 2 (stereo)</li> </ul> <p> </p> <p>Structure of the dataset:</p> <p><br>dataset:<br> data:<br> languages:<br> urgent:<br> audio_xx_urgent.wav<br> transcription_language.txt<br> not_urgent:<br> audio_xx_not_urgent.wav<br> transcription_language.txt</p> |
| title | Multilingual speech dataset with urgency tone annotation |
| url | https://doi.org/10.5281/zenodo.19098231 |