:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Duret, Jarod, Mdhaffar, Salima, Laperrière, Gaëlle, Whetten, Ryan, Galametz, Audrey, Kobus, Catherine, Martin, Marion-Cécile, Oleiwan, Jo, Estève, Yannick
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computation and Language Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2509.12101
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Ara-Best-RQ: Multi Dialectal Arabic SSL
von: Elleuch, Haroun, et al.
Veröffentlicht: (2026)

Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect
von: Mdhaffar, Salima, et al.
Veröffentlicht: (2024)

ELYADATA & LIA at NADI 2025: ASR and ADI Subtasks
von: Elleuch, Haroun, et al.
Veröffentlicht: (2025)

Learning Multiple Utterance-Level Attribute Representations with a Unified Speech Encoder
von: Bouziane, Maryem, et al.
Veröffentlicht: (2026)

Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation
von: Duret, Jarod, et al.
Veröffentlicht: (2024)

MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition
von: Duret, Jarod, et al.
Veröffentlicht: (2024)

TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English
von: Bougares, Fethi, et al.
Veröffentlicht: (2025)

Using Multimodal and Language-Agnostic Sentence Embeddings for Abstractive Summarization
von: Chellaf, Chaimae, et al.
Veröffentlicht: (2026)

SLURP-TN : Resource for Tunisian Dialect Spoken Language Understanding
von: Elleuch, Haroun, et al.
Veröffentlicht: (2026)

ADI-20: Arabic Dialect Identification dataset and models
von: Elleuch, Haroun, et al.
Veröffentlicht: (2025)

A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding
von: Laperrière, Gaëlle, et al.
Veröffentlicht: (2024)

A Study of Data Selection Strategies for Pre-training Self-Supervised Speech Models
von: Whetten, Ryan, et al.
Veröffentlicht: (2026)

SENSE models: an open source solution for multilingual and multimodal semantic-based tasks
von: Mdhaffar, Salima, et al.
Veröffentlicht: (2025)

Semantic enrichment towards efficient speech representations
von: Laperrière, Gaëlle, et al.
Veröffentlicht: (2023)

An Ultra-Low Latency, End-to-End Streaming Speech Synthesis Architecture via Block-Wise Generation and Depth-Wise Codec Decoding
von: Su, Tianhui, et al.
Veröffentlicht: (2026)

Open Implementation and Study of BEST-RQ for Speech Processing
von: Whetten, Ryan, et al.
Veröffentlicht: (2024)

An Analysis of Linear Complexity Attention Substitutes with BEST-RQ
von: Whetten, Ryan, et al.
Veröffentlicht: (2024)

Towards Early Prediction of Self-Supervised Speech Model Performance
von: Whetten, Ryan, et al.
Veröffentlicht: (2025)

Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
von: Sekkat, Chloé, et al.
Veröffentlicht: (2024)

ATLANTIS at SemEval-2025 Task 3: Detecting Hallucinated Text Spans in Question Answering
von: Kobus, Catherine, et al.
Veröffentlicht: (2025)

Surrogate Neural Networks Local Stability for Aircraft Predictive Maintenance
von: Ducoffe, Mélanie, et al.
Veröffentlicht: (2024)

New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark
von: Alavoine, Nadège, et al.
Veröffentlicht: (2024)

Strategies for improving low resource speech to text translation relying on pre-trained ASR models
von: Kesiraju, Santosh, et al.
Veröffentlicht: (2023)

O caso de uma comunidade avaliativa emergente: re-apropriação pelos pares-multiplicadores da apreciação de suas próprias ações preventivas contra DST/HIV/AIDS, Amazonas, Brasil
von: Hélène Laperrière
Veröffentlicht: (2008)

CUANDO LA COMUNIDAD GUÍA LA ACCIÓN: HACIA UNA EVALUACIÓN COMUNITARIA ALTERNATIVA
von: Hélène Laperrière
Veröffentlicht: (2007)

Open-Source Conversational AI with SpeechBrain 1.0
von: Ravanelli, Mirco, et al.
Veröffentlicht: (2024)

Exploring SSL Discrete Tokens for Multilingual ASR
von: Cui, Mingyu, et al.
Veröffentlicht: (2024)

Evaluating LLM Abilities to Understand Tabular Electronic Health Records: A Comprehensive Study of Patient Data Extraction and Retrieval
von: Lovon, Jesus, et al.
Veröffentlicht: (2025)

O QUE CONSTITUI UMA CONTRIBUIÇÃO TEÓRICA?
von: David A. Whetten
Veröffentlicht: (2003)

Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR
von: Cui, Mingyu, et al.
Veröffentlicht: (2024)

Mind the Shift: Using Delta SSL Embeddings to Enhance Child ASR
von: Wang, Zilai, et al.
Veröffentlicht: (2026)

Polynomial Mixing for Efficient Self-supervised Speech Encoders
von: Feillet, Eva, et al.
Veröffentlicht: (2026)

VerifIoU -- Robustness of Object Detection to Perturbations
von: Cohen, Noémie, et al.
Veröffentlicht: (2024)

El Proyecto Limpopo: evidencia empírica sobre el concepto de inteligencia emocional-social
von: Kobus MAREE
Veröffentlicht: (2011)

Dante se mistieke reis
von: Krüger, Kobus
Veröffentlicht: (2021)

Collatz Representations With Bounded Partial Quotients
von: Kobus, Franciszek
Veröffentlicht: (2025)

From pre-training to downstream performance: Does domain-specific pre-training make sense?
von: Krones, Felix
Veröffentlicht: (2026)

How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
von: Mousavi, Pooneh, et al.
Veröffentlicht: (2024)

«In Agro Crotoniensi» – Archéologie et histoire de Crotone durant la période romaine (3ème siècle av. J.-C. – 6ème siècle apr. J.-C.) – KROTON 2
von: Duret, Marc
Veröffentlicht: (2023)

UniEnc-CASSNAT: An Encoder-only Non-autoregressive ASR for Speech SSL Models
von: Fan, Ruchao, et al.
Veröffentlicht: (2024)