:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nguyen, Tu Anh, Muller, Benjamin, Yu, Bokai, Costa-jussa, Marta R., Elbayad, Maha, Popuri, Sravya, Ropers, Christophe, Duquenne, Paul-Ambroise, Algayres, Robin, Mavlyutov, Ruslan, Gat, Itai, Williamson, Mary, Synnaeve, Gabriel, Pino, Juan, Sagot, Benoit, Dupoux, Emmanuel
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2402.05755
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024)

Merging Text Transformer Models from Different Initializations
by: Verma, Neha, et al.
Published: (2024)

BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?
by: Chambon, Pierre, et al.
Published: (2025)

SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision
by: Poli, Maxime, et al.
Published: (2025)

Unified Vision-Language Modeling via Concept Space Alignment
by: Qiu, Yifu, et al.
Published: (2026)

Interference Matrix: Quantifying Cross-Lingual Interference in Transformer Encoders
by: Alastruey, Belen, et al.
Published: (2025)

Towards Massive Multilingual Holistic Bias
by: Tan, Xiaoqing Ellen, et al.
Published: (2024)

Large Concept Models: Language Modeling in a Sentence Representation Space
by: LCM team, et al.
Published: (2024)

Textually Pretrained Speech Language Models
by: Hassid, Michael, et al.
Published: (2023)

Simple and Controllable Music Generation
by: Copet, Jade, et al.
Published: (2023)

Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach
by: Poli, Maxime, et al.
Published: (2024)

Discrete Flow Matching
by: Gat, Itai, et al.
Published: (2024)

LongTail-Swap: benchmarking language models' abilities on rare words
by: Algayres, Robin, et al.
Published: (2025)

Spoken to Spoken vs. Spoken to Written: Corpus Approach to Exploring Interpreting and Subtitling
by: Mikhail Mikhailov
Published: (2010)

VUGEN: Visual Understanding priors for GENeration
by: Chen, Xiangyi, et al.
Published: (2025)

Text-Guided Semantic Image Encoder
by: Thirukovalluru, Raghuveer, et al.
Published: (2025)

An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis
by: Peng, Yifan, et al.
Published: (2024)

MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation
by: Peng, Yifan, et al.
Published: (2024)

Masked Audio Generation using a Single Non-Autoregressive Transformer
by: Ziv, Alon, et al.
Published: (2024)

Chapter C7 Spoken and Written Performatives
by: Durant, Alan, et al.
Published: (2021)

2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset
by: Costa-jussà, Marta R., et al.
Published: (2024)

Set Block Decoding is a Language Model Inference Accelerator
by: Gat, Itai, et al.
Published: (2025)

Combining the flipped classroom and simulation games in engineering education: a methodological survey
by: Algayres, Muriel, et al.
Published: (2019)

Computational Modeling of the Segmentation of Sentence Stimuli From an Infant Word‐Finding Study
by: Daniel Swingley, et al.
Published: (2024)

Corrector Sampling in Language Models
by: Gat, Itai, et al.
Published: (2025)

Transition Matching: Scalable and Flexible Generative Modeling
by: Shaul, Neta, et al.
Published: (2025)

CHECK-MAT: Checking Hand-Written Mathematical Answers for the Russian Unified State Exam
by: Khrulev, Ruslan
Published: (2025)

Written Term Detection Improves Spoken Term Detection
by: Yusuf, Bolaji, et al.
Published: (2024)

Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation
by: Costa-jussà, Marta R., et al.
Published: (2024)

Linguini: A benchmark for language-agnostic linguistic reasoning
by: Sánchez, Eduardo, et al.
Published: (2024)

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation
by: Tal, Or, et al.
Published: (2024)

Edit Flows: Flow Matching with Edit Operations
by: Havasi, Marton, et al.
Published: (2025)

A French Version of the OLDI Seed Corpus
by: Marmonier, Malik, et al.
Published: (2025)

Tree of Problems: Improving structured problem solving with compositionality
by: Zebaze, Armel, et al.
Published: (2024)

Testing the Deliteralization Hypothesis in Human and Machine Translation
by: Marmonier, Malik, et al.
Published: (2026)

In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation
by: Zebaze, Armel, et al.
Published: (2024)

ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance
by: Antoun, Wissam, et al.
Published: (2025)

LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
by: Zebaze, Armel, et al.
Published: (2025)

Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
by: Riabi, Arij, et al.
Published: (2021)

Explicit Learning and the LLM in Machine Translation
by: Marmonier, Malik, et al.
Published: (2025)