Saved in:
| Main Authors: | Nguyen, Tu Anh, Muller, Benjamin, Yu, Bokai, Costa-jussa, Marta R., Elbayad, Maha, Popuri, Sravya, Ropers, Christophe, Duquenne, Paul-Ambroise, Algayres, Robin, Mavlyutov, Ruslan, Gat, Itai, Williamson, Mary, Synnaeve, Gabriel, Pino, Juan, Sagot, Benoit, Dupoux, Emmanuel |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.05755 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024)
by: Huang, Chao-Wei, et al.
Published: (2024)
Merging Text Transformer Models from Different Initializations
by: Verma, Neha, et al.
Published: (2024)
by: Verma, Neha, et al.
Published: (2024)
BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?
by: Chambon, Pierre, et al.
Published: (2025)
by: Chambon, Pierre, et al.
Published: (2025)
SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision
by: Poli, Maxime, et al.
Published: (2025)
by: Poli, Maxime, et al.
Published: (2025)
Unified Vision-Language Modeling via Concept Space Alignment
by: Qiu, Yifu, et al.
Published: (2026)
by: Qiu, Yifu, et al.
Published: (2026)
Interference Matrix: Quantifying Cross-Lingual Interference in Transformer Encoders
by: Alastruey, Belen, et al.
Published: (2025)
by: Alastruey, Belen, et al.
Published: (2025)
Towards Massive Multilingual Holistic Bias
by: Tan, Xiaoqing Ellen, et al.
Published: (2024)
by: Tan, Xiaoqing Ellen, et al.
Published: (2024)
Large Concept Models: Language Modeling in a Sentence Representation Space
by: LCM team, et al.
Published: (2024)
by: LCM team, et al.
Published: (2024)
Textually Pretrained Speech Language Models
by: Hassid, Michael, et al.
Published: (2023)
by: Hassid, Michael, et al.
Published: (2023)
Simple and Controllable Music Generation
by: Copet, Jade, et al.
Published: (2023)
by: Copet, Jade, et al.
Published: (2023)
Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach
by: Poli, Maxime, et al.
Published: (2024)
by: Poli, Maxime, et al.
Published: (2024)
Discrete Flow Matching
by: Gat, Itai, et al.
Published: (2024)
by: Gat, Itai, et al.
Published: (2024)
LongTail-Swap: benchmarking language models' abilities on rare words
by: Algayres, Robin, et al.
Published: (2025)
by: Algayres, Robin, et al.
Published: (2025)
Spoken to Spoken vs. Spoken to Written: Corpus Approach to Exploring Interpreting and Subtitling
by: Mikhail Mikhailov
Published: (2010)
by: Mikhail Mikhailov
Published: (2010)
VUGEN: Visual Understanding priors for GENeration
by: Chen, Xiangyi, et al.
Published: (2025)
by: Chen, Xiangyi, et al.
Published: (2025)
Text-Guided Semantic Image Encoder
by: Thirukovalluru, Raghuveer, et al.
Published: (2025)
by: Thirukovalluru, Raghuveer, et al.
Published: (2025)
An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis
by: Peng, Yifan, et al.
Published: (2024)
by: Peng, Yifan, et al.
Published: (2024)
MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation
by: Peng, Yifan, et al.
Published: (2024)
by: Peng, Yifan, et al.
Published: (2024)
Masked Audio Generation using a Single Non-Autoregressive Transformer
by: Ziv, Alon, et al.
Published: (2024)
by: Ziv, Alon, et al.
Published: (2024)
Chapter C7 Spoken and Written Performatives
by: Durant, Alan, et al.
Published: (2021)
by: Durant, Alan, et al.
Published: (2021)
2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset
by: Costa-jussà, Marta R., et al.
Published: (2024)
by: Costa-jussà, Marta R., et al.
Published: (2024)
Set Block Decoding is a Language Model Inference Accelerator
by: Gat, Itai, et al.
Published: (2025)
by: Gat, Itai, et al.
Published: (2025)
Combining the flipped classroom and simulation games in engineering education: a methodological survey
by: Algayres, Muriel, et al.
Published: (2019)
by: Algayres, Muriel, et al.
Published: (2019)
Computational Modeling of the Segmentation of Sentence Stimuli From an Infant Word‐Finding Study
by: Daniel Swingley, et al.
Published: (2024)
by: Daniel Swingley, et al.
Published: (2024)
Corrector Sampling in Language Models
by: Gat, Itai, et al.
Published: (2025)
by: Gat, Itai, et al.
Published: (2025)
Transition Matching: Scalable and Flexible Generative Modeling
by: Shaul, Neta, et al.
Published: (2025)
by: Shaul, Neta, et al.
Published: (2025)
CHECK-MAT: Checking Hand-Written Mathematical Answers for the Russian Unified State Exam
by: Khrulev, Ruslan
Published: (2025)
by: Khrulev, Ruslan
Published: (2025)
Written Term Detection Improves Spoken Term Detection
by: Yusuf, Bolaji, et al.
Published: (2024)
by: Yusuf, Bolaji, et al.
Published: (2024)
Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation
by: Costa-jussà, Marta R., et al.
Published: (2024)
by: Costa-jussà, Marta R., et al.
Published: (2024)
Linguini: A benchmark for language-agnostic linguistic reasoning
by: Sánchez, Eduardo, et al.
Published: (2024)
by: Sánchez, Eduardo, et al.
Published: (2024)
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation
by: Tal, Or, et al.
Published: (2024)
by: Tal, Or, et al.
Published: (2024)
Edit Flows: Flow Matching with Edit Operations
by: Havasi, Marton, et al.
Published: (2025)
by: Havasi, Marton, et al.
Published: (2025)
A French Version of the OLDI Seed Corpus
by: Marmonier, Malik, et al.
Published: (2025)
by: Marmonier, Malik, et al.
Published: (2025)
Tree of Problems: Improving structured problem solving with compositionality
by: Zebaze, Armel, et al.
Published: (2024)
by: Zebaze, Armel, et al.
Published: (2024)
Testing the Deliteralization Hypothesis in Human and Machine Translation
by: Marmonier, Malik, et al.
Published: (2026)
by: Marmonier, Malik, et al.
Published: (2026)
In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation
by: Zebaze, Armel, et al.
Published: (2024)
by: Zebaze, Armel, et al.
Published: (2024)
ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance
by: Antoun, Wissam, et al.
Published: (2025)
by: Antoun, Wissam, et al.
Published: (2025)
LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
by: Zebaze, Armel, et al.
Published: (2025)
by: Zebaze, Armel, et al.
Published: (2025)
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
by: Riabi, Arij, et al.
Published: (2021)
by: Riabi, Arij, et al.
Published: (2021)
Explicit Learning and the LLM in Machine Translation
by: Marmonier, Malik, et al.
Published: (2025)
by: Marmonier, Malik, et al.
Published: (2025)
Similar Items
-
Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024) -
Merging Text Transformer Models from Different Initializations
by: Verma, Neha, et al.
Published: (2024) -
BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?
by: Chambon, Pierre, et al.
Published: (2025) -
SpidR: Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision
by: Poli, Maxime, et al.
Published: (2025) -
Unified Vision-Language Modeling via Concept Space Alignment
by: Qiu, Yifu, et al.
Published: (2026)