Saved in:
| Main Authors: | Zeghidour, Neil, Kharitonov, Eugene, Orsini, Manu, Volhejn, Václav, de Marmiesse, Gabriel, Grave, Edouard, Pérez, Patrick, Mazaré, Laurent, Défossez, Alexandre |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.08753 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Moshi: a speech-text foundation model for real-time dialogue
by: Défossez, Alexandre, et al.
Published: (2024)
by: Défossez, Alexandre, et al.
Published: (2024)
High-Fidelity Simultaneous Speech-To-Speech Translation
by: Labiausse, Tom, et al.
Published: (2025)
by: Labiausse, Tom, et al.
Published: (2025)
MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models
by: Chien, Chung-Ming, et al.
Published: (2026)
by: Chien, Chung-Ming, et al.
Published: (2026)
Vision-Speech Models: Teaching Speech Models to Converse about Images
by: Royer, Amélie, et al.
Published: (2025)
by: Royer, Amélie, et al.
Published: (2025)
Aligning Spoken Dialogue Models from User Interactions
by: Wu, Anne, et al.
Published: (2025)
by: Wu, Anne, et al.
Published: (2025)
Continuous Audio Language Models
by: Rouard, Simon, et al.
Published: (2025)
by: Rouard, Simon, et al.
Published: (2025)
MAD Speech: Measures of Acoustic Diversity of Speech
by: Futeral, Matthieu, et al.
Published: (2024)
by: Futeral, Matthieu, et al.
Published: (2024)
Simultaneous Speech-to-Speech Translation Without Aligned Data
by: Labiausse, Tom, et al.
Published: (2026)
by: Labiausse, Tom, et al.
Published: (2026)
ARC-Encoder: learning compressed text representations for large language models
by: Pilchen, Hippolyte, et al.
Published: (2025)
by: Pilchen, Hippolyte, et al.
Published: (2025)
SequenceLayers: Sequence Processing and Streaming Neural Networks Made Easy
by: Skerry-Ryan, RJ, et al.
Published: (2025)
by: Skerry-Ryan, RJ, et al.
Published: (2025)
Streaming Sequence Transduction through Dynamic Compression
by: Tan, Weiting, et al.
Published: (2024)
by: Tan, Weiting, et al.
Published: (2024)
Understanding Data Temporality Impact on Large Language Models Pre-training
by: Pilchen, Hippolyte, et al.
Published: (2026)
by: Pilchen, Hippolyte, et al.
Published: (2026)
Neutral Residues: Revisiting Adapters for Model Extension
by: Talla, Franck Signe, et al.
Published: (2024)
by: Talla, Franck Signe, et al.
Published: (2024)
A Regular and Complete Notion of Delay for Streaming String Transducers
by: Filiot, Emmanuel, et al.
Published: (2022)
by: Filiot, Emmanuel, et al.
Published: (2022)
Long Sequence Modeling with Attention Tensorization: From Sequence to Tensor Learning
by: Feng, Aosong, et al.
Published: (2024)
by: Feng, Aosong, et al.
Published: (2024)
PLAID SHIRTTT for Large-Scale Streaming Dense Retrieval
by: Lawrie, Dawn, et al.
Published: (2024)
by: Lawrie, Dawn, et al.
Published: (2024)
Stream of Search (SoS): Learning to Search in Language
by: Gandhi, Kanishk, et al.
Published: (2024)
by: Gandhi, Kanishk, et al.
Published: (2024)
MURR: Model Updating with Regularized Replay for Searching a Document Stream
by: Yang, Eugene, et al.
Published: (2025)
by: Yang, Eugene, et al.
Published: (2025)
On Sequence-to-Sequence Models for Automated Log Parsing
by: Sorrenti, Adam, et al.
Published: (2026)
by: Sorrenti, Adam, et al.
Published: (2026)
LongStream: Long-Sequence Streaming Autoregressive Visual Geometry
by: Cheng, Chong, et al.
Published: (2026)
by: Cheng, Chong, et al.
Published: (2026)
Stream Types
by: Cutler, Joseph W., et al.
Published: (2023)
by: Cutler, Joseph W., et al.
Published: (2023)
Adaptive Test-Time Scaling for Zero-Shot Respiratory Audio Classification
by: Wang, Tsai-Ning, et al.
Published: (2026)
by: Wang, Tsai-Ning, et al.
Published: (2026)
Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
by: Su, Guinan, et al.
Published: (2026)
by: Su, Guinan, et al.
Published: (2026)
Sequence to Sequence Reward Modeling: Improving RLHF by Language Feedback
by: Zhou, Jiayi, et al.
Published: (2024)
by: Zhou, Jiayi, et al.
Published: (2024)
Sequence-to-Sequence Spanish Pre-trained Language Models
by: Araujo, Vladimir, et al.
Published: (2023)
by: Araujo, Vladimir, et al.
Published: (2023)
Why Any-Order Autoregressive Models Need Two-Stream Attention: A Structural-Semantic Tradeoff
by: Pynadath, Patrick, et al.
Published: (2026)
by: Pynadath, Patrick, et al.
Published: (2026)
Sequence graphs realizations and ambiguity in language models
by: Khalife, Sammy, et al.
Published: (2024)
by: Khalife, Sammy, et al.
Published: (2024)
Sequence-to-Sequence Language Models for Character and Emotion Detection in Dream Narratives
by: Cortal, Gustave
Published: (2024)
by: Cortal, Gustave
Published: (2024)
Large Language Models for Page Stream Segmentation
by: Heidenreich, Hunter, et al.
Published: (2024)
by: Heidenreich, Hunter, et al.
Published: (2024)
Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
by: Dat, Phan Tran Minh, et al.
Published: (2025)
by: Dat, Phan Tran Minh, et al.
Published: (2025)
StreamAdapter: Efficient Test Time Adaptation from Contextual Streams
by: Muhtar, Dilxat, et al.
Published: (2024)
by: Muhtar, Dilxat, et al.
Published: (2024)
Few-shot Cross-lingual Aspect-Based Sentiment Analysis with Sequence-to-Sequence Models
by: Šmíd, Jakub, et al.
Published: (2025)
by: Šmíd, Jakub, et al.
Published: (2025)
Streaming Tensor Programs: A Streaming Abstraction for Dynamic Parallelism
by: Sohn, Gina, et al.
Published: (2025)
by: Sohn, Gina, et al.
Published: (2025)
SpeakStream: Streaming Text-to-Speech with Interleaved Data
by: Bai, Richard He, et al.
Published: (2025)
by: Bai, Richard He, et al.
Published: (2025)
LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding
by: Tong, Junlong, et al.
Published: (2025)
by: Tong, Junlong, et al.
Published: (2025)
When Perplexity Lies: Generation-Focused Distillation of Hybrid Sequence Models
by: Kostelec, Juan Gabriel, et al.
Published: (2026)
by: Kostelec, Juan Gabriel, et al.
Published: (2026)
StreamUni: Achieving Streaming Speech Translation with a Unified Large Speech-Language Model
by: Guo, Shoutao, et al.
Published: (2025)
by: Guo, Shoutao, et al.
Published: (2025)
Learning Evaluation Models from Large Language Models for Sequence Generation
by: Wang, Chenglong, et al.
Published: (2023)
by: Wang, Chenglong, et al.
Published: (2023)
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch
by: Douillard, Arthur, et al.
Published: (2025)
by: Douillard, Arthur, et al.
Published: (2025)
On the "Induction Bias" in Sequence Models
by: Ebrahimi, M. Reza, et al.
Published: (2026)
by: Ebrahimi, M. Reza, et al.
Published: (2026)
Similar Items
-
Moshi: a speech-text foundation model for real-time dialogue
by: Défossez, Alexandre, et al.
Published: (2024) -
High-Fidelity Simultaneous Speech-To-Speech Translation
by: Labiausse, Tom, et al.
Published: (2025) -
MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models
by: Chien, Chung-Ming, et al.
Published: (2026) -
Vision-Speech Models: Teaching Speech Models to Converse about Images
by: Royer, Amélie, et al.
Published: (2025) -
Aligning Spoken Dialogue Models from User Interactions
by: Wu, Anne, et al.
Published: (2025)