| Main Authors: | Guo, Shoutao, Zhang, Shaolei, Feng, Yang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2406.03878 |
Similar Items
Glancing Future for Simultaneous Machine Translation
by: Guo, Shoutao, et al.
Published: (2023)
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
by: Zhang, Shaolei, et al.
Published: (2024)
SiLLM: Large Language Models for Simultaneous Machine Translation
by: Guo, Shoutao, et al.
Published: (2024)
Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models
by: Guo, Shoutao, et al.
Published: (2024)
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
by: Guo, Shoutao, et al.
Published: (2025)
Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
by: Zhang, Shaolei, et al.
Published: (2025)
A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Speech Translation
by: Ma, Zhengrui, et al.
Published: (2024)
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
by: Fang, Qingkai, et al.
Published: (2025)
StreamUni: Achieving Streaming Speech Translation with a Unified Large Speech-Language Model
by: Guo, Shoutao, et al.
Published: (2025)
FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing
by: Guo, Shoutao, et al.
Published: (2025)
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment
by: Zhang, Shaolei, et al.
Published: (2024)
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
by: Fang, Qingkai, et al.
Published: (2024)
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
by: Luo, Yingfeng, et al.
Published: (2025)
Decoder-only Architecture for Streaming End-to-end Speech Recognition
by: Tsunoo, Emiru, et al.
Published: (2024)
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models
by: Yu, Tian, et al.
Published: (2024)
Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts
by: Yu, Tian, et al.
Published: (2024)
IG-Pruning: Input-Guided Block Pruning for Large Language Models
by: Qiao, Kangyu, et al.
Published: (2025)
StableMask: Refining Causal Masking in Decoder-only Transformer
by: Yin, Qingyu, et al.
Published: (2024)
Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2024)
Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024)
On the Hallucination in Simultaneous Machine Translation
by: Zhong, Meizhi, et al.
Published: (2024)
TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space
by: Zhang, Shaolei, et al.
Published: (2024)
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
by: Fang, Qingkai, et al.
Published: (2024)
Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token-based ASR
by: Chen, Qian, et al.
Published: (2023)
Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation
by: Huang, Wuwei, et al.
Published: (2025)
R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation
by: Guo, Jiaxin, et al.
Published: (2024)
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
by: Zhang, Shaolei, et al.
Published: (2025)
AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment
by: Bu, Mengyu, et al.
Published: (2025)
Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality
by: Bu, Mengyu, et al.
Published: (2026)
Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies
by: Zhang, Qianen, et al.
Published: (2026)
Adding Multimodal Capabilities to a Text-only Translation Model
by: Vijayan, Vipin, et al.
Published: (2024)
Memorization in Attention-only Transformers
by: Dana, Léo, et al.
Published: (2024)
Accelerating Transformer Inference for Translation via Parallel Decoding
by: Santilli, Andrea, et al.
Published: (2023)
Decoding Partial Differential Equations: Cross-Modal Adaptation of Decoder-only Models to PDEs
by: García-de-Herreros, Paloma, et al.
Published: (2025)
DPO-Tuned Large Language Models for Segmentation in Simultaneous Speech Translation
by: Yang, Zeyu, et al.
Published: (2025)
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs
by: Tang, Yixuan, et al.
Published: (2026)
DOA: Training-Free Decoder-Only Attention Policy for Long-Form Simultaneous Translation with SpeechLLMs
by: Papi, Sara, et al.
Published: (2026)
Segmentation-Free Streaming Machine Translation
by: Iranzo-Sánchez, Javier, et al.
Published: (2023)
Contrastive Feedback Mechanism for Simultaneous Speech Translation
by: Tan, Haotian, et al.
Published: (2024)
Simultaneous Machine Translation with Large Language Models
by: Wang, Minghan, et al.
Published: (2023)