:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guo, Shoutao, Zhang, Shaolei, Feng, Yang
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.03878
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Glancing Future for Simultaneous Machine Translation
by: Guo, Shoutao, et al.
Published: (2023)

StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning
by: Zhang, Shaolei, et al.
Published: (2024)

SiLLM: Large Language Models for Simultaneous Machine Translation
by: Guo, Shoutao, et al.
Published: (2024)

Agent-SiMT: Agent-assisted Simultaneous Machine Translation with Large Language Models
by: Guo, Shoutao, et al.
Published: (2024)

Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
by: Guo, Shoutao, et al.
Published: (2025)

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model
by: Zhang, Shaolei, et al.
Published: (2025)

A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Speech Translation
by: Ma, Zhengrui, et al.
Published: (2024)

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis
by: Fang, Qingkai, et al.
Published: (2025)

StreamUni: Achieving Streaming Speech Translation with a Unified Large Speech-Language Model
by: Guo, Shoutao, et al.
Published: (2025)

FastLongSpeech: Enhancing Large Speech-Language Models for Efficient Long-Speech Processing
by: Guo, Shoutao, et al.
Published: (2025)

BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment
by: Zhang, Shaolei, et al.
Published: (2024)

LLaMA-Omni: Seamless Speech Interaction with Large Language Models
by: Fang, Qingkai, et al.
Published: (2024)

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
by: Luo, Yingfeng, et al.
Published: (2025)

Decoder-only Architecture for Streaming End-to-end Speech Recognition
by: Tsunoo, Emiru, et al.
Published: (2024)

Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models
by: Yu, Tian, et al.
Published: (2024)

Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts
by: Yu, Tian, et al.
Published: (2024)

IG-Pruning: Input-Guided Block Pruning for Large Language Models
by: Qiao, Kangyu, et al.
Published: (2025)

StableMask: Refining Causal Masking in Decoder-only Transformer
by: Yin, Qingyu, et al.
Published: (2024)

Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2024)

Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024)

On the Hallucination in Simultaneous Machine Translation
by: Zhong, Meizhi, et al.
Published: (2024)

TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space
by: Zhang, Shaolei, et al.
Published: (2024)

Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?
by: Fang, Qingkai, et al.
Published: (2024)

Loss Masking Is Not Needed in Decoder-only Transformer for Discrete-token-based ASR
by: Chen, Qian, et al.
Published: (2023)

Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech Translation
by: Huang, Wuwei, et al.
Published: (2025)

R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation
by: Guo, Jiaxin, et al.
Published: (2024)

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
by: Zhang, Shaolei, et al.
Published: (2025)

AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment
by: Bu, Mengyu, et al.
Published: (2025)

Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality
by: Bu, Mengyu, et al.
Published: (2026)

Redefining Machine Simultaneous Interpretation: From Incremental Translation to Human-Like Strategies
by: Zhang, Qianen, et al.
Published: (2026)

Adding Multimodal Capabilities to a Text-only Translation Model
by: Vijayan, Vipin, et al.
Published: (2024)

Memorization in Attention-only Transformers
by: Dana, Léo, et al.
Published: (2024)

Accelerating Transformer Inference for Translation via Parallel Decoding
by: Santilli, Andrea, et al.
Published: (2023)

Decoding Partial Differential Equations: Cross-Modal Adaptation of Decoder-only Models to PDEs
by: García-de-Herreros, Paloma, et al.
Published: (2025)

DPO-Tuned Large Language Models for Segmentation in Simultaneous Speech Translation
by: Yang, Zeyu, et al.
Published: (2025)

KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs
by: Tang, Yixuan, et al.
Published: (2026)

DOA: Training-Free Decoder-Only Attention Policy for Long-Form Simultaneous Translation with SpeechLLMs
by: Papi, Sara, et al.
Published: (2026)

Segmentation-Free Streaming Machine Translation
by: Iranzo-Sánchez, Javier, et al.
Published: (2023)

Contrastive Feedback Mechanism for Simultaneous Speech Translation
by: Tan, Haotian, et al.
Published: (2024)

Simultaneous Machine Translation with Large Language Models
by: Wang, Minghan, et al.
Published: (2023)