:: Library Catalog

Bibliographic Details
Main Authors:	Cao, Zhiwei; Cao, Qian; Lu, Yu; Peng, Ningxin; Huang, Luyang; Cheng, Shanbo; Su, Jinsong
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2406.02376

Similar Items

G-DIG: Towards Gradient-based Diverse and High-quality Instruction Data Selection for Machine Translation
by: Pan, Xingyuan, et al.
Published: (2024)

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
by: Cheng, Shanbo, et al.
Published: (2024)

500xCompressor: Generalized Prompt Compression for Large Language Models
by: Li, Zongqian, et al.
Published: (2024)

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters
by: Cheng, Shanbo, et al.
Published: (2025)

EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
by: Yang, Sen, et al.
Published: (2025)

GRRM: Group Relative Reward Modeling for Machine Translation
by: Yang, Sen, et al.
Published: (2026)

From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition
by: Wang, Tianduo, et al.
Published: (2025)

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
by: Li, Jiahuan, et al.
Published: (2024)

Response Enhanced Semi-supervised Dialogue Query Generation
by: Huang, Jianheng, et al.
Published: (2023)

Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
by: Gao, Yan, et al.
Published: (2024)

Mixture Compressor for Mixture-of-Experts LLMs Gains More
by: Huang, Wei, et al.
Published: (2024)

Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data
by: Zou, Wei, et al.
Published: (2025)

Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios
by: Tang, Jiwei, et al.
Published: (2024)

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions
by: Li, Jiahuan, et al.
Published: (2023)

TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation
by: Yang, Huanqi, et al.
Published: (2024)

How Large Language Models (LLMs) Extrapolate: From Guided Missiles to Guided Prompts
by: Cao, Xuenan
Published: (2024)

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization
by: She, Shuaijie, et al.
Published: (2025)

SeqPO-SiMT: Sequential Policy Optimization for Simultaneous Machine Translation
by: Xu, Ting, et al.
Published: (2025)

Implicit Bias in LLMs: A Survey
by: Lin, Xinru, et al.
Published: (2025)

AttnComp: Attention-Guided Adaptive Context Compression for Retrieval-Augmented Generation
by: Luo, Lvzhou, et al.
Published: (2025)

FLRC: Fine-grained Low-Rank Compressor for Efficient LLM Inference
by: Lu, Yu-Chen, et al.
Published: (2025)

Attn-GS: Attention-Guided Context Compression for Efficient Personalized LLMs
by: Zeng, Shenglai, et al.
Published: (2026)

Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression
by: Wang, Haoyu, et al.
Published: (2025)

INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection
by: Chen, Chao, et al.
Published: (2024)

Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio
by: Yu, Yijiong, et al.
Published: (2026)

Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference
by: Xiao, Qingfa, et al.
Published: (2025)

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
by: Lin, Zhenghao, et al.
Published: (2025)

Mitigating the Negative Impact of Over-association for Conversational Query Production
by: Wang, Ante, et al.
Published: (2024)

Empirical-MCTS: Continuous Agent Evolution via Dual-Experience Monte Carlo Tree Search
by: Lu, Hao, et al.
Published: (2026)

Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice
by: Cheng, Shanbo, et al.
Published: (2025)

VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization
by: Wang, Yixuan, et al.
Published: (2026)

Assessing the potential of LLM-assisted annotation for corpus-based pragmatics and discourse analysis: The case of apology
by: Yu, Danni, et al.
Published: (2023)

Faithfulness Evaluation for Decoder-only LLM Attributions with Controlled Retained Information
by: Huang, Xin, et al.
Published: (2026)

ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation
by: Li, Kunquan, et al.
Published: (2026)

QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression
by: Wang, Wenshan, et al.
Published: (2024)

Understanding the Physics of Key-Value Cache Compression for LLMs through Attention Dynamics
by: Ananthanarayanan, Samhruth, et al.
Published: (2026)

Large Language Model as Token Compressor and Decompressor
by: Li, Wenbing, et al.
Published: (2026)

The Compressor-Retriever Architecture for Language Model OS
by: Yang, Yuan, et al.
Published: (2024)

On the Information Redundancy in Non-Autoregressive Translation
by: Wang, Zhihao, et al.
Published: (2024)

STDec: Spatio-Temporal Stability Guided Decoding for dLLMs
by: Chen, Yuzhe, et al.
Published: (2026)