Saved in:
| Main Authors: | Cao, Zhiwei, Cao, Qian, Lu, Yu, Peng, Ningxin, Huang, Luyang, Cheng, Shanbo, Su, Jinsong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.02376 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
G-DIG: Towards Gradient-based Diverse and High-quality Instruction Data Selection for Machine Translation
by: Pan, Xingyuan, et al.
Published: (2024)
by: Pan, Xingyuan, et al.
Published: (2024)
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
by: Cheng, Shanbo, et al.
Published: (2024)
by: Cheng, Shanbo, et al.
Published: (2024)
500xCompressor: Generalized Prompt Compression for Large Language Models
by: Li, Zongqian, et al.
Published: (2024)
by: Li, Zongqian, et al.
Published: (2024)
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters
by: Cheng, Shanbo, et al.
Published: (2025)
by: Cheng, Shanbo, et al.
Published: (2025)
EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
by: Yang, Sen, et al.
Published: (2025)
by: Yang, Sen, et al.
Published: (2025)
GRRM: Group Relative Reward Modeling for Machine Translation
by: Yang, Sen, et al.
Published: (2026)
by: Yang, Sen, et al.
Published: (2026)
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition
by: Wang, Tianduo, et al.
Published: (2025)
by: Wang, Tianduo, et al.
Published: (2025)
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
by: Li, Jiahuan, et al.
Published: (2024)
by: Li, Jiahuan, et al.
Published: (2024)
Response Enhanced Semi-supervised Dialogue Query Generation
by: Huang, Jianheng, et al.
Published: (2023)
by: Huang, Jianheng, et al.
Published: (2023)
Efficient k-Nearest-Neighbor Machine Translation with Dynamic Retrieval
by: Gao, Yan, et al.
Published: (2024)
by: Gao, Yan, et al.
Published: (2024)
Mixture Compressor for Mixture-of-Experts LLMs Gains More
by: Huang, Wei, et al.
Published: (2024)
by: Huang, Wei, et al.
Published: (2024)
Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data
by: Zou, Wei, et al.
Published: (2025)
by: Zou, Wei, et al.
Published: (2025)
Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios
by: Tang, Jiwei, et al.
Published: (2024)
by: Tang, Jiwei, et al.
Published: (2024)
Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions
by: Li, Jiahuan, et al.
Published: (2023)
by: Li, Jiahuan, et al.
Published: (2023)
TransCompressor: LLM-Powered Multimodal Data Compression for Smart Transportation
by: Yang, Huanqi, et al.
Published: (2024)
by: Yang, Huanqi, et al.
Published: (2024)
How Large Language Models (LLMs) Extrapolate: From Guided Missiles to Guided Prompts
by: Cao, Xuenan
Published: (2024)
by: Cao, Xuenan
Published: (2024)
DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization
by: She, Shuaijie, et al.
Published: (2025)
by: She, Shuaijie, et al.
Published: (2025)
SeqPO-SiMT: Sequential Policy Optimization for Simultaneous Machine Translation
by: Xu, Ting, et al.
Published: (2025)
by: Xu, Ting, et al.
Published: (2025)
Implicit Bias in LLMs: A Survey
by: Lin, Xinru, et al.
Published: (2025)
by: Lin, Xinru, et al.
Published: (2025)
AttnComp: Attention-Guided Adaptive Context Compression for Retrieval-Augmented Generation
by: Luo, Lvzhou, et al.
Published: (2025)
by: Luo, Lvzhou, et al.
Published: (2025)
FLRC: Fine-grained Low-Rank Compressor for Efficient LLM Inference
by: Lu, Yu-Chen, et al.
Published: (2025)
by: Lu, Yu-Chen, et al.
Published: (2025)
Attn-GS: Attention-Guided Context Compression for Efficient Personalized LLMs
by: Zeng, Shenglai, et al.
Published: (2026)
by: Zeng, Shenglai, et al.
Published: (2026)
Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection
by: Chen, Chao, et al.
Published: (2024)
by: Chen, Chao, et al.
Published: (2024)
Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio
by: Yu, Yijiong, et al.
Published: (2026)
by: Yu, Yijiong, et al.
Published: (2026)
Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference
by: Xiao, Qingfa, et al.
Published: (2025)
by: Xiao, Qingfa, et al.
Published: (2025)
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
by: Lin, Zhenghao, et al.
Published: (2025)
by: Lin, Zhenghao, et al.
Published: (2025)
Mitigating the Negative Impact of Over-association for Conversational Query Production
by: Wang, Ante, et al.
Published: (2024)
by: Wang, Ante, et al.
Published: (2024)
Empirical-MCTS: Continuous Agent Evolution via Dual-Experience Monte Carlo Tree Search
by: Lu, Hao, et al.
Published: (2026)
by: Lu, Hao, et al.
Published: (2026)
Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice
by: Cheng, Shanbo, et al.
Published: (2025)
by: Cheng, Shanbo, et al.
Published: (2025)
VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization
by: Wang, Yixuan, et al.
Published: (2026)
by: Wang, Yixuan, et al.
Published: (2026)
Assessing the potential of LLM-assisted annotation for corpus-based pragmatics and discourse analysis: The case of apology
by: Yu, Danni, et al.
Published: (2023)
by: Yu, Danni, et al.
Published: (2023)
Faithfulness Evaluation for Decoder-only LLM Attributions with Controlled Retained Information
by: Huang, Xin, et al.
Published: (2026)
by: Huang, Xin, et al.
Published: (2026)
ReflectMT: Internalizing Reflection for Efficient and High-Quality Machine Translation
by: Li, Kunquan, et al.
Published: (2026)
by: Li, Kunquan, et al.
Published: (2026)
QUITO: Accelerating Long-Context Reasoning through Query-Guided Context Compression
by: Wang, Wenshan, et al.
Published: (2024)
by: Wang, Wenshan, et al.
Published: (2024)
Understanding the Physics of Key-Value Cache Compression for LLMs through Attention Dynamics
by: Ananthanarayanan, Samhruth, et al.
Published: (2026)
by: Ananthanarayanan, Samhruth, et al.
Published: (2026)
Large Language Model as Token Compressor and Decompressor
by: Li, Wenbing, et al.
Published: (2026)
by: Li, Wenbing, et al.
Published: (2026)
The Compressor-Retriever Architecture for Language Model OS
by: Yang, Yuan, et al.
Published: (2024)
by: Yang, Yuan, et al.
Published: (2024)
On the Information Redundancy in Non-Autoregressive Translation
by: Wang, Zhihao, et al.
Published: (2024)
by: Wang, Zhihao, et al.
Published: (2024)
STDec: Spatio-Temporal Stability Guided Decoding for dLLMs
by: Chen, Yuzhe, et al.
Published: (2026)
by: Chen, Yuzhe, et al.
Published: (2026)
Similar Items
-
G-DIG: Towards Gradient-based Diverse and High-quality Instruction Data Selection for Machine Translation
by: Pan, Xingyuan, et al.
Published: (2024) -
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
by: Cheng, Shanbo, et al.
Published: (2024) -
500xCompressor: Generalized Prompt Compression for Large Language Models
by: Li, Zongqian, et al.
Published: (2024) -
Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters
by: Cheng, Shanbo, et al.
Published: (2025) -
EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
by: Yang, Sen, et al.
Published: (2025)