| Main Authors: | Dankers, Verna, Raunak, Vikas |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.01491 |
Similar Items
On Instruction-Finetuning Neural Machine Translation Models
by: Raunak, Vikas, et al.
Published: (2024)
Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
by: Dankers, Verna, et al.
Published: (2024)
SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window
by: Raunak, Vikas, et al.
Published: (2023)
Evolving Knowledge Distillation for Lightweight Neural Machine Translation
by: Zhang, Xuewen, et al.
Published: (2026)
Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation
by: Zhang, Songming, et al.
Published: (2023)
Memorization Dynamics in Knowledge Distillation for Language Models
by: Borkar, Jaydeep, et al.
Published: (2026)
Self-Vocabularizing Training for Neural Machine Translation
by: Lin, Pin-Jie, et al.
Published: (2025)
Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation
by: Jin, Heegon, et al.
Published: (2024)
Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation
by: Huang, Chenyang, et al.
Published: (2025)
Self-Evolution Knowledge Distillation for LLM-based Machine Translation
by: Song, Yuncheng, et al.
Published: (2024)
Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
by: Batsuren, Khuyagbaatar, et al.
Published: (2024)
KD4MT: A Survey of Knowledge Distillation for Machine Translation
by: de Gibert, Ona, et al.
Published: (2026)
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
by: Zhou, Yuhang, et al.
Published: (2024)
From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
by: Jain, Raunak
Published: (2026)
Zero-shot Factual Consistency Evaluation Across Domains
by: Agarwal, Raunak
Published: (2024)
Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support
by: Jain, Raunak
Published: (2025)
Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
by: Wu, Minghao, et al.
Published: (2024)
Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation
by: Nourbakhsh, Aria, et al.
Published: (2026)
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation
by: Jayawardena, Lasal, et al.
Published: (2024)
MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
by: Li, Jiahuan, et al.
Published: (2024)
Weight-Inherited Distillation for Task-Agnostic BERT Compression
by: Wu, Taiqiang, et al.
Published: (2023)
Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs
by: Attieh, Joseph, et al.
Published: (2026)
Don't Throw Away Data: Better Sequence Knowledge Distillation
by: Wang, Jun, et al.
Published: (2024)
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
by: Halder, Deepon, et al.
Published: (2025)
SoK: Measuring What Matters for Closed-Loop Security Agents
by: Khurana, Mudita, et al.
Published: (2025)
Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation
by: Wei, Jingxuan, et al.
Published: (2024)
A Lightweight Method to Disrupt Memorized Sequences in LLM
by: Prashant, Parjanya Prajakta, et al.
Published: (2025)
Memorization and Knowledge Injection in Gated LLMs
by: Pan, Xu, et al.
Published: (2025)
Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation
by: Myung, Jiyoon, et al.
Published: (2024)
Sequence Shortening for Context-Aware Machine Translation
by: Mąka, Paweł, et al.
Published: (2024)
Distilling Event Sequence Knowledge From Large Language Models
by: Wadhwa, Somin, et al.
Published: (2024)
Neuron-Level Differentiation of Memorization and Generalization in Large Language Models
by: Huang, Ko-Wei, et al.
Published: (2024)
Span-Level Machine Translation Meta-Evaluation
by: Perrella, Stefano, et al.
Published: (2026)
Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
by: Zhang, Zhaohan, et al.
Published: (2024)
Evaluating Structural Generalization in Neural Machine Translation
by: Kumon, Ryoma, et al.
Published: (2024)
DOLFIN -- Document-Level Financial test set for Machine Translation
by: Nakhlé, Mariam, et al.
Published: (2025)
Adapting Large Language Models for Document-Level Machine Translation
by: Wu, Minghao, et al.
Published: (2024)
Retrieval-Augmented Machine Translation with Unstructured Knowledge
by: Wang, Jiaan, et al.
Published: (2024)
EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer
by: Zhang, Hao, et al.
Published: (2026)
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
by: Zhang, Yuwei, et al.
Published: (2025)