:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Dankers, Verna, Raunak, Vikas
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.01491
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

On Instruction-Finetuning Neural Machine Translation Models
by: Raunak, Vikas, et al.
Published: (2024)

Generalisation First, Memorisation Second? Memorisation Localisation for Natural Language Classification Tasks
by: Dankers, Verna, et al.
Published: (2024)

SLIDE: Reference-free Evaluation for Machine Translation using a Sliding Document Window
by: Raunak, Vikas, et al.
Published: (2023)

Evolving Knowledge Distillation for Lightweight Neural Machine Translation
by: Zhang, Xuewen, et al.
Published: (2026)

Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation
by: Zhang, Songming, et al.
Published: (2023)

Memorization Dynamics in Knowledge Distillation for Language Models
by: Borkar, Jaydeep, et al.
Published: (2026)

Self-Vocabularizing Training for Neural Machine Translation
by: Lin, Pin-Jie, et al.
Published: (2025)

Align-to-Distill: Trainable Attention Alignment for Knowledge Distillation in Neural Machine Translation
by: Jin, Heegon, et al.
Published: (2024)

Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation
by: Huang, Chenyang, et al.
Published: (2025)

Self-Evolution Knowledge Distillation for LLM-based Machine Translation
by: Song, Yuncheng, et al.
Published: (2024)

Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
by: Batsuren, Khuyagbaatar, et al.
Published: (2024)

KD4MT: A Survey of Knowledge Distillation for Machine Translation
by: de Gibert, Ona, et al.
Published: (2026)

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
by: Zhou, Yuhang, et al.
Published: (2024)

From Sycophancy to Sensemaking: Premise Governance for Human-AI Decision Making
by: Jain, Raunak
Published: (2026)

Zero-shot Factual Consistency Evaluation Across Domains
by: Agarwal, Raunak
Published: (2024)

Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support
by: Jain, Raunak
Published: (2025)

Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
by: Wu, Minghao, et al.
Published: (2024)

Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation
by: Nourbakhsh, Aria, et al.
Published: (2026)

Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation
by: Jayawardena, Lasal, et al.
Published: (2024)

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation
by: Li, Jiahuan, et al.
Published: (2024)

Weight-Inherited Distillation for Task-Agnostic BERT Compression
by: Wu, Taiqiang, et al.
Published: (2023)

Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs
by: Attieh, Joseph, et al.
Published: (2026)

Don't Throw Away Data: Better Sequence Knowledge Distillation
by: Wang, Jun, et al.
Published: (2024)

CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
by: Halder, Deepon, et al.
Published: (2025)

SoK: Measuring What Matters for Closed-Loop Security Agents
by: Khurana, Mudita, et al.
Published: (2025)

Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation
by: Wei, Jingxuan, et al.
Published: (2024)

A Lightweight Method to Disrupt Memorized Sequences in LLM
by: Prashant, Parjanya Prajakta, et al.
Published: (2025)

Memorization and Knowledge Injection in Gated LLMs
by: Pan, Xu, et al.
Published: (2025)

Efficient Technical Term Translation: A Knowledge Distillation Approach for Parenthetical Terminology Translation
by: Myung, Jiyoon, et al.
Published: (2024)

Sequence Shortening for Context-Aware Machine Translation
by: Mąka, Paweł, et al.
Published: (2024)

Distilling Event Sequence Knowledge From Large Language Models
by: Wadhwa, Somin, et al.
Published: (2024)

Neuron-Level Differentiation of Memorization and Generalization in Large Language Models
by: Huang, Ko-Wei, et al.
Published: (2024)

Span-Level Machine Translation Meta-Evaluation
by: Perrella, Stefano, et al.
Published: (2026)

Get Confused Cautiously: Textual Sequence Memorization Erasure with Selective Entropy Maximization
by: Zhang, Zhaohan, et al.
Published: (2024)

Evaluating Structural Generalization in Neural Machine Translation
by: Kumon, Ryoma, et al.
Published: (2024)

DOLFIN -- Document-Level Financial test set for Machine Translation
by: Nakhlé, Mariam, et al.
Published: (2025)

Adapting Large Language Models for Document-Level Machine Translation
by: Wu, Minghao, et al.
Published: (2024)

Retrieval-Augmented Machine Translation with Unstructured Knowledge
by: Wang, Jiaan, et al.
Published: (2024)

EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer
by: Zhang, Hao, et al.
Published: (2026)

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
by: Zhang, Yuwei, et al.
Published: (2025)