| Main Authors: | Wang, Weixuan, Haddow, Barry, Wu, Minghao, Peng, Wei, Birch, Alexandra |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.09265 |
Similar Items
ExpertSteer: Intervening in LLMs through Expert Knowledge
by: Wang, Weixuan, et al.
Published: (2025)
Bridging the Language Gaps in Large Language Models with Inference-Time Cross-Lingual Intervention
by: Wang, Weixuan, et al.
Published: (2024)
HBO: Hierarchical Balancing Optimization for Fine-Tuning Large Language Models
by: Wang, Weixuan, et al.
Published: (2025)
Demystifying Multilingual Chain-of-Thought in Process Reward Modeling
by: Wang, Weixuan, et al.
Published: (2025)
Learning to Summarize by Learning to Quiz: Adversarial Agentic Collaboration for Long Document Summarization
by: Wang, Weixuan, et al.
Published: (2025)
Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task
by: Ranaldi, Leonardo, et al.
Published: (2025)
The Ups and Downs of Large Language Model Inference with Vocabulary Trimming by Language Heuristics
by: Bogoychev, Nikolay, et al.
Published: (2023)
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale
by: Baziotis, Christos, et al.
Published: (2023)
MGen: Millions of Naturally Occurring Generics in Context
by: Cilleruelo, Gustavo, et al.
Published: (2025)
Compact Speech Translation Models via Discrete Speech Units Pretraining
by: Lam, Tsz Kin, et al.
Published: (2024)
Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases
by: Zhou, Giulio, et al.
Published: (2024)
The Prosody of Emojis
by: Zhou, Giulio, et al.
Published: (2025)
Improving Multilingual Retrieval-Augmented Language Models through Dialectic Reasoning Argumentations
by: Ranaldi, Leonardo, et al.
Published: (2025)
Generics are puzzling. Can language models find the missing piece?
by: Calderón, Gustavo Cilleruelo, et al.
Published: (2024)
Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
by: Iyer, Vivek, et al.
Published: (2024)
Liaozhai through the Looking-Glass: On Paratextual Explicitation of Culture-Bound Terms in Machine Translation
by: Shen, Sherrie, et al.
Published: (2025)
In-game Toxic Language Detection: Shared Task and Attention Residuals
by: Jia, Yuanzhe, et al.
Published: (2022)
Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models
by: Stepachev, Pavel, et al.
Published: (2024)
Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors
by: Wang, Weixuan, et al.
Published: (2024)
Iterative Translation Refinement with Large Language Models
by: Chen, Pinzhen, et al.
Published: (2023)
Controlling What You Share: Assessing Language Model Adherence to Privacy Preferences
by: Ramírez, Guillem, et al.
Published: (2025)
Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?
by: Chen, Pinzhen, et al.
Published: (2024)
Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?
by: Zhu, Dawei, et al.
Published: (2024)
Kakugo: Distillation of Low-Resource Languages into Small Language Models
by: Devine, Peter, et al.
Published: (2026)
No Train but Gain: Language Arithmetic for training-free Language Adapters enhancement
by: Klimaszewski, Mateusz, et al.
Published: (2024)
MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization
by: O'Brien, Dayyán, et al.
Published: (2025)
Pitfalls and Outlooks in Using COMET
by: Zouhar, Vilém, et al.
Published: (2024)
Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection
by: Ramírez, Guillem, et al.
Published: (2024)
Catastrophic Forgetting in LLMs: A Comparative Analysis Across Language Tasks
by: Haque, Naimul
Published: (2025)
Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer
by: Cui, Chenhang, et al.
Published: (2026)
How Programming Concepts and Neurons Are Shared in Code Language Models
by: Kargaran, Amir Hossein, et al.
Published: (2025)
EuroLLM: Multilingual Language Models for Europe
by: Martins, Pedro Henrique, et al.
Published: (2024)
LF-Steering: Latent Feature Activation Steering for Enhancing Semantic Consistency in Large Language Models
by: Yang, Jingyuan, et al.
Published: (2025)
Findings of the WMT 2024 Shared Task on Discourse-Level Literary Translation
by: Wang, Longyue, et al.
Published: (2024)
Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation
by: Sperber, Matthias, et al.
Published: (2024)
Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca
by: Chen, Pinzhen, et al.
Published: (2023)
Identifying Good and Bad Neurons for Task-Level Controllable LLMs
by: Li, Wenjie, et al.
Published: (2026)
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
by: Wu, Zhaofeng, et al.
Published: (2024)
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison
by: Lam, Tsz Kin, et al.
Published: (2025)
DocHPLT: A Massively Multilingual Document-Level Translation Dataset
by: O'Brien, Dayyán, et al.
Published: (2025)