Saved in:
| Main Authors: | Xu, Haoran, Sharaf, Amr, Chen, Yunmo, Tan, Weiting, Shen, Lingfeng, Van Durme, Benjamin, Murray, Kenton, Kim, Young Jin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.08417 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
by: Xu, Haoran, et al.
Published: (2023)
by: Xu, Haoran, et al.
Published: (2023)
Streaming Sequence Transduction through Dynamic Compression
by: Tan, Weiting, et al.
Published: (2024)
by: Tan, Weiting, et al.
Published: (2024)
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
by: Li, Tianjian, et al.
Published: (2024)
by: Li, Tianjian, et al.
Published: (2024)
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
by: Shen, Lingfeng, et al.
Published: (2024)
by: Shen, Lingfeng, et al.
Published: (2024)
DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
by: Tan, Weiting, et al.
Published: (2024)
by: Tan, Weiting, et al.
Published: (2024)
Unified Multimodal Uncertain Inference
by: Zhang, Dengjia, et al.
Published: (2026)
by: Zhang, Dengjia, et al.
Published: (2026)
MultiMUC: Multilingual Template Filling on MUC-4
by: Gantt, William, et al.
Published: (2024)
by: Gantt, William, et al.
Published: (2024)
Learning to Retrieve Iteratively for In-Context Learning
by: Chen, Yunmo, et al.
Published: (2024)
by: Chen, Yunmo, et al.
Published: (2024)
HLTCOE Evaluation Team at TREC 2025: VQA Track
by: Zhang, Dengjia, et al.
Published: (2025)
by: Zhang, Dengjia, et al.
Published: (2025)
Exploring Representational Disparities Between Multilingual and Bilingual Translation Models
by: Verma, Neha, et al.
Published: (2023)
by: Verma, Neha, et al.
Published: (2023)
X-ALMA: Plug & Play Modules and Adaptive Rejection for Quality Translation at Scale
by: Xu, Haoran, et al.
Published: (2024)
by: Xu, Haoran, et al.
Published: (2024)
The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
by: Zhang, Jingyu, et al.
Published: (2025)
by: Zhang, Jingyu, et al.
Published: (2025)
It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF
by: Lu, Taiming, et al.
Published: (2024)
by: Lu, Taiming, et al.
Published: (2024)
RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation
by: Fleshman, William, et al.
Published: (2024)
by: Fleshman, William, et al.
Published: (2024)
Bonsai: Interpretable Tree-Adaptive Grounded Reasoning
by: Sanders, Kate, et al.
Published: (2025)
by: Sanders, Kate, et al.
Published: (2025)
SEQR: Secure and Efficient QR-based LoRA Routing
by: Fleshman, William, et al.
Published: (2025)
by: Fleshman, William, et al.
Published: (2025)
LoRA-Augmented Generation (LAG) for Knowledge-Intensive Language Tasks
by: Fleshman, William, et al.
Published: (2025)
by: Fleshman, William, et al.
Published: (2025)
RE-Adapt: Reverse Engineered Adaptation of Large Language Models
by: Fleshman, William, et al.
Published: (2024)
by: Fleshman, William, et al.
Published: (2024)
Compactor: Calibrated Query-Agnostic KV Cache Compression with Approximate Leverage Scores
by: Chari, Vivek, et al.
Published: (2025)
by: Chari, Vivek, et al.
Published: (2025)
A Survey of Video Datasets for Grounded Event Understanding
by: Sanders, Kate, et al.
Published: (2024)
by: Sanders, Kate, et al.
Published: (2024)
SpectR: Dynamically Composing LM Experts with Spectral Routing
by: Fleshman, William, et al.
Published: (2025)
by: Fleshman, William, et al.
Published: (2025)
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations
by: Cheng, Jeffrey, et al.
Published: (2024)
by: Cheng, Jeffrey, et al.
Published: (2024)
DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging
by: Verma, Neha, et al.
Published: (2025)
by: Verma, Neha, et al.
Published: (2025)
LLMs Provide Unstable Answers to Legal Questions
by: Blair-Stanek, Andrew, et al.
Published: (2025)
by: Blair-Stanek, Andrew, et al.
Published: (2025)
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models
by: Li, Tianjian, et al.
Published: (2023)
by: Li, Tianjian, et al.
Published: (2023)
Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
by: Reddy, Arun, et al.
Published: (2025)
by: Reddy, Arun, et al.
Published: (2025)
Process Supervision of Confidence Margin for Calibrated LLM Reasoning
by: Wang, Liaoyaqi, et al.
Published: (2026)
by: Wang, Liaoyaqi, et al.
Published: (2026)
Certified Mitigation of Worst-Case LLM Copyright Infringement
by: Zhang, Jingyu, et al.
Published: (2025)
by: Zhang, Jingyu, et al.
Published: (2025)
CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming
by: TehraniJamsaz, Ali, et al.
Published: (2024)
by: TehraniJamsaz, Ali, et al.
Published: (2024)
SocialNLI: A Dialogue-Centric Social Inference Dataset
by: Deo, Akhil, et al.
Published: (2025)
by: Deo, Akhil, et al.
Published: (2025)
LM Agents for Coordinating Multi-User Information Gathering
by: Jhamtani, Harsh, et al.
Published: (2025)
by: Jhamtani, Harsh, et al.
Published: (2025)
A Replicability Study of XTR
by: Jha, Rohan, et al.
Published: (2026)
by: Jha, Rohan, et al.
Published: (2026)
NevIR: Negation in Neural Information Retrieval
by: Weller, Orion, et al.
Published: (2023)
by: Weller, Orion, et al.
Published: (2023)
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering
by: Jurayj, William, et al.
Published: (2025)
by: Jurayj, William, et al.
Published: (2025)
Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers
by: Samuel, Saron, et al.
Published: (2026)
by: Samuel, Saron, et al.
Published: (2026)
The NLP Task Effectiveness of Long-Range Transformers
by: Qin, Guanghui, et al.
Published: (2022)
by: Qin, Guanghui, et al.
Published: (2022)
Crystal: Characterizing Relative Impact of Scholarly Publications
by: Collison, Hannah, et al.
Published: (2026)
by: Collison, Hannah, et al.
Published: (2026)
MegaWika 2: A More Comprehensive Multilingual Collection of Articles and their Sources
by: Barham, Samuel, et al.
Published: (2025)
by: Barham, Samuel, et al.
Published: (2025)
KV-Distill: Nearly Lossless Learnable Context Compression for LLMs
by: Chari, Vivek, et al.
Published: (2025)
by: Chari, Vivek, et al.
Published: (2025)
Language Models and Logic Programs for Trustworthy Tax Reasoning
by: Jurayj, William, et al.
Published: (2025)
by: Jurayj, William, et al.
Published: (2025)
Similar Items
-
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
by: Xu, Haoran, et al.
Published: (2023) -
Streaming Sequence Transduction through Dynamic Compression
by: Tan, Weiting, et al.
Published: (2024) -
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
by: Li, Tianjian, et al.
Published: (2024) -
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
by: Shen, Lingfeng, et al.
Published: (2024) -
DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation
by: Tan, Weiting, et al.
Published: (2024)