:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Vijayan, Vipin, Bowen, Braeden, Grigsby, Scott, Anderson, Timothy, Gwinnup, Jeremy
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2403.03045
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Case for Evaluating Multimodal Translation Models on Text Datasets
by: Vijayan, Vipin, et al.
Published: (2024)

Detecting Concrete Visual Tokens for Multimodal Machine Translation
by: Bowen, Braeden, et al.
Published: (2024)

Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2024)

Exploring the Capabilities of Large Multimodal Models on Dense Text
by: Zhang, Shuo, et al.
Published: (2024)

From TOWER to SPIRE: Adding the Speech Modality to a Translation-Specialist LLM
by: Ambilduke, Kshitij, et al.
Published: (2025)

Enhancing Code-switched Text-to-Speech Synthesis Capability in Large Language Models with only Monolingual Corpora
by: Xu, Jing, et al.
Published: (2024)

Decoder-only Streaming Transformer for Simultaneous Translation
by: Guo, Shoutao, et al.
Published: (2024)

Wings: Learning Multimodal LLMs without Text-only Forgetting
by: Zhang, Yi-Kai, et al.
Published: (2024)

Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation
by: Pombal, José, et al.
Published: (2025)

Adding Alignment Control to Language Models
by: Zhu, Wenhong, et al.
Published: (2025)

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
by: Luo, Yingfeng, et al.
Published: (2025)

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion
by: Du, Yexing, et al.
Published: (2026)

Text-only Synthesis for Image Captioning
by: Zhou, Qing, et al.
Published: (2024)

Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024)

Decoding the Multimodal Mind: Generalizable Brain-to-Text Translation via Multimodal Alignment and Adaptive Routing
by: Ye, Chunyu, et al.
Published: (2025)

Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling
by: Dukić, David, et al.
Published: (2024)

Continually Adding New Languages to Multilingual Language Models
by: Owodunni, Abraham Toluwase, et al.
Published: (2025)

A Novel Paradigm Boosting Translation Capabilities of Large Language Models
by: Guo, Jiaxin, et al.
Published: (2024)

Unlocking Reasoning Capability on Machine Translation in Large Language Models
by: Rajaee, Sara, et al.
Published: (2026)

Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model
by: Wang, Minghan, et al.
Published: (2025)

ControlMed: Adding Reasoning Control to Medical Language Model
by: Lee, Sung-Min, et al.
Published: (2025)

Forecasting Frontier Language Model Agent Capabilities
by: Pimpale, Govind, et al.
Published: (2025)

Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss
by: Zhang, Bowen, et al.
Published: (2024)

Generating Difficult-to-Translate Texts
by: Zouhar, Vilém, et al.
Published: (2025)

Gemini: A Family of Highly Capable Multimodal Models
by: Gemini Team, et al.
Published: (2023)

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
by: Morrison, Jacob, et al.
Published: (2024)

Imp: Highly Capable Large Multimodal Models for Mobile Devices
by: Shao, Zhenwei, et al.
Published: (2024)

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
by: Que, Haoran, et al.
Published: (2024)

PROST-LLM: Progressively Enhancing the Speech-to-Speech Translation Capability in LLMs
by: Xu, Jing, et al.
Published: (2026)

Emergent Explainability: Adding a causal chain to neural network inference
by: Perrett, Adam
Published: (2024)

Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
by: Briakou, Eleftheria, et al.
Published: (2024)

Self-Translate-Train: Enhancing Cross-Lingual Transfer of Large Language Models via Inherent Capability
by: Ri, Ryokan, et al.
Published: (2024)

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
by: Miller, Evan
Published: (2024)

Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights
by: Valero, Eneko, et al.
Published: (2026)

CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart
by: Zhao, Bowen, et al.
Published: (2024)

Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
by: Yang, Wang, et al.
Published: (2025)

ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models
by: Zhang, Xiechi, et al.
Published: (2024)

Can They Dixit? Yes they Can! Dixit as a Playground for Multimodal Language Model Capabilities
by: Balepur, Nishant, et al.
Published: (2025)

Chitchat as Interference: Adding User Backstories to Task-Oriented Dialogues
by: Stricker, Armand, et al.
Published: (2024)

M3PO: Multimodal-Model-Guided Preference Optimization for Visual Instruction Following
by: Gao, Ruirui, et al.
Published: (2025)