Saved in:
| Main Authors: | Vijayan, Vipin, Bowen, Braeden, Grigsby, Scott, Anderson, Timothy, Gwinnup, Jeremy |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.03045 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Case for Evaluating Multimodal Translation Models on Text Datasets
by: Vijayan, Vipin, et al.
Published: (2024)
by: Vijayan, Vipin, et al.
Published: (2024)
Detecting Concrete Visual Tokens for Multimodal Machine Translation
by: Bowen, Braeden, et al.
Published: (2024)
by: Bowen, Braeden, et al.
Published: (2024)
Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2024)
by: Qu, Zhi, et al.
Published: (2024)
Exploring the Capabilities of Large Multimodal Models on Dense Text
by: Zhang, Shuo, et al.
Published: (2024)
by: Zhang, Shuo, et al.
Published: (2024)
From TOWER to SPIRE: Adding the Speech Modality to a Translation-Specialist LLM
by: Ambilduke, Kshitij, et al.
Published: (2025)
by: Ambilduke, Kshitij, et al.
Published: (2025)
Enhancing Code-switched Text-to-Speech Synthesis Capability in Large Language Models with only Monolingual Corpora
by: Xu, Jing, et al.
Published: (2024)
by: Xu, Jing, et al.
Published: (2024)
Decoder-only Streaming Transformer for Simultaneous Translation
by: Guo, Shoutao, et al.
Published: (2024)
by: Guo, Shoutao, et al.
Published: (2024)
Wings: Learning Multimodal LLMs without Text-only Forgetting
by: Zhang, Yi-Kai, et al.
Published: (2024)
by: Zhang, Yi-Kai, et al.
Published: (2024)
Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation
by: Pombal, José, et al.
Published: (2025)
by: Pombal, José, et al.
Published: (2025)
Adding Alignment Control to Language Models
by: Zhu, Wenhong, et al.
Published: (2025)
by: Zhu, Wenhong, et al.
Published: (2025)
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
by: Luo, Yingfeng, et al.
Published: (2025)
by: Luo, Yingfeng, et al.
Published: (2025)
Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion
by: Du, Yexing, et al.
Published: (2026)
by: Du, Yexing, et al.
Published: (2026)
Text-only Synthesis for Image Captioning
by: Zhou, Qing, et al.
Published: (2024)
by: Zhou, Qing, et al.
Published: (2024)
Investigating Decoder-only Large Language Models for Speech-to-text Translation
by: Huang, Chao-Wei, et al.
Published: (2024)
by: Huang, Chao-Wei, et al.
Published: (2024)
Decoding the Multimodal Mind: Generalizable Brain-to-Text Translation via Multimodal Alignment and Adaptive Routing
by: Ye, Chunyu, et al.
Published: (2025)
by: Ye, Chunyu, et al.
Published: (2025)
Looking Right is Sometimes Right: Investigating the Capabilities of Decoder-only LLMs for Sequence Labeling
by: Dukić, David, et al.
Published: (2024)
by: Dukić, David, et al.
Published: (2024)
Continually Adding New Languages to Multilingual Language Models
by: Owodunni, Abraham Toluwase, et al.
Published: (2025)
by: Owodunni, Abraham Toluwase, et al.
Published: (2025)
A Novel Paradigm Boosting Translation Capabilities of Large Language Models
by: Guo, Jiaxin, et al.
Published: (2024)
by: Guo, Jiaxin, et al.
Published: (2024)
Unlocking Reasoning Capability on Machine Translation in Large Language Models
by: Rajaee, Sara, et al.
Published: (2026)
by: Rajaee, Sara, et al.
Published: (2026)
Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model
by: Wang, Minghan, et al.
Published: (2025)
by: Wang, Minghan, et al.
Published: (2025)
ControlMed: Adding Reasoning Control to Medical Language Model
by: Lee, Sung-Min, et al.
Published: (2025)
by: Lee, Sung-Min, et al.
Published: (2025)
Forecasting Frontier Language Model Agent Capabilities
by: Pimpale, Govind, et al.
Published: (2025)
by: Pimpale, Govind, et al.
Published: (2025)
Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss
by: Zhang, Bowen, et al.
Published: (2024)
by: Zhang, Bowen, et al.
Published: (2024)
Generating Difficult-to-Translate Texts
by: Zouhar, Vilém, et al.
Published: (2025)
by: Zouhar, Vilém, et al.
Published: (2025)
Gemini: A Family of Highly Capable Multimodal Models
by: Gemini Team, et al.
Published: (2023)
by: Gemini Team, et al.
Published: (2023)
Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
by: Morrison, Jacob, et al.
Published: (2024)
by: Morrison, Jacob, et al.
Published: (2024)
Imp: Highly Capable Large Multimodal Models for Mobile Devices
by: Shao, Zhenwei, et al.
Published: (2024)
by: Shao, Zhenwei, et al.
Published: (2024)
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
by: Que, Haoran, et al.
Published: (2024)
by: Que, Haoran, et al.
Published: (2024)
PROST-LLM: Progressively Enhancing the Speech-to-Speech Translation Capability in LLMs
by: Xu, Jing, et al.
Published: (2026)
by: Xu, Jing, et al.
Published: (2026)
Emergent Explainability: Adding a causal chain to neural network inference
by: Perrett, Adam
Published: (2024)
by: Perrett, Adam
Published: (2024)
Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
by: Briakou, Eleftheria, et al.
Published: (2024)
by: Briakou, Eleftheria, et al.
Published: (2024)
Self-Translate-Train: Enhancing Cross-Lingual Transfer of Large Language Models via Inherent Capability
by: Ri, Ryokan, et al.
Published: (2024)
by: Ri, Ryokan, et al.
Published: (2024)
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations
by: Miller, Evan
Published: (2024)
by: Miller, Evan
Published: (2024)
Merge and Conquer: Instructing Multilingual Models by Adding Target Language Weights
by: Valero, Eneko, et al.
Published: (2026)
by: Valero, Eneko, et al.
Published: (2026)
CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart
by: Zhao, Bowen, et al.
Published: (2024)
by: Zhao, Bowen, et al.
Published: (2024)
Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models
by: Zhang, Xiechi, et al.
Published: (2024)
by: Zhang, Xiechi, et al.
Published: (2024)
Can They Dixit? Yes they Can! Dixit as a Playground for Multimodal Language Model Capabilities
by: Balepur, Nishant, et al.
Published: (2025)
by: Balepur, Nishant, et al.
Published: (2025)
Chitchat as Interference: Adding User Backstories to Task-Oriented Dialogues
by: Stricker, Armand, et al.
Published: (2024)
by: Stricker, Armand, et al.
Published: (2024)
M3PO: Multimodal-Model-Guided Preference Optimization for Visual Instruction Following
by: Gao, Ruirui, et al.
Published: (2025)
by: Gao, Ruirui, et al.
Published: (2025)
Similar Items
-
The Case for Evaluating Multimodal Translation Models on Text Datasets
by: Vijayan, Vipin, et al.
Published: (2024) -
Detecting Concrete Visual Tokens for Multimodal Machine Translation
by: Bowen, Braeden, et al.
Published: (2024) -
Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2024) -
Exploring the Capabilities of Large Multimodal Models on Dense Text
by: Zhang, Shuo, et al.
Published: (2024) -
From TOWER to SPIRE: Adding the Speech Modality to a Translation-Specialist LLM
by: Ambilduke, Kshitij, et al.
Published: (2025)