| Main Authors: | Bowen, Braeden, Vijayan, Vipin, Grigsby, Scott, Anderson, Timothy, Gwinnup, Jeremy |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.03075 |
Similar Items
The Case for Evaluating Multimodal Translation Models on Text Datasets
by: Vijayan, Vipin, et al.
Published: (2024)
Adding Multimodal Capabilities to a Text-only Translation Model
by: Vijayan, Vipin, et al.
Published: (2024)
Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets
by: Long, Zi, et al.
Published: (2024)
VIDA: A dataset for Visually Dependent Ambiguity in Multimodal Machine Translation
by: Pan, Jingheng, et al.
Published: (2026)
Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation
by: Chen, Andong, et al.
Published: (2024)
Towards Zero-Shot Multimodal Machine Translation
by: Futeral, Matthieu, et al.
Published: (2024)
Dual-branch Prompting for Multimodal Machine Translation
by: Wang, Jie, et al.
Published: (2025)
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2025)
Extend Adversarial Policy Against Neural Machine Translation via Unknown Token
by: Zou, Wei, et al.
Published: (2025)
LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
by: Zebaze, Armel, et al.
Published: (2025)
MatViX: Multimodal Information Extraction from Visually Rich Articles
by: Khalighinejad, Ghazal, et al.
Published: (2024)
Contrastive Token Learning with Similarity Decay for Repetition Suppression in Machine Translation
by: Dai, Huangyu, et al.
Published: (2024)
Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
by: Bell, Samuel J., et al.
Published: (2025)
Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion
by: Du, Yexing, et al.
Published: (2026)
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
by: Villa-Cueva, Emilio, et al.
Published: (2025)
BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression
by: Larionov, Daniil, et al.
Published: (2025)
Testing the Limits of Machine Translation from One Book
by: Shaw, Jonathan, et al.
Published: (2025)
Why do Large Language Models Fail in Low-resource Translation? Unraveling the Token Dynamics of Large Language Models for Machine Translation
by: Qian, Shenbin, et al.
Published: (2026)
Efficient Pre-Training with Token Superposition
by: Peng, Bowen, et al.
Published: (2026)
v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning
by: Chung, Jiwan, et al.
Published: (2025)
Exploring Machine Learning and Language Models for Multimodal Depression Detection
by: Hong, Javier Si Zhao, et al.
Published: (2025)
M3PO: Multimodal-Model-Guided Preference Optimization for Visual Instruction Following
by: Gao, Ruirui, et al.
Published: (2025)
Context-Informed Machine Translation of Manga using Multimodal Large Language Models
by: Lippmann, Philip, et al.
Published: (2024)
EMMeTT: Efficient Multimodal Machine Translation Training
by: Żelasko, Piotr, et al.
Published: (2024)
Automatic Machine Translation Detection Using a Surrogate Multilingual Translation Model
by: García-Romero, Cristian, et al.
Published: (2025)
GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation
by: Xiong, Jiafeng, et al.
Published: (2025)
TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
by: Lin, Haokun, et al.
Published: (2025)
Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection
by: Wu, Chenwang, et al.
Published: (2026)
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries
by: Lv, Jinze, et al.
Published: (2025)
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs
by: Li, Yanhong, et al.
Published: (2025)
Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?
by: Gogoulou, Evangelia, et al.
Published: (2025)
Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation
by: Himmi, Anas, et al.
Published: (2024)
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
by: Ma, Chuofan, et al.
Published: (2024)
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
by: Gigant, Théo, et al.
Published: (2026)
Token Masking Improves Transformer-Based Text Classification
by: Xu, Xianglong, et al.
Published: (2025)
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers
by: Yuan, Qianhao, et al.
Published: (2025)
On the Hallucination in Simultaneous Machine Translation
by: Zhong, Meizhi, et al.
Published: (2024)
Paraphrase-Aligned Machine Translation
by: Chang, Ke-Ching, et al.
Published: (2024)
Vision-Grounded Machine Interpreting: Improving the Translation Process through Visual Cues
by: Fantinuoli, Claudio
Published: (2025)
Estimating Machine Translation Difficulty
by: Proietti, Lorenzo, et al.
Published: (2025)