:: Library Catalog

Bibliographic Details
Main Authors:	Bowen, Braeden; Vijayan, Vipin; Grigsby, Scott; Anderson, Timothy; Gwinnup, Jeremy
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2403.03075

Similar Items

The Case for Evaluating Multimodal Translation Models on Text Datasets
by: Vijayan, Vipin, et al.
Published: (2024)

Adding Multimodal Capabilities to a Text-only Translation Model
by: Vijayan, Vipin, et al.
Published: (2024)

Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets
by: Long, Zi, et al.
Published: (2024)

VIDA: A dataset for Visually Dependent Ambiguity in Multimodal Machine Translation
by: Pan, Jingheng, et al.
Published: (2026)

Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation
by: Chen, Andong, et al.
Published: (2024)

Towards Zero-Shot Multimodal Machine Translation
by: Futeral, Matthieu, et al.
Published: (2024)

Dual-branch Prompting for Multimodal Machine Translation
by: Wang, Jie, et al.
Published: (2025)

Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
by: Qu, Zhi, et al.
Published: (2025)

Extend Adversarial Policy Against Neural Machine Translation via Unknown Token
by: Zou, Wei, et al.
Published: (2025)

LLM Reasoning for Machine Translation: Synthetic Data Generation over Thinking Tokens
by: Zebaze, Armel, et al.
Published: (2025)

MatViX: Multimodal Information Extraction from Visually Rich Articles
by: Khalighinejad, Ghazal, et al.
Published: (2024)

Contrastive Token Learning with Similarity Decay for Repetition Suppression in Machine Translation
by: Dai, Huangyu, et al.
Published: (2024)

Translate, then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
by: Bell, Samuel J., et al.
Published: (2025)

Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion
by: Du, Yexing, et al.
Published: (2026)

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
by: Villa-Cueva, Emilio, et al.
Published: (2025)

BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression
by: Larionov, Daniil, et al.
Published: (2025)

Testing the Limits of Machine Translation from One Book
by: Shaw, Jonathan, et al.
Published: (2025)

Why do Large Language Models Fail in Low-resource Translation? Unraveling the Token Dynamics of Large Language Models for Machine Translation
by: Qian, Shenbin, et al.
Published: (2026)

Efficient Pre-Training with Token Superposition
by: Peng, Bowen, et al.
Published: (2026)

v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning
by: Chung, Jiwan, et al.
Published: (2025)

Exploring Machine Learning and Language Models for Multimodal Depression Detection
by: Hong, Javier Si Zhao, et al.
Published: (2025)

M3PO: Multimodal-Model-Guided Preference Optimization for Visual Instruction Following
by: Gao, Ruirui, et al.
Published: (2025)

Context-Informed Machine Translation of Manga using Multimodal Large Language Models
by: Lippmann, Philip, et al.
Published: (2024)

EMMeTT: Efficient Multimodal Machine Translation Training
by: Żelasko, Piotr, et al.
Published: (2024)

Automatic Machine Translation Detection Using a Surrogate Multilingual Translation Model
by: García-Romero, Cristian, et al.
Published: (2025)

GIIFT: Graph-guided Inductive Image-free Multimodal Machine Translation
by: Xiong, Jiafeng, et al.
Published: (2025)

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation
by: Lin, Haokun, et al.
Published: (2025)

Multi-Level Contextual Token Relation Modeling for Machine-Generated Text Detection
by: Wu, Chenwang, et al.
Published: (2026)

TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries
by: Lv, Jinze, et al.
Published: (2025)

Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs
by: Li, Yanhong, et al.
Published: (2025)

Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation?
by: Gogoulou, Evangelia, et al.
Published: (2025)

Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation
by: Himmi, Anas, et al.
Published: (2024)

Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
by: Ma, Chuofan, et al.
Published: (2024)

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
by: Gigant, Théo, et al.
Published: (2026)

Token Masking Improves Transformer-Based Text Classification
by: Xu, Xianglong, et al.
Published: (2025)

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers
by: Yuan, Qianhao, et al.
Published: (2025)

On the Hallucination in Simultaneous Machine Translation
by: Zhong, Meizhi, et al.
Published: (2024)

Paraphrase-Aligned Machine Translation
by: Chang, Ke-Ching, et al.
Published: (2024)

Vision-Grounded Machine Interpreting: Improving the Translation Process through Visual Cues
by: Fantinuoli, Claudio
Published: (2025)

Estimating Machine Translation Difficulty
by: Proietti, Lorenzo, et al.
Published: (2025)