Saved in:
| Main Authors: | Vijayan, Vipin, Bowen, Braeden, Grigsby, Scott, Anderson, Timothy, Gwinnup, Jeremy |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.03014 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Adding Multimodal Capabilities to a Text-only Translation Model
by: Vijayan, Vipin, et al.
Published: (2024)
by: Vijayan, Vipin, et al.
Published: (2024)
Detecting Concrete Visual Tokens for Multimodal Machine Translation
by: Bowen, Braeden, et al.
Published: (2024)
by: Bowen, Braeden, et al.
Published: (2024)
Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion
by: Du, Yexing, et al.
Published: (2026)
by: Du, Yexing, et al.
Published: (2026)
Comparative Evaluation of Machine Translation Systems on Images with Text
by: Puchol, Blai, et al.
Published: (2026)
by: Puchol, Blai, et al.
Published: (2026)
Decoding the Multimodal Mind: Generalizable Brain-to-Text Translation via Multimodal Alignment and Adaptive Routing
by: Ye, Chunyu, et al.
Published: (2025)
by: Ye, Chunyu, et al.
Published: (2025)
Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets
by: Long, Zi, et al.
Published: (2024)
by: Long, Zi, et al.
Published: (2024)
Breaking the Silence: A Dataset and Benchmark for Bangla Text-to-Gloss Translation
by: Abdullah, Sharif Mohammad, et al.
Published: (2025)
by: Abdullah, Sharif Mohammad, et al.
Published: (2025)
Multimodal Multi-turn Conversation Stance Detection: A Challenge Dataset and Effective Model
by: Niu, Fuqiang, et al.
Published: (2024)
by: Niu, Fuqiang, et al.
Published: (2024)
Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator
by: Cao, Qian, et al.
Published: (2025)
by: Cao, Qian, et al.
Published: (2025)
Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets
by: Rehman, Tohida, et al.
Published: (2025)
by: Rehman, Tohida, et al.
Published: (2025)
Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss
by: Zhang, Bowen, et al.
Published: (2024)
by: Zhang, Bowen, et al.
Published: (2024)
SiTSE: Sinhala Text Simplification Dataset and Evaluation
by: Ranathunga, Surangika, et al.
Published: (2024)
by: Ranathunga, Surangika, et al.
Published: (2024)
Arabic Dataset for LLM Safeguard Evaluation
by: Ashraf, Yasser, et al.
Published: (2024)
by: Ashraf, Yasser, et al.
Published: (2024)
Rethinking Multilingual Vision-Language Translation: Dataset, Evaluation, and Adaptation
by: Wang, Xintong, et al.
Published: (2025)
by: Wang, Xintong, et al.
Published: (2025)
MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion
by: Qiu, Haoyi, et al.
Published: (2025)
by: Qiu, Haoyi, et al.
Published: (2025)
CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models
by: Liu, Wentao, et al.
Published: (2024)
by: Liu, Wentao, et al.
Published: (2024)
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries
by: Lv, Jinze, et al.
Published: (2025)
by: Lv, Jinze, et al.
Published: (2025)
Evaluating Multimodal Large Language Models on Vertically Written Japanese Text
by: Sasagawa, Keito, et al.
Published: (2025)
by: Sasagawa, Keito, et al.
Published: (2025)
Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German
by: Lardelli, Manuel, et al.
Published: (2024)
by: Lardelli, Manuel, et al.
Published: (2024)
CommonWhy: A Dataset for Evaluating Entity-Based Causal Commonsense Reasoning in Large Language Models
by: Toroghi, Armin, et al.
Published: (2026)
by: Toroghi, Armin, et al.
Published: (2026)
A Chinese Dataset for Evaluating the Safeguards in Large Language Models
by: Wang, Yuxia, et al.
Published: (2024)
by: Wang, Yuxia, et al.
Published: (2024)
MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding
by: Munikoti, Sai, et al.
Published: (2026)
by: Munikoti, Sai, et al.
Published: (2026)
Generating Difficult-to-Translate Texts
by: Zouhar, Vilém, et al.
Published: (2025)
by: Zouhar, Vilém, et al.
Published: (2025)
On the Implications of Verbose LLM Outputs: A Case Study in Translation Evaluation
by: Briakou, Eleftheria, et al.
Published: (2024)
by: Briakou, Eleftheria, et al.
Published: (2024)
FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity
by: Jourdan, Fanny, et al.
Published: (2025)
by: Jourdan, Fanny, et al.
Published: (2025)
Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens
by: Nigatu, Hellina Hailu, et al.
Published: (2025)
by: Nigatu, Hellina Hailu, et al.
Published: (2025)
Don't Pass@k: A Bayesian Framework for Large Language Model Evaluation
by: Hariri, Mohsen, et al.
Published: (2025)
by: Hariri, Mohsen, et al.
Published: (2025)
Automating Evaluation of Diffusion Model Unlearning with (Vision-) Language Model World Knowledge
by: Yeats, Eric, et al.
Published: (2025)
by: Yeats, Eric, et al.
Published: (2025)
Simulstream: Open-Source Toolkit for Evaluation and Demonstration of Streaming Speech-to-Text Translation Systems
by: Gaido, Marco, et al.
Published: (2025)
by: Gaido, Marco, et al.
Published: (2025)
Listening or Reading? Evaluating Speech Awareness in Chain-of-Thought Speech-to-Text Translation
by: Romero-Díaz, Jacobo, et al.
Published: (2025)
by: Romero-Díaz, Jacobo, et al.
Published: (2025)
MuSaG: A Multimodal German Sarcasm Dataset with Full-Modal Annotations
by: Scott, Aaron, et al.
Published: (2025)
by: Scott, Aaron, et al.
Published: (2025)
ERIT Lightweight Multimodal Dataset for Elderly Emotion Recognition and Multimodal Fusion Evaluation
by: Frieske, Rita, et al.
Published: (2024)
by: Frieske, Rita, et al.
Published: (2024)
Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
by: Briakou, Eleftheria, et al.
Published: (2024)
by: Briakou, Eleftheria, et al.
Published: (2024)
Evaluating Language Translation Models by Playing Telephone
by: Saba, Syeda Jannatus, et al.
Published: (2025)
by: Saba, Syeda Jannatus, et al.
Published: (2025)
TeachObs: A Human-Validated Benchmark for Multimodal Teaching Observation and Model Evaluation
by: Jeong, Yeil, et al.
Published: (2026)
by: Jeong, Yeil, et al.
Published: (2026)
Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image
by: Hu, Yushi, et al.
Published: (2025)
by: Hu, Yushi, et al.
Published: (2025)
CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart
by: Zhao, Bowen, et al.
Published: (2024)
by: Zhao, Bowen, et al.
Published: (2024)
MINOS: A Multimodal Evaluation Model for Bidirectional Generation Between Image and Text
by: Zhang, Junzhe, et al.
Published: (2025)
by: Zhang, Junzhe, et al.
Published: (2025)
Text2VLM: Adapting Text-Only Datasets to Evaluate Alignment Training in Visual Language Models
by: Downer, Gabriel, et al.
Published: (2025)
by: Downer, Gabriel, et al.
Published: (2025)
XFacta: Contemporary, Real-World Dataset and Evaluation for Multimodal Misinformation Detection with Multimodal LLMs
by: Xiao, Yuzhuo, et al.
Published: (2025)
by: Xiao, Yuzhuo, et al.
Published: (2025)
Similar Items
-
Adding Multimodal Capabilities to a Text-only Translation Model
by: Vijayan, Vipin, et al.
Published: (2024) -
Detecting Concrete Visual Tokens for Multimodal Machine Translation
by: Bowen, Braeden, et al.
Published: (2024) -
Scalable Multilingual Multimodal Machine Translation with Speech-Text Fusion
by: Du, Yexing, et al.
Published: (2026) -
Comparative Evaluation of Machine Translation Systems on Images with Text
by: Puchol, Blai, et al.
Published: (2026) -
Decoding the Multimodal Mind: Generalizable Brain-to-Text Translation via Multimodal Alignment and Adaptive Routing
by: Ye, Chunyu, et al.
Published: (2025)