| Main Authors: | Pakbin, Arash, Su, Aaron, Lee, Donald K. K., Mortazavi, Bobak J. |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.03725 |
Similar Items
Protecting multimodal large language models against misleading visualizations
by: Tonglet, Jonathan, et al.
Published: (2025)
Assessing the alignment between infants' visual and linguistic experience using multimodal language models
by: Tan, Alvin Wei Ming, et al.
Published: (2025)
WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation
by: Matos, João, et al.
Published: (2024)
Is your multimodal large language model a good science tutor?
by: Liu, Ming, et al.
Published: (2025)
GRASP: A novel benchmark for evaluating language GRounding And Situated Physics understanding in multimodal language models
by: Jassim, Serwan, et al.
Published: (2023)
CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images
by: Lee, Seowoo, et al.
Published: (2023)
DocLLM: A layout-aware generative language model for multimodal document understanding
by: Wang, Dongsheng, et al.
Published: (2023)
Why do language models perform worse for morphologically complex languages?
by: Arnett, Catherine, et al.
Published: (2024)
Evaluating language models as risk scores
by: Cruz, André F., et al.
Published: (2024)
MNAFT: modality neuron-aware fine-tuning of multimodal large language models for image translation
by: Li, Bo, et al.
Published: (2026)
RoMemes: A multimodal meme corpus for the Romanian language
by: Păiş, Vasile, et al.
Published: (2024)
Do large language models resemble humans in language use?
by: Cai, Zhenguang G., et al.
Published: (2023)
Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)
Disentangling generalization and memorization in large language models using chess
by: Pleiss, Leonard S., et al.
Published: (2026)
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network
by: Zhao, Tiancheng, et al.
Published: (2022)
Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners
by: Vaccaro Jr, Michael, et al.
Published: (2024)
The American Ghost in the Machine: How language models align culturally and the effects of cultural prompting
by: Luther, James, et al.
Published: (2025)
Large language models have learned to use language
by: Lupyan, Gary
Published: (2025)
Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
by: Padlewski, Piotr, et al.
Published: (2024)
The opportunities and risks of large language models in mental health
by: Lawrence, Hannah R., et al.
Published: (2024)
Attribution analysis of legal language as used by LLM
by: Belew, Richard K.
Published: (2025)
DevBench: A multimodal developmental benchmark for language learning
by: Tan, Alvin Wei Ming, et al.
Published: (2024)
When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis
by: Zhang, Ruixuan, et al.
Published: (2025)
Advanced spectral clustering for heterogeneous data in credit risk monitoring systems
by: Han, Lu, et al.
Published: (2025)
Human-like object concept representations emerge naturally in multimodal large language models
by: Du, Changde, et al.
Published: (2024)
Estimating near-verbatim extraction risk in language models with decoding-constrained beam search
by: Cooper, A. Feder, et al.
Published: (2026)
Self-supervised contrastive learning of echocardiogram videos enables label-efficient cardiac disease diagnosis
by: Holste, Gregory, et al.
Published: (2022)
Prediction of mortality and resource utilization in critical care: a deep learning approach using multimodal electronic health records with natural language processing techniques
by: Ruan, Yucheng, et al.
Published: (2025)
MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model
by: Reddy, K. Sahit, et al.
Published: (2025)
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
by: Dao, Alan, et al.
Published: (2024)
Functional Subspace, where language models can use vector algebra to solve problems
by: Lee, Jung H., et al.
Published: (2026)
Embodied Task Planning via Graph-Informed Action Generation with Large Language Models
by: Li, Xiang, et al.
Published: (2026)
Evaluating Large language models on Understanding Korean indirect Speech acts
by: Koo, Youngeun, et al.
Published: (2025)
Large language models are not about natural language
by: Bolhuis, Johan J., et al.
Published: (2025)
Cross-modal linkage risk in clinical vision-language models
by: Arasteh, Soroosh Tayebi, et al.
Published: (2026)
Do language models practice what they preach? Examining language ideologies about gendered language reform encoded in LLMs
by: Watson, Julia, et al.
Published: (2024)
One fish, two fish, but not the whole sea: Alignment reduces language models' conceptual diversity
by: Murthy, Sonia K., et al.
Published: (2024)
Manipulating language models' training data to study syntactic constraint learning: the case of English passivization
by: Leong, Cara Su-Yi, et al.
Published: (2024)
Challenges and opportunities in portraying emotion in generated sign language
by: McDonald, John C., et al.
Published: (2025)
Text2Model: Generating dynamic chemical reactor models using large language models (LLMs)
by: Rupprecht, Sophia, et al.
Published: (2025)