:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pakbin, Arash, Su, Aaron, Lee, Donald K. K., Mortazavi, Bobak J.
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2410.03725
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Protecting multimodal large language models against misleading visualizations
by: Tonglet, Jonathan, et al.
Published: (2025)

Assessing the alignment between infants' visual and linguistic experience using multimodal language models
by: Tan, Alvin Wei Ming, et al.
Published: (2025)

WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation
by: Matos, João, et al.
Published: (2024)

Is your multimodal large language model a good science tutor?
by: Liu, Ming, et al.
Published: (2025)

GRASP: A novel benchmark for evaluating language GRounding And Situated Physics understanding in multimodal language models
by: Jassim, Serwan, et al.
Published: (2023)

CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images
by: Lee, Seowoo, et al.
Published: (2023)

DocLLM: A layout-aware generative language model for multimodal document understanding
by: Wang, Dongsheng, et al.
Published: (2023)

Why do language models perform worse for morphologically complex languages?
by: Arnett, Catherine, et al.
Published: (2024)

Evaluating language models as risk scores
by: Cruz, André F., et al.
Published: (2024)

MNAFT: modality neuron-aware fine-tuning of multimodal large language models for image translation
by: Li, Bo, et al.
Published: (2026)

RoMemes: A multimodal meme corpus for the Romanian language
by: Păiş, Vasile, et al.
Published: (2024)

Do large language models resemble humans in language use?
by: Cai, Zhenguang G., et al.
Published: (2023)

Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)

Disentangling generalization and memorization in large language models using chess
by: Pleiss, Leonard S., et al.
Published: (2026)

OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network
by: Zhao, Tiancheng, et al.
Published: (2022)

Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners
by: Vaccaro Jr, Michael, et al.
Published: (2024)

The American Ghost in the Machine: How language models align culturally and the effects of cultural prompting
by: Luther, James, et al.
Published: (2025)

Large language models have learned to use language
by: Lupyan, Gary
Published: (2025)

Vibe-Eval: A hard evaluation suite for measuring progress of multimodal language models
by: Padlewski, Piotr, et al.
Published: (2024)

The opportunities and risks of large language models in mental health
by: Lawrence, Hannah R., et al.
Published: (2024)

Attribution analysis of legal language as used by LLM
by: Belew, Richard K.
Published: (2025)

DevBench: A multimodal developmental benchmark for language learning
by: Tan, Alvin Wei Ming, et al.
Published: (2024)

When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis
by: Zhang, Ruixuan, et al.
Published: (2025)

Advanced spectral clustering for heterogeneous data in credit risk monitoring systems
by: Han, Lu, et al.
Published: (2025)

Human-like object concept representations emerge naturally in multimodal large language models
by: Du, Changde, et al.
Published: (2024)

Estimating near-verbatim extraction risk in language models with decoding-constrained beam search
by: Cooper, A. Feder, et al.
Published: (2026)

Self-supervised contrastive learning of echocardiogram videos enables label-efficient cardiac disease diagnosis
by: Holste, Gregory, et al.
Published: (2022)

Prediction of mortality and resource utilization in critical care: a deep learning approach using multimodal electronic health records with natural language processing techniques
by: Ruan, Yucheng, et al.
Published: (2025)

MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model
by: Reddy, K. Sahit, et al.
Published: (2025)

Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
by: Dao, Alan, et al.
Published: (2024)

Functional Subspace, where language models can use vector algebra to solve problems
by: Lee, Jung H., et al.
Published: (2026)

Embodied Task Planning via Graph-Informed Action Generation with Large Language Models
by: Li, Xiang, et al.
Published: (2026)

Evaluating Large language models on Understanding Korean indirect Speech acts
by: Koo, Youngeun, et al.
Published: (2025)

Large language models are not about natural language
by: Bolhuis, Johan J., et al.
Published: (2025)

Cross-modal linkage risk in clinical vision-language models
by: Arasteh, Soroosh Tayebi, et al.
Published: (2026)

Do language models practice what they preach? Examining language ideologies about gendered language reform encoded in LLMs
by: Watson, Julia, et al.
Published: (2024)

One fish, two fish, but not the whole sea: Alignment reduces language models' conceptual diversity
by: Murthy, Sonia K., et al.
Published: (2024)

Manipulating language models' training data to study syntactic constraint learning: the case of English passivization
by: Leong, Cara Su-Yi, et al.
Published: (2024)

Challenges and opportunities in portraying emotion in generated sign language
by: McDonald, John C., et al.
Published: (2025)

Text2Model: Generating dynamic chemical reactor models using large language models (LLMs)
by: Rupprecht, Sophia, et al.
Published: (2025)