| Main Authors: | Ma, Ziqiao, Ding, Jing, Zhang, Xuejun, Luo, Dezhi, Ding, Jiahe, Xu, Sihan, Huang, Yuchen, Peng, Run, Chai, Joyce |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.16060 |
Similar Items
Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
by: Ma, Ziqiao, et al.
Published: (2023)
Multi-Object Hallucination in Vision-Language Models
by: Chen, Xuweiyi, et al.
Published: (2024)
World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models
by: Ma, Ziqiao, et al.
Published: (2023)
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
by: Ma, Ziqiao, et al.
Published: (2024)
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities
by: Zhang, Zheyuan, et al.
Published: (2024)
The Mechanistic Emergence of Symbol Grounding in Language Models
by: Wu, Shuyu, et al.
Published: (2025)
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation
by: Zhang, Yichi, et al.
Published: (2024)
CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation
by: Xu, Sihan, et al.
Published: (2023)
Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?
by: Junker, Simeon, et al.
Published: (2025)
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
by: Zhang, Yue, et al.
Published: (2024)
DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences
by: Huang, Yidong, et al.
Published: (2024)
Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry
by: Peng, Run, et al.
Published: (2025)
On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
by: Liu, Dezhi, et al.
Published: (2020)
The Pragmatic Mind of Machines: Tracing the Emergence of Pragmatic Competence in Large Language Models
by: Yu, Kefan, et al.
Published: (2025)
Vision-Language Models Mistake Head Orientation for Gaze Direction: Nonverbal Conversation Cues
by: Zhang, Zory, et al.
Published: (2025)
Pragmatic Competence Evaluation of Large Language Models for the Korean Language
by: Park, Dojun, et al.
Published: (2024)
Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation
by: Dai, Yinpei, et al.
Published: (2023)
From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens
by: Sheta, Hala, et al.
Published: (2025)
Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors
by: Wang, Jian, et al.
Published: (2025)
Next-Embedding Prediction Makes Strong Vision Learners
by: Xu, Sihan, et al.
Published: (2025)
O-Edit: Orthogonal Subspace Editing for Language Model Sequential Editing
by: Cai, Yuchen, et al.
Published: (2024)
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
by: Yu, Shoubin, et al.
Published: (2025)
TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models
by: Cao, Jiahuan, et al.
Published: (2024)
ChartAdapter: Large Vision-Language Model for Chart Summarization
by: Xu, Peixin, et al.
Published: (2024)
MMCode: Benchmarking Multimodal Large Language Models for Code Generation with Visually Rich Programming Problems
by: Li, Kaixin, et al.
Published: (2024)
Referring Expressions as a Lens into Spatial Language Grounding in Vision-Language Models
by: Tumu, Akshar, et al.
Published: (2025)
How Hypocritical Is Your LLM judge? Listener-Speaker Asymmetries in the Pragmatic Competence of Large Language Models
by: Sieker, Judith, et al.
Published: (2026)
CoT Referring: Improving Referring Expression Tasks with Grounded Reasoning
by: Dong, Qihua, et al.
Published: (2025)
Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
by: Liu, Guangliang, et al.
Published: (2025)
Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models
by: Ding, Yi, et al.
Published: (2025)
Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties
by: Yu, Keunwoo Peter, et al.
Published: (2023)
Intercultural Competence and Pragmatics
by: Schauer, Gila A.
Published: (2024)
A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily
by: Ding, Peng, et al.
Published: (2023)
P-ICL: Point In-Context Learning for Named Entity Recognition with Large Language Models
by: Jiang, Guochao, et al.
Published: (2024)
Prosody in Pragmatic Competence: Proficiency Impact on Pitch and Fluency Features in Request‐Making in Second Language Chinese
by: Mo Chen, et al.
Published: (2025)
Evaluation of Cultural Competence of Vision-Language Models
by: Yadav, Srishti, et al.
Published: (2025)
Sherlock: Self-Correcting Reasoning in Vision-Language Models
by: Ding, Yi, et al.
Published: (2025)
HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization
by: Peng, Qiwei, et al.
Published: (2024)
Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue Generation
by: Zhang, Bo, et al.
Published: (2024)
C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models
by: Cao, Jiahuan, et al.
Published: (2024)