:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Luo, Meng, Li, Bobo, Xu, Shanqing, Zhang, Shize, Chen, Qiuchan, Han, Menglu, Chen, Wenhao, Huang, Yanxiang, Fei, Hao, Lee, Mong-Li, Hsu, Wynne
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.00971
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents
by: Li, Bobo, et al.
Published: (2025)

PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis
by: Luo, Meng, et al.
Published: (2024)

Orthogonal Spatial-temporal Distributional Transfer for 4D Generation
by: Liu, Wei, et al.
Published: (2026)

Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition
by: Fei, Hao, et al.
Published: (2024)

Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment
by: Li, Bobo, et al.
Published: (2026)

Faithful Logical Reasoning via Symbolic Chain-of-Thought
by: Xu, Jundong, et al.
Published: (2024)

TRUST-VL: An Explainable News Assistant for General Multimodal Misinformation Detection
by: Yan, Zehong, et al.
Published: (2025)

SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
by: Qi, Peng, et al.
Published: (2024)

Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
by: Yan, Zehong, et al.
Published: (2025)

LEAF-Mamba: Local Emphatic and Adaptive Fusion State Space Model for RGB-D Salient Object Detection
by: Wu, Lanhu, et al.
Published: (2025)

Multi-Part Object Representations via Graph Structures and Co-Part Discovery
by: Foo, Alex, et al.
Published: (2025)

Multi-Modal Continual Learning via Cross-Modality Adapters and Representation Alignment with Knowledge Preservation
by: Chee, Evelyn, et al.
Published: (2025)

From Personas to Talks: Revisiting the Impact of Personas on LLM-Synthesized Emotional Support Conversations
by: Wu, Shenghan, et al.
Published: (2025)

Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework
by: Xu, Jundong, et al.
Published: (2024)

NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations
by: Luo, Meng, et al.
Published: (2024)

Evidence-Based Temporal Fact Verification
by: Barik, Anab Maulana, et al.
Published: (2024)

ChronoFact: Timeline-based Temporal Fact Verification
by: Barik, Anab Maulana, et al.
Published: (2024)

MuSLR: Multimodal Symbolic Logical Reasoning
by: Xu, Jundong, et al.
Published: (2025)

Test-Time Adaptation by Causal Trimming
by: Liu, Yingnan, et al.
Published: (2025)

UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark
by: Li, Yanlin, et al.
Published: (2026)

LogicReward: Incentivizing LLM Reasoning via Step-Wise Logical Supervision
by: Xu, Jundong, et al.
Published: (2025)

Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding
by: Luo, Meng, et al.
Published: (2025)

On the Adaptive Psychological Persuasion of Large Language Models
by: Ju, Tianjie, et al.
Published: (2025)

Althea: Human-AI Collaboration for Fact-Checking and Critical Reasoning
by: Churina, Svetlana, et al.
Published: (2025)

The Effects of Mindfulness on Shame: Exploring Mediation by Cognitive Flexibility and Self‐Compassion in a Chinese Adult Population
by: Xiaoshuo Zhang, et al.
Published: (2024)

Probing then Editing Response Personality of Large Language Models
by: Ju, Tianjie, et al.
Published: (2025)

Towards Robust Out-of-Distribution Generalization Bounds via Sharpness
by: Zou, Yingtian, et al.
Published: (2024)

Cross-Domain Feature Augmentation for Domain Generalization
by: Liu, Yingnan, et al.
Published: (2024)

MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind
by: Zhang, Zheng, et al.
Published: (2025)

MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
by: Huang, Zixian, et al.
Published: (2024)

Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
by: Ju, Tianjie, et al.
Published: (2025)

QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions
by: Tang, Yixuan, et al.
Published: (2026)

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models
by: Qian, Zhe, et al.
Published: (2026)

Beyond Context to Cognitive Appraisal: Emotion Reasoning as a Theory of Mind Benchmark for Large Language Models
by: Yeo, Gerard Christopher, et al.
Published: (2025)

COKE: A Cognitive Knowledge Graph for Machine Theory of Mind
by: Wu, Jincenzi, et al.
Published: (2023)

Dynamic Emotion and Personality Profiling for Multimodal Deception Detection
by: Zheng, Li, et al.
Published: (2026)

Enhancing Thermal Conductive Properties With Liquid Metal‐Assisted Epoxy Resin Composites
by: Junyan Wang, et al.
Published: (2025)

When Disagreements Elicit Robustness: Investigating Self-Repair Capabilities under LLM Multi-Agent Disagreements
by: Ju, Tianjie, et al.
Published: (2025)

M$^{3}$D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction
by: Liu, Jiang, et al.
Published: (2024)

Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
by: Chen, Zhawnen, et al.
Published: (2024)