Saved in:
| Main Authors: | Luo, Meng, Li, Bobo, Xu, Shanqing, Zhang, Shize, Chen, Qiuchan, Han, Menglu, Chen, Wenhao, Huang, Yanxiang, Fei, Hao, Lee, Mong-Li, Hsu, Wynne |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00971 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents
by: Li, Bobo, et al.
Published: (2025)
by: Li, Bobo, et al.
Published: (2025)
PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis
by: Luo, Meng, et al.
Published: (2024)
by: Luo, Meng, et al.
Published: (2024)
Orthogonal Spatial-temporal Distributional Transfer for 4D Generation
by: Liu, Wei, et al.
Published: (2026)
by: Liu, Wei, et al.
Published: (2026)
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition
by: Fei, Hao, et al.
Published: (2024)
by: Fei, Hao, et al.
Published: (2024)
Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment
by: Li, Bobo, et al.
Published: (2026)
by: Li, Bobo, et al.
Published: (2026)
Faithful Logical Reasoning via Symbolic Chain-of-Thought
by: Xu, Jundong, et al.
Published: (2024)
by: Xu, Jundong, et al.
Published: (2024)
TRUST-VL: An Explainable News Assistant for General Multimodal Misinformation Detection
by: Yan, Zehong, et al.
Published: (2025)
by: Yan, Zehong, et al.
Published: (2025)
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection
by: Qi, Peng, et al.
Published: (2024)
by: Qi, Peng, et al.
Published: (2024)
Mitigating GenAI-powered Evidence Pollution for Out-of-Context Multimodal Misinformation Detection
by: Yan, Zehong, et al.
Published: (2025)
by: Yan, Zehong, et al.
Published: (2025)
LEAF-Mamba: Local Emphatic and Adaptive Fusion State Space Model for RGB-D Salient Object Detection
by: Wu, Lanhu, et al.
Published: (2025)
by: Wu, Lanhu, et al.
Published: (2025)
Multi-Part Object Representations via Graph Structures and Co-Part Discovery
by: Foo, Alex, et al.
Published: (2025)
by: Foo, Alex, et al.
Published: (2025)
Multi-Modal Continual Learning via Cross-Modality Adapters and Representation Alignment with Knowledge Preservation
by: Chee, Evelyn, et al.
Published: (2025)
by: Chee, Evelyn, et al.
Published: (2025)
From Personas to Talks: Revisiting the Impact of Personas on LLM-Synthesized Emotional Support Conversations
by: Wu, Shenghan, et al.
Published: (2025)
by: Wu, Shenghan, et al.
Published: (2025)
Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework
by: Xu, Jundong, et al.
Published: (2024)
by: Xu, Jundong, et al.
Published: (2024)
NUS-Emo at SemEval-2024 Task 3: Instruction-Tuning LLM for Multimodal Emotion-Cause Analysis in Conversations
by: Luo, Meng, et al.
Published: (2024)
by: Luo, Meng, et al.
Published: (2024)
Evidence-Based Temporal Fact Verification
by: Barik, Anab Maulana, et al.
Published: (2024)
by: Barik, Anab Maulana, et al.
Published: (2024)
ChronoFact: Timeline-based Temporal Fact Verification
by: Barik, Anab Maulana, et al.
Published: (2024)
by: Barik, Anab Maulana, et al.
Published: (2024)
MuSLR: Multimodal Symbolic Logical Reasoning
by: Xu, Jundong, et al.
Published: (2025)
by: Xu, Jundong, et al.
Published: (2025)
Test-Time Adaptation by Causal Trimming
by: Liu, Yingnan, et al.
Published: (2025)
by: Liu, Yingnan, et al.
Published: (2025)
UniM: A Unified Any-to-Any Interleaved Multimodal Benchmark
by: Li, Yanlin, et al.
Published: (2026)
by: Li, Yanlin, et al.
Published: (2026)
LogicReward: Incentivizing LLM Reasoning via Step-Wise Logical Supervision
by: Xu, Jundong, et al.
Published: (2025)
by: Xu, Jundong, et al.
Published: (2025)
Dr.V: A Hierarchical Perception-Temporal-Cognition Framework to Diagnose Video Hallucination by Fine-grained Spatial-Temporal Grounding
by: Luo, Meng, et al.
Published: (2025)
by: Luo, Meng, et al.
Published: (2025)
On the Adaptive Psychological Persuasion of Large Language Models
by: Ju, Tianjie, et al.
Published: (2025)
by: Ju, Tianjie, et al.
Published: (2025)
Althea: Human-AI Collaboration for Fact-Checking and Critical Reasoning
by: Churina, Svetlana, et al.
Published: (2025)
by: Churina, Svetlana, et al.
Published: (2025)
The Effects of Mindfulness on Shame: Exploring Mediation by Cognitive Flexibility and Self‐Compassion in a Chinese Adult Population
by: Xiaoshuo Zhang, et al.
Published: (2024)
by: Xiaoshuo Zhang, et al.
Published: (2024)
Probing then Editing Response Personality of Large Language Models
by: Ju, Tianjie, et al.
Published: (2025)
by: Ju, Tianjie, et al.
Published: (2025)
Towards Robust Out-of-Distribution Generalization Bounds via Sharpness
by: Zou, Yingtian, et al.
Published: (2024)
by: Zou, Yingtian, et al.
Published: (2024)
Cross-Domain Feature Augmentation for Domain Generalization
by: Liu, Yingnan, et al.
Published: (2024)
by: Liu, Yingnan, et al.
Published: (2024)
MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
by: Huang, Zixian, et al.
Published: (2024)
by: Huang, Zixian, et al.
Published: (2024)
Watch Out Your Album! On the Inadvertent Privacy Memorization in Multi-Modal Large Language Models
by: Ju, Tianjie, et al.
Published: (2025)
by: Ju, Tianjie, et al.
Published: (2025)
QIME: Constructing Interpretable Medical Text Embeddings via Ontology-Grounded Questions
by: Tang, Yixuan, et al.
Published: (2026)
by: Tang, Yixuan, et al.
Published: (2026)
Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models
by: Qian, Zhe, et al.
Published: (2026)
by: Qian, Zhe, et al.
Published: (2026)
Beyond Context to Cognitive Appraisal: Emotion Reasoning as a Theory of Mind Benchmark for Large Language Models
by: Yeo, Gerard Christopher, et al.
Published: (2025)
by: Yeo, Gerard Christopher, et al.
Published: (2025)
COKE: A Cognitive Knowledge Graph for Machine Theory of Mind
by: Wu, Jincenzi, et al.
Published: (2023)
by: Wu, Jincenzi, et al.
Published: (2023)
Dynamic Emotion and Personality Profiling for Multimodal Deception Detection
by: Zheng, Li, et al.
Published: (2026)
by: Zheng, Li, et al.
Published: (2026)
Enhancing Thermal Conductive Properties With Liquid Metal‐Assisted Epoxy Resin Composites
by: Junyan Wang, et al.
Published: (2025)
by: Junyan Wang, et al.
Published: (2025)
When Disagreements Elicit Robustness: Investigating Self-Repair Capabilities under LLM Multi-Agent Disagreements
by: Ju, Tianjie, et al.
Published: (2025)
by: Ju, Tianjie, et al.
Published: (2025)
M$^{3}$D: A Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction
by: Liu, Jiang, et al.
Published: (2024)
by: Liu, Jiang, et al.
Published: (2024)
Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models
by: Chen, Zhawnen, et al.
Published: (2024)
by: Chen, Zhawnen, et al.
Published: (2024)
Similar Items
-
FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents
by: Li, Bobo, et al.
Published: (2025) -
PanoSent: A Panoptic Sextuple Extraction Benchmark for Multimodal Conversational Aspect-based Sentiment Analysis
by: Luo, Meng, et al.
Published: (2024) -
Orthogonal Spatial-temporal Distributional Transfer for 4D Generation
by: Liu, Wei, et al.
Published: (2026) -
Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition
by: Fei, Hao, et al.
Published: (2024) -
Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment
by: Li, Bobo, et al.
Published: (2026)