Guardado en:
| Autores principales: | Lin, Zhiyu, Gao, Yifei, Zhao, Xian, Yang, Yunfan, Sang, Jitao |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2503.18071 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models
por: Gao, Yifei, et al.
Publicado: (2024)
por: Gao, Yifei, et al.
Publicado: (2024)
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
por: Tu, Yahan, et al.
Publicado: (2024)
por: Tu, Yahan, et al.
Publicado: (2024)
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
por: Wu, Wenshan, et al.
Publicado: (2024)
por: Wu, Wenshan, et al.
Publicado: (2024)
Self-Guided Defense: Adaptive Safety Alignment for Reasoning Models via Synthesized Guidelines
por: Wang, Yuhang, et al.
Publicado: (2025)
por: Wang, Yuhang, et al.
Publicado: (2025)
Debiasing Vison-Language Models with Text-Only Training
por: Yang, Yunfan, et al.
Publicado: (2024)
por: Yang, Yunfan, et al.
Publicado: (2024)
Positional Failures in Long-Context LLMs: A Blind Spot in Reasoning Benchmarks
por: Zhang, Chuyifei, et al.
Publicado: (2026)
por: Zhang, Chuyifei, et al.
Publicado: (2026)
Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space
por: Liu, Chengzhi, et al.
Publicado: (2025)
por: Liu, Chengzhi, et al.
Publicado: (2025)
Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning
por: Hua, Jiacheng, et al.
Publicado: (2026)
por: Hua, Jiacheng, et al.
Publicado: (2026)
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond
por: Qu, Xiaoye, et al.
Publicado: (2025)
por: Qu, Xiaoye, et al.
Publicado: (2025)
EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models
por: Zhang, Yifei, et al.
Publicado: (2026)
por: Zhang, Yifei, et al.
Publicado: (2026)
Constrained Reasoning Chains for Enhancing Theory-of-Mind in Large Language Models
por: Lin, Zizheng, et al.
Publicado: (2024)
por: Lin, Zizheng, et al.
Publicado: (2024)
CodeMind: Evaluating Large Language Models for Code Reasoning
por: Liu, Changshu, et al.
Publicado: (2024)
por: Liu, Changshu, et al.
Publicado: (2024)
\texttt{ReMind}: Understanding Deductive Code Reasoning in LLMs
por: Gao, Jun, et al.
Publicado: (2025)
por: Gao, Jun, et al.
Publicado: (2025)
Can Pruning Improve Reasoning? Revisiting Long-CoT Compression with Capability in Mind for Better Reasoning
por: Zhao, Shangziqi, et al.
Publicado: (2025)
por: Zhao, Shangziqi, et al.
Publicado: (2025)
Taming the Thinker: Conditional Entropy Shaping for Adaptive LLM Reasoning
por: Wei, Shuyu, et al.
Publicado: (2026)
por: Wei, Shuyu, et al.
Publicado: (2026)
Exploring the Reasoning Abilities of Multimodal Large Language Models (MLLMs): A Comprehensive Survey on Emerging Trends in Multimodal Reasoning
por: Wang, Yiqi, et al.
Publicado: (2024)
por: Wang, Yiqi, et al.
Publicado: (2024)
ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization
por: Li, Sunzhu, et al.
Publicado: (2025)
por: Li, Sunzhu, et al.
Publicado: (2025)
VaLiD: Mitigating the Hallucination of Large Vision Language Models by Visual Layer Fusion Contrastive Decoding
por: Wang, Jiaqi, et al.
Publicado: (2024)
por: Wang, Jiaqi, et al.
Publicado: (2024)
Debiased Prompt Tuning in Vision-Language Model without Annotations
por: Jiang, Chaoquan, et al.
Publicado: (2025)
por: Jiang, Chaoquan, et al.
Publicado: (2025)
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models
por: Kim, Hyunwoo, et al.
Publicado: (2025)
por: Kim, Hyunwoo, et al.
Publicado: (2025)
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models
por: Sun, Haoyuan, et al.
Publicado: (2025)
por: Sun, Haoyuan, et al.
Publicado: (2025)
GRASP: Grounded CoT Reasoning with Dual-Stage Optimization for Multimodal Sarcasm Target Identification
por: Wan, Faxian, et al.
Publicado: (2026)
por: Wan, Faxian, et al.
Publicado: (2026)
Parrot Mind: Towards Explaining the Complex Task Reasoning of Pretrained Large Language Models with Template-Content Structure
por: Yang, Haotong, et al.
Publicado: (2023)
por: Yang, Haotong, et al.
Publicado: (2023)
Exploring Chain-of-Thought Reasoning for Steerable Pluralistic Alignment
por: Zhang, Yunfan, et al.
Publicado: (2025)
por: Zhang, Yunfan, et al.
Publicado: (2025)
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning
por: Guo, Dongyang, et al.
Publicado: (2025)
por: Guo, Dongyang, et al.
Publicado: (2025)
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models
por: Stogiannidis, Ilias, et al.
Publicado: (2025)
por: Stogiannidis, Ilias, et al.
Publicado: (2025)
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
por: Yan, Qianqi, et al.
Publicado: (2025)
por: Yan, Qianqi, et al.
Publicado: (2025)
From Reasoning to Learning: A Survey on Hypothesis Discovery and Rule Learning with Large Language Models
por: He, Kaiyu, et al.
Publicado: (2025)
por: He, Kaiyu, et al.
Publicado: (2025)
RRTL: Red Teaming Reasoning Large Language Models in Tool Learning
por: Liu, Yifei, et al.
Publicado: (2025)
por: Liu, Yifei, et al.
Publicado: (2025)
Guiding Clinical Reasoning with Large Language Models via Knowledge Seeds
por: WU, Jiageng, et al.
Publicado: (2024)
por: WU, Jiageng, et al.
Publicado: (2024)
Are Vision Language Models Cross-Cultural Theory of Mind Reasoners?
por: Nazi, Zabir Al, et al.
Publicado: (2025)
por: Nazi, Zabir Al, et al.
Publicado: (2025)
Multimodal Chain-of-Thought Reasoning in Language Models
por: Zhang, Zhuosheng, et al.
Publicado: (2023)
por: Zhang, Zhuosheng, et al.
Publicado: (2023)
MUR: Momentum Uncertainty guided Reasoning for Large Language Models
por: Yan, Hang, et al.
Publicado: (2025)
por: Yan, Hang, et al.
Publicado: (2025)
Probabilistic Concept Graph Reasoning for Multimodal Misinformation Detection
por: Yang, Ruichao, et al.
Publicado: (2026)
por: Yang, Ruichao, et al.
Publicado: (2026)
WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
por: Lin, Zhiyu, et al.
Publicado: (2025)
por: Lin, Zhiyu, et al.
Publicado: (2025)
MSA at ImageCLEF 2025 Multimodal Reasoning: Multilingual Multimodal Reasoning With Ensemble Vision Language Models
por: Ahmed, Seif, et al.
Publicado: (2025)
por: Ahmed, Seif, et al.
Publicado: (2025)
AstroMind: A High-Fidelity Benchmark for Spacecraft Behavior Reasoning Based on Large Language Models
por: Liu, Hao, et al.
Publicado: (2026)
por: Liu, Hao, et al.
Publicado: (2026)
MemeMind: A Large-Scale Multimodal Dataset with Chain-of-Thought Reasoning for Harmful Meme Detection
por: Gu, Hexiang, et al.
Publicado: (2025)
por: Gu, Hexiang, et al.
Publicado: (2025)
Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models
por: Wang, Yuqing, et al.
Publicado: (2023)
por: Wang, Yuqing, et al.
Publicado: (2023)
LocalBench: Benchmarking LLMs on County-Level Local Knowledge and Reasoning
por: Gao, Zihan, et al.
Publicado: (2025)
por: Gao, Zihan, et al.
Publicado: (2025)
Ejemplares similares
-
AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models
por: Gao, Yifei, et al.
Publicado: (2024) -
ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
por: Tu, Yahan, et al.
Publicado: (2024) -
Mind's Eye of LLMs: Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models
por: Wu, Wenshan, et al.
Publicado: (2024) -
Self-Guided Defense: Adaptive Safety Alignment for Reasoning Models via Synthesized Guidelines
por: Wang, Yuhang, et al.
Publicado: (2025) -
Debiasing Vison-Language Models with Text-Only Training
por: Yang, Yunfan, et al.
Publicado: (2024)