:: Library Catalog

Bibliographic Details
Main Authors:	Rose, Daniel; Himakunthala, Vaishnavi; Ouyang, Andy; He, Ryan; Mei, Alex; Lu, Yujie; Saxon, Michael; Sonar, Chinmay; Mirza, Diba; Wang, William Yang
Format:	Preprint
Published:	2023
Subjects:	Computation and Language; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2305.02317

Similar Items

Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought
by: Cheng, Zihui, et al.
Published: (2025)

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning
by: Xu, Haolei, et al.
Published: (2025)

Multimodal Chain-of-Thought Reasoning in Language Models
by: Zhang, Zhuosheng, et al.
Published: (2023)

Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning
by: Lin, Jingyang, et al.
Published: (2025)

Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning
by: Ou, Siqu, et al.
Published: (2025)

Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts
by: Sharma, Aditya, et al.
Published: (2024)

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
by: Hu, Yushi, et al.
Published: (2024)

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning
by: Shi, Weikang, et al.
Published: (2025)

Bridging the Gap Between Multimodal Foundation Models and World Models
by: He, Xuehai
Published: (2025)

Faithful Logical Reasoning via Symbolic Chain-of-Thought
by: Xu, Jundong, et al.
Published: (2024)

Autoencoder-based Dimensionality Reduction for Accelerating the Solution of Nonlinear Time-Dependent PDEs: Transport in Porous Media with Reactions
by: Behnoudfar, Diba
Published: (2025)

Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)
by: Saxon, Michael, et al.
Published: (2024)

The Expressive Power of Transformers with Chain of Thought
by: Merrill, William, et al.
Published: (2023)

Cantor: Inspiring Multimodal Chain-of-Thought of MLLM
by: Gao, Timin, et al.
Published: (2024)

Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling
by: Zhang, Min, et al.
Published: (2024)

Logical Modelling in CS Education: Bridging the Natural Language Gap
by: Kneisel, Tristan, et al.
Published: (2025)

Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions
by: Yanuka, Moran, et al.
Published: (2024)

Generative Visual Chain-of-Thought for Image Editing
by: Yin, Zijin, et al.
Published: (2026)

Chain-of-Thought Degrades Visual Spatial Reasoning Capabilities of Multimodal LLMs
by: Kancheti, Sai Srinivas, et al.
Published: (2026)

A Logically Consistent Chain-of-Thought Approach for Stance Detection
by: Zhang, Bowen, et al.
Published: (2023)

Render-of-Thought: Rendering Textual Chain-of-Thought as Images for Visual Latent Reasoning
by: Wang, Yifan, et al.
Published: (2026)

S-Chain: Structured Visual Chain-of-Thought For Medicine
by: Le-Duc, Khai, et al.
Published: (2025)

Latent Chain-of-Thought for Visual Reasoning
by: Sun, Guohao, et al.
Published: (2025)

Non-Interactive Symbolic-Aided Chain-of-Thought for Logical Reasoning
by: Nguyen, Phuong Minh, et al.
Published: (2025)

Psy-Copilot: Visual Chain of Thought for Counseling
by: Chen, Keqi, et al.
Published: (2025)

A Multimodal Fusion Framework for Bridge Defect Detection with Cross-Verification
by: Rachuri, Ravi Datta, et al.
Published: (2024)

Compositional Chain-of-Thought Prompting for Large Multimodal Models
by: Mitra, Chancharik, et al.
Published: (2023)

GeoChain: Multimodal Chain-of-Thought for Geographic Reasoning
by: Yerramilli, Sahiti, et al.
Published: (2025)

Shape of Thought: Progressive Object Assembly via Visual Chain-of-Thought
by: Huo, Yu, et al.
Published: (2026)

MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification
by: Sun, Linzhuang, et al.
Published: (2025)

Teaching Models to Verbalize Reward Hacking in Chain-of-Thought Reasoning
by: Turpin, Miles, et al.
Published: (2025)

LogicCat: A Chain-of-Thought Text-to-SQL Benchmark for Complex Reasoning
by: Liu, Tao, et al.
Published: (2025)

MM-CoT: A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models
by: Zhang, Jusheng, et al.
Published: (2025)

Revisiting Overthinking in Long Chain-of-Thought from the Perspective of Self-Doubt
by: Peng, Keqin, et al.
Published: (2025)

EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models
by: Dai, Xuanlang, et al.
Published: (2026)

Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic
by: Zhao, Xufeng, et al.
Published: (2023)

RGBX-R1: Visual Modality Chain-of-Thought Guided Reinforcement Learning for Multimodal Grounding
by: Wu, Jiahe, et al.
Published: (2026)

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought
by: Peng, Yi, et al.
Published: (2025)

Bridging Semantic Logic Gaps: A Cognition Inspired Multimodal Boundary Preserving Network for Image Manipulation Localization
by: Li, Songlin, et al.
Published: (2025)

Understanding and Mitigating Hallucinations in Multimodal Chain-of-Thought Models
by: Ma, Ji, et al.
Published: (2026)