| Main Authors: | Lv, Bo, Sun, Yasheng, Wang, Junjie, Shi, Haoxiang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.13738 |
Similar Items
Latent Chain-of-Thought for Visual Reasoning
by: Sun, Guohao, et al.
Published: (2025)
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens
by: Yang, Zeyuan, et al.
Published: (2025)
Compressing Sequences in the Latent Embedding Space: $K$-Token Merging for Large Language Models
by: Xu, Zihao, et al.
Published: (2026)
Monet: Reasoning in Latent Visual Space Beyond Images and Language
by: Wang, Qixun, et al.
Published: (2025)
ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both
by: Guo, Ziyu, et al.
Published: (2026)
Beyond Visual Memory: Mechanistic Diagnostics of Latent Visual Reasoning
by: Guo, Garvin, et al.
Published: (2026)
OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework
by: Chen, Ben, et al.
Published: (2026)
LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs
by: Krojer, Benno, et al.
Published: (2026)
LLM Reasoning Is Latent, Not the Chain of Thought
by: Wang, Wenshuo
Published: (2026)
RuPLaR : Efficient Latent Compression of LLM Reasoning Chains with Rule-Based Priors From Multi-Step to One-Step
by: Luo, Xiaocheng, et al.
Published: (2026)
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
by: Su, DiJia, et al.
Published: (2025)
The Geometric Reasoner: Manifold-Informed Latent Foresight Search for Long-Context Reasoning
by: Zhuang, Ren, et al.
Published: (2026)
Exploring Token-Space Manipulation in Latent Audio Tokenizers
by: Paissan, Francesco, et al.
Published: (2026)
MedLVR: Latent Visual Reasoning for Reliable Medical Visual Question Answering
by: Xi, Suyang, et al.
Published: (2026)
LatentPilot: Scene-Aware Vision-and-Language Navigation by Dreaming Ahead with Latent Visual Reasoning
by: Hao, Haihong, et al.
Published: (2026)
Internalizing LLM Reasoning via Discovery and Replay of Latent Actions
by: Shi, Zhenning, et al.
Published: (2026)
Reinforced Latent Reasoning for LLM-based Recommendation
by: Zhang, Yang, et al.
Published: (2025)
What's Holding Back Latent Visual Reasoning?
by: Viveiros, André G., et al.
Published: (2026)
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
by: Ye, Xinwu, et al.
Published: (2026)
Adaptive Compression of the Latent Space in Variational Autoencoders
by: Sejnova, Gabriela, et al.
Published: (2023)
VITAL: Visual-Semantic Dual Supervision for Enhanced and Interpretable Latent Reasoning in Medical MLLMs
by: Li, Qiaoru, et al.
Published: (2026)
MemoSight: Unifying Context Compression and Multi Token Prediction for Reasoning Acceleration
by: Liu, Xinyu, et al.
Published: (2026)
Learning Structural Latent Points for Efficient Visual Representations in Robotic Manipulation
by: Jiang, Yicheng, et al.
Published: (2026)
Mull-Tokens: Modality-Agnostic Latent Thinking
by: Ray, Arijit, et al.
Published: (2025)
Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented Design
by: Sun, Haoxiang, et al.
Published: (2026)
Compressed Convolutional Attention: Efficient Attention in a Compressed Latent Space
by: Figliolia, Tomas, et al.
Published: (2025)
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs
by: Shi, Dachuan, et al.
Published: (2025)
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning
by: Xu, Zifan, et al.
Published: (2023)
Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
by: Zhang, Bowen, et al.
Published: (2024)
LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning
by: Wu, Linquan, et al.
Published: (2026)
Latent Reasoning with Supervised Thinking States
by: Amos, Ido, et al.
Published: (2026)
ActivationReasoning: Logical Reasoning in Latent Activation Spaces
by: Helff, Lukas, et al.
Published: (2025)
Reasoning Path and Latent State Analysis for Multi-view Visual Spatial Reasoning: A Cognitive Science Perspective
by: Xue, Qiyao, et al.
Published: (2025)
One-Token Verification for Reasoning Correctness Estimation
by: Zhuang, Zhan, et al.
Published: (2026)
Learning Modal-Mixed Chain-of-Thought Reasoning with Latent Embeddings
by: Shao, Yifei, et al.
Published: (2026)
Visual Text Compression as Measure Transport
by: Tang, Lv, et al.
Published: (2026)
Reasoning as Energy Minimization over Structured Latent Trajectories
by: Johansson, David K.
Published: (2026)
Dual Latent Memory for Visual Multi-agent System
by: Yu, Xinlei, et al.
Published: (2026)
GMapLatent: Geometric Mapping in Latent Space
by: Zeng, Wei, et al.
Published: (2025)
The Thinking Pixel: Recursive Sparse Reasoning in Multimodal Diffusion Latents
by: Sun, Yuwei, et al.
Published: (2026)