| Main Authors: | Zeng, Boyi, Hao, Yiqin, Li, He, Song, Shixiang, Song, Feichen, Wang, Zitong, Huang, Siyuan, Xu, Yi, He, ZiWei, Wang, Xinbing, Lin, Zhouhan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.08220 |
Similar Items
PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous Space
by: Zeng, Boyi, et al.
Published: (2025)
AdaPonderLM: Gated Pondering Language Models with Token-Wise Adaptive Depth
by: Song, Shixiang, et al.
Published: (2026)
PonderLM-3: Adaptive Token-Wise Pondering with Differentiable Masking
by: Li, He, et al.
Published: (2026)
PonderLM: Pretraining Language Models to Ponder in Continuous Space
by: Zeng, Boyi, et al.
Published: (2025)
AWM: Accurate Weight-Matrix Fingerprint for Large Language Models
by: Zeng, Boyi, et al.
Published: (2025)
Graph Parsing Networks
by: Song, Yunchong, et al.
Published: (2024)
HuRef: HUman-REadable Fingerprint for Large Language Models
by: Zeng, Boyi, et al.
Published: (2023)
Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention
by: Huang, Siyuan, et al.
Published: (2024)
FreqKV: Key-Value Compression in Frequency Domain for Context Window Extension
by: Kai, Jushi, et al.
Published: (2025)
Diagnosing Memorization in Chain-of-Thought Reasoning, One Token at a Time
by: Li, Huihan, et al.
Published: (2025)
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
by: He, Ziwei, et al.
Published: (2023)
Effects of Perceived Control in the Relationship Between Psychological Distress and Posttraumatic Growth After Glioma Diagnosis: A Longitudinal Mediation Analysis
by: Xu ZiWei, et al.
Published: (2025)
Flow of Spans: Generalizing Language Models to Dynamic Span-Vocabulary via GFlowNets
by: Xue, Bo, et al.
Published: (2026)
One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache
by: Lu, Liming, et al.
Published: (2026)
CoT-X: An Adaptive Framework for Cross-Model Chain-of-Thought Transfer and Optimization
by: Bi, Ziqian, et al.
Published: (2025)
LLM Reasoning Is Latent, Not the Chain of Thought
by: Wang, Wenshuo
Published: (2026)
Context-level Language Modeling by Learning Predictive Context Embeddings
by: Dai, Beiya, et al.
Published: (2025)
Latent Chain-of-Thought as Planning: Decoupling Reasoning from Verbalization
by: Wang, Jiecong, et al.
Published: (2026)
Towards Controlled Table-to-Text Generation with Scientific Reasoning
by: Guo, Zhixin, et al.
Published: (2023)
Fourier Compressor: Frequency-Domain Visual Token Compression for Vision-Language Models
by: Wang, Huanyu, et al.
Published: (2025)
Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN
by: Xu, Yao, et al.
Published: (2025)
GeoGalactica: A Scientific Large Language Model in Geoscience
by: Lin, Zhouhan, et al.
Published: (2023)
Accelerating Structured Chain-of-Thought in Autonomous Vehicles
by: Gu, Yi, et al.
Published: (2026)
Learning Modal-Mixed Chain-of-Thought Reasoning with Latent Embeddings
by: Shao, Yifei, et al.
Published: (2026)
Prehabilitation enhances functional and structural recovery following anterior cruciate ligament reconstruction: A randomized controlled trial
by: Yuping Fu, et al.
Published: (2025)
Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought
by: Li, Yunheng, et al.
Published: (2026)
Toward the $p$-adic Hodge parameters in the potentially crystalline representations of $\mathrm{GL}_n$
by: He, Yiqin
Published: (2026)
Companion points and locally analytic socle conjecture for Steinberg case
by: He, Yiqin
Published: (2024)
Towards the $p$-adic Hodge parameters in semistable representations of $\mathrm{GL}_n(\mathrm{Q}_p)$
by: He, Yiqin
Published: (2026)
Chain-of-Thought Tokens are Computer Program Variables
by: Zhu, Fangwei, et al.
Published: (2025)
Latent Chain-of-Thought for Visual Reasoning
by: Sun, Guohao, et al.
Published: (2025)
FaithCoT-Bench: Benchmarking Instance-Level Faithfulness of Chain-of-Thought Reasoning
by: Shen, Xu, et al.
Published: (2025)
Mirror-Consistency: Harnessing Inconsistency in Majority Voting
by: Huang, Siyuan, et al.
Published: (2024)
Investigating the Fundamental Limit: A Feasibility Study of Hybrid-Neural Archival
by: Armstrong, Marcus, et al.
Published: (2026)
Towards Enhanced Image Generation Via Multi-modal Chain of Thought in Unified Generative Models
by: Wang, Yi, et al.
Published: (2025)
Imbalanced Graph-Level Anomaly Detection via Counterfactual Augmentation and Feature Learning
by: Wang, Zitong, et al.
Published: (2024)
Mitigating Premature Discretization with Progressive Quantization for Robust Vector Tokenization
by: Zhao, Wenhao, et al.
Published: (2026)
Rethinking Chain-of-Thought Reasoning for Videos
by: Zhong, Yiwu, et al.
Published: (2025)
Inference-Time Chain-of-Thought Pruning with Latent Informativeness Signals
by: Li, Sophie, et al.
Published: (2025)
SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens
by: He, Yinhan, et al.
Published: (2025)