Saved in:
| Main Authors: | Xie, Wenya, Shaochen, Zhong, Le, Hoang Anh Duy, Xu, Zhaozhuo, Xie, Jianwen, Liu, Zirui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.00536 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic
by: Wu, Yuheng, et al.
Published: (2025)
by: Wu, Yuheng, et al.
Published: (2025)
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
by: Yuan, Jiayi, et al.
Published: (2024)
by: Yuan, Jiayi, et al.
Published: (2024)
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
by: Liu, Zirui, et al.
Published: (2024)
by: Liu, Zirui, et al.
Published: (2024)
Do LLMs Know to Respect Copyright Notice?
by: Xu, Jialiang, et al.
Published: (2024)
by: Xu, Jialiang, et al.
Published: (2024)
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
by: Luo, Feng, et al.
Published: (2025)
by: Luo, Feng, et al.
Published: (2025)
Winner-Take-All Column Row Sampling for Memory Efficient Adaptation of Language Model
by: Liu, Zirui, et al.
Published: (2023)
by: Liu, Zirui, et al.
Published: (2023)
Scout Before You Attend: Sketch-and-Walk Sparse Attention for Efficient LLM Inference
by: Le, Hoang Anh Duy, et al.
Published: (2026)
by: Le, Hoang Anh Duy, et al.
Published: (2026)
A Neural Model for Word Repetition
by: Dager, Daniel, et al.
Published: (2025)
by: Dager, Daniel, et al.
Published: (2025)
Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models
by: Wu, Yuheng, et al.
Published: (2025)
by: Wu, Yuheng, et al.
Published: (2025)
Large Language Models Know What Makes Exemplary Contexts
by: Long, Quanyu, et al.
Published: (2024)
by: Long, Quanyu, et al.
Published: (2024)
On Repetitive Finite Automata with Translucent Words
by: Mráz, František, et al.
Published: (2025)
by: Mráz, František, et al.
Published: (2025)
Does Continued Pretraining on a Learner Corpus Improve Automated Essay Scoring on English Proficiency Tests? Evidence from EFCAMDAT
by: Nguyen, Duy Anh
Published: (2026)
by: Nguyen, Duy Anh
Published: (2026)
Understanding the Emergence of Seemingly Useless Features in Next-Token Predictors
by: Rofin, Mark, et al.
Published: (2026)
by: Rofin, Mark, et al.
Published: (2026)
The Diminishing Returns of Early-Exit Decoding in Modern LLMs
by: Wei, Rui, et al.
Published: (2026)
by: Wei, Rui, et al.
Published: (2026)
XAI-enhanced Comparative Opinion Mining via Aspect-based Scoring and Semantic Reasoning
by: Le, Ngoc-Quang, et al.
Published: (2026)
by: Le, Ngoc-Quang, et al.
Published: (2026)
OAT-Rephrase: Optimization-Aware Training Data Rephrasing for Zeroth-Order LLM Fine-Tuning
by: Long, Jikai, et al.
Published: (2025)
by: Long, Jikai, et al.
Published: (2025)
What Do Self-Supervised Speech Models Know About Words?
by: Pasad, Ankita, et al.
Published: (2023)
by: Pasad, Ankita, et al.
Published: (2023)
Schema Key Wording as an Instruction Channel in Structured Generation under Constrained Decoding
by: Le, Yifan
Published: (2026)
by: Le, Yifan
Published: (2026)
Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers
by: Chen, Xin, et al.
Published: (2026)
by: Chen, Xin, et al.
Published: (2026)
GraphDancer: Training LLMs to Explore and Reason over Graphs via Two-Stage Curriculum Post-Training
by: Bai, Yuyang, et al.
Published: (2026)
by: Bai, Yuyang, et al.
Published: (2026)
Uncertainty-Aware Budget Allocation for Adaptive Test-Time Reasoning
by: Nguyen, Manh, et al.
Published: (2026)
by: Nguyen, Manh, et al.
Published: (2026)
Self-Verification Dilemma: Experience-Driven Suppression of Overused Checking in LLM Reasoning
by: Long, Quanyu, et al.
Published: (2026)
by: Long, Quanyu, et al.
Published: (2026)
Draft Model Knows When to Stop: Self-Verification Speculative Decoding for Long-Form Generation
by: Zhang, Ziyin, et al.
Published: (2024)
by: Zhang, Ziyin, et al.
Published: (2024)
Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning
by: Zhu, Tinghui, et al.
Published: (2024)
by: Zhu, Tinghui, et al.
Published: (2024)
Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
by: Nguyen, Cong-Duy, et al.
Published: (2023)
by: Nguyen, Cong-Duy, et al.
Published: (2023)
Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion
by: He, Qiyuan, et al.
Published: (2025)
by: He, Qiyuan, et al.
Published: (2025)
Self-consistent Reasoning For Solving Math Word Problems
by: Xiong, Jing, et al.
Published: (2022)
by: Xiong, Jing, et al.
Published: (2022)
OralMLLM-Bench: Evaluating Cognitive Capabilities of Multimodal Large Language Models in Dental Practice
by: Wang, Rongyang, et al.
Published: (2026)
by: Wang, Rongyang, et al.
Published: (2026)
Interpretable Multimodal Misinformation Detection with Logic Reasoning
by: Liu, Hui, et al.
Published: (2023)
by: Liu, Hui, et al.
Published: (2023)
Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations
by: Zheng, Mingqian, et al.
Published: (2026)
by: Zheng, Mingqian, et al.
Published: (2026)
Three Minds, One Legend: Jailbreak Large Reasoning Model with Adaptive Stacked Ciphers
by: Nguyen, Viet-Anh, et al.
Published: (2025)
by: Nguyen, Viet-Anh, et al.
Published: (2025)
SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning
by: Li, Zheng, et al.
Published: (2025)
by: Li, Zheng, et al.
Published: (2025)
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval
by: Trung, Quang Hoang, et al.
Published: (2024)
by: Trung, Quang Hoang, et al.
Published: (2024)
Understanding and Mitigating Numerical Sources of Nondeterminism in LLM Inference
by: Yuan, Jiayi, et al.
Published: (2025)
by: Yuan, Jiayi, et al.
Published: (2025)
Don't Read Everything: A Curvature-Conditioned Query for Linear Attention
by: Le, Dong, et al.
Published: (2026)
by: Le, Dong, et al.
Published: (2026)
Prompt Repetition Improves Non-Reasoning LLMs
by: Leviathan, Yaniv, et al.
Published: (2025)
by: Leviathan, Yaniv, et al.
Published: (2025)
DTS: Enhancing Large Reasoning Models via Decoding Tree Sketching
by: Xu, Zicheng, et al.
Published: (2025)
by: Xu, Zicheng, et al.
Published: (2025)
Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget
by: Balogh, Peter
Published: (2026)
by: Balogh, Peter
Published: (2026)
Reasoning about Uncertainty: Do Reasoning Models Know When They Don't Know?
by: Mei, Zhiting, et al.
Published: (2025)
by: Mei, Zhiting, et al.
Published: (2025)
To Compress or Not? Pushing the Frontier of Lossless GenAI Model Weights Compression with Exponent Concentration
by: Yang, Zeyu, et al.
Published: (2025)
by: Yang, Zeyu, et al.
Published: (2025)
Similar Items
-
DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic
by: Wu, Yuheng, et al.
Published: (2025) -
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches
by: Yuan, Jiayi, et al.
Published: (2024) -
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
by: Liu, Zirui, et al.
Published: (2024) -
Do LLMs Know to Respect Copyright Notice?
by: Xu, Jialiang, et al.
Published: (2024) -
AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models
by: Luo, Feng, et al.
Published: (2025)