Saved in:
| Main Authors: | Lou, Chenwei, Sun, Zewei, Liang, Xinnian, Qu, Meng, Shen, Wei, Wang, Wenqi, Li, Yuntao, Yang, Qingping, Wu, Shuangzhi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.11896 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs
by: He, Feng, et al.
Published: (2025)
by: He, Feng, et al.
Published: (2025)
AdaMCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Multilingual Chain-of-Thought
by: Zheng, Weihua, et al.
Published: (2025)
by: Zheng, Weihua, et al.
Published: (2025)
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
by: Wen, Yibo, et al.
Published: (2024)
by: Wen, Yibo, et al.
Published: (2024)
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
by: Wang, Bing, et al.
Published: (2023)
by: Wang, Bing, et al.
Published: (2023)
AdaMuon: Adaptive Muon Optimizer
by: Si, Chongjie, et al.
Published: (2025)
by: Si, Chongjie, et al.
Published: (2025)
Crystal-KV: Efficient KV Cache Management for Chain-of-Thought LLMs via Answer-First Principle
by: Wang, Zihan, et al.
Published: (2026)
by: Wang, Zihan, et al.
Published: (2026)
Adaptive Laser Modulation Strategy for Femtosecond Laser Surface Texturing of Uniform Microcorner Features
by: Wenqi Ma, et al.
Published: (2025)
by: Wenqi Ma, et al.
Published: (2025)
EntroCoT: Enhancing Chain-of-Thought via Adaptive Entropy-Guided Segmentation
by: Li, Zihang, et al.
Published: (2026)
by: Li, Zihang, et al.
Published: (2026)
From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning
by: Li, Shuangzhi, et al.
Published: (2025)
by: Li, Shuangzhi, et al.
Published: (2025)
Generalizable Pareto-Optimal Offloading with Reinforcement Learning in Mobile Edge Computing
by: Yang, Ning, et al.
Published: (2025)
by: Yang, Ning, et al.
Published: (2025)
DRT: Deep Reasoning Translation via Long Chain-of-Thought
by: Wang, Jiaan, et al.
Published: (2024)
by: Wang, Jiaan, et al.
Published: (2024)
RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning
by: Xu, Song, et al.
Published: (2025)
by: Xu, Song, et al.
Published: (2025)
Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning
by: Xu, Haolei, et al.
Published: (2025)
by: Xu, Haolei, et al.
Published: (2025)
Stop When Enough: Adaptive Early-Stopping for Chain-of-Thought Reasoning
by: Sun, Renliang, et al.
Published: (2025)
by: Sun, Renliang, et al.
Published: (2025)
Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization
by: Luo, Haotian, et al.
Published: (2025)
by: Luo, Haotian, et al.
Published: (2025)
UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts
by: Yang, Bo, et al.
Published: (2024)
by: Yang, Bo, et al.
Published: (2024)
Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting
by: Fu, Jinhu, et al.
Published: (2026)
by: Fu, Jinhu, et al.
Published: (2026)
AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
by: Wei, Zhepei, et al.
Published: (2025)
by: Wei, Zhepei, et al.
Published: (2025)
CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems
by: Wen, Yan, et al.
Published: (2025)
by: Wen, Yan, et al.
Published: (2025)
AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN
by: Shao, Wei, et al.
Published: (2025)
by: Shao, Wei, et al.
Published: (2025)
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
by: Huang, Sili, et al.
Published: (2024)
by: Huang, Sili, et al.
Published: (2024)
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
by: Shen, Maohao, et al.
Published: (2025)
by: Shen, Maohao, et al.
Published: (2025)
On the Equivalence of Synchronous Coordination Game and Asynchronous Coordination Design
by: Pan, Xinnian Kazusa
Published: (2024)
by: Pan, Xinnian Kazusa
Published: (2024)
AdaFuse: Adaptive Multimodal Fusion for Lung Cancer Risk Prediction via Reinforcement Learning
by: Qu, Chongyu, et al.
Published: (2026)
by: Qu, Chongyu, et al.
Published: (2026)
Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2026)
by: Chen, Yiqun, et al.
Published: (2026)
SurgCoT: Advancing Spatiotemporal Reasoning in Surgical Videos through a Chain-of-Thought Benchmark
by: Wang, Gui, et al.
Published: (2026)
by: Wang, Gui, et al.
Published: (2026)
Optimal Beamforming for Uplink Covert Communication in MIMO GEO Satellite-Terrestrial Systems
by: Guo, Zewei, et al.
Published: (2026)
by: Guo, Zewei, et al.
Published: (2026)
Pareto-Optimal Offline Reinforcement Learning via Smooth Tchebysheff Scalarization
by: Bhatnagar, Aadyot, et al.
Published: (2026)
by: Bhatnagar, Aadyot, et al.
Published: (2026)
RecCoT: Enhancing Recommendation via Chain-of-Thought
by: Yang, Shuo, et al.
Published: (2025)
by: Yang, Shuo, et al.
Published: (2025)
AdaTSQ: Pushing the Pareto Frontier of Diffusion Transformers via Temporal-Sensitivity Quantization
by: Zhang, Shaoqiu, et al.
Published: (2026)
by: Zhang, Shaoqiu, et al.
Published: (2026)
SIM-CoT: Supervised Implicit Chain-of-Thought
by: Wei, Xilin, et al.
Published: (2025)
by: Wei, Xilin, et al.
Published: (2025)
Intention Chain-of-Thought Prompting with Dynamic Routing for Code Generation
by: Li, Shen, et al.
Published: (2025)
by: Li, Shen, et al.
Published: (2025)
CoTEvol: Self-Evolving Chain-of-Thoughts for Data Synthesis in Mathematical Reasoning
by: Wang, Zhuo, et al.
Published: (2026)
by: Wang, Zhuo, et al.
Published: (2026)
AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving
by: Luo, Yuechen, et al.
Published: (2025)
by: Luo, Yuechen, et al.
Published: (2025)
CoT-X: An Adaptive Framework for Cross-Model Chain-of-Thought Transfer and Optimization
by: Bi, Ziqian, et al.
Published: (2025)
by: Bi, Ziqian, et al.
Published: (2025)
RULE: Reinforcement UnLEarning Achieves Forget-Retain Pareto Optimality
by: Zhang, Chenlong, et al.
Published: (2025)
by: Zhang, Chenlong, et al.
Published: (2025)
Thoughts-as-Planning: Latent World Models for Chain-of-Thoughts Optimization via Reinforcement Planning
by: Liu, Dong, et al.
Published: (2026)
by: Liu, Dong, et al.
Published: (2026)
ExpThink: Experience-Guided Reinforcement Learning for Adaptive Chain-of-Thought Compression
by: Bian, Tingcheng, et al.
Published: (2026)
by: Bian, Tingcheng, et al.
Published: (2026)
Clear Chain-of-Thought (ClearCoT)
by: Sagić, Andrija
Published: (2026)
by: Sagić, Andrija
Published: (2026)
CoT-Evo: Evolutionary Distillation of Chain-of-Thought for Scientific Reasoning
by: Feng, Kehua, et al.
Published: (2025)
by: Feng, Kehua, et al.
Published: (2025)
Similar Items
-
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs
by: He, Feng, et al.
Published: (2025) -
AdaMCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Multilingual Chain-of-Thought
by: Zheng, Weihua, et al.
Published: (2025) -
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
by: Wen, Yibo, et al.
Published: (2024) -
SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
by: Wang, Bing, et al.
Published: (2023) -
AdaMuon: Adaptive Muon Optimizer
by: Si, Chongjie, et al.
Published: (2025)