:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lou, Chenwei, Sun, Zewei, Liang, Xinnian, Qu, Meng, Shen, Wei, Wang, Wenqi, Li, Yuntao, Yang, Qingping, Wu, Shuangzhi
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.11896
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs
by: He, Feng, et al.
Published: (2025)

AdaMCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Multilingual Chain-of-Thought
by: Zheng, Weihua, et al.
Published: (2025)

Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
by: Wen, Yibo, et al.
Published: (2024)

SCM: Enhancing Large Language Model with Self-Controlled Memory Framework
by: Wang, Bing, et al.
Published: (2023)

AdaMuon: Adaptive Muon Optimizer
by: Si, Chongjie, et al.
Published: (2025)

Crystal-KV: Efficient KV Cache Management for Chain-of-Thought LLMs via Answer-First Principle
by: Wang, Zihan, et al.
Published: (2026)

Adaptive Laser Modulation Strategy for Femtosecond Laser Surface Texturing of Uniform Microcorner Features
by: Wenqi Ma, et al.
Published: (2025)

EntroCoT: Enhancing Chain-of-Thought via Adaptive Entropy-Guided Segmentation
by: Li, Zihang, et al.
Published: (2026)

From Dataset to Real-world: General 3D Object Detection via Generalized Cross-domain Few-shot Learning
by: Li, Shuangzhi, et al.
Published: (2025)

Generalizable Pareto-Optimal Offloading with Reinforcement Learning in Mobile Edge Computing
by: Yang, Ning, et al.
Published: (2025)

DRT: Deep Reasoning Translation via Long Chain-of-Thought
by: Wang, Jiaan, et al.
Published: (2024)

RationAnomaly: Log Anomaly Detection with Rationality via Chain-of-Thought and Reinforcement Learning
by: Xu, Song, et al.
Published: (2025)

Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning
by: Xu, Haolei, et al.
Published: (2025)

Stop When Enough: Adaptive Early-Stopping for Chain-of-Thought Reasoning
by: Sun, Renliang, et al.
Published: (2025)

Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization
by: Luo, Haotian, et al.
Published: (2025)

UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts
by: Yang, Bo, et al.
Published: (2024)

Learning to Edit Knowledge via Instruction-based Chain-of-Thought Prompting
by: Fu, Jinhu, et al.
Published: (2026)

AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
by: Wei, Zhepei, et al.
Published: (2025)

CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems
by: Wen, Yan, et al.
Published: (2025)

AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN
by: Shao, Wei, et al.
Published: (2025)

In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought
by: Huang, Sili, et al.
Published: (2024)

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
by: Shen, Maohao, et al.
Published: (2025)

On the Equivalence of Synchronous Coordination Game and Asynchronous Coordination Design
by: Pan, Xinnian Kazusa
Published: (2024)

AdaFuse: Adaptive Multimodal Fusion for Lung Cancer Risk Prediction via Reinforcement Learning
by: Qu, Chongyu, et al.
Published: (2026)

Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2026)

SurgCoT: Advancing Spatiotemporal Reasoning in Surgical Videos through a Chain-of-Thought Benchmark
by: Wang, Gui, et al.
Published: (2026)

Optimal Beamforming for Uplink Covert Communication in MIMO GEO Satellite-Terrestrial Systems
by: Guo, Zewei, et al.
Published: (2026)

Pareto-Optimal Offline Reinforcement Learning via Smooth Tchebysheff Scalarization
by: Bhatnagar, Aadyot, et al.
Published: (2026)

RecCoT: Enhancing Recommendation via Chain-of-Thought
by: Yang, Shuo, et al.
Published: (2025)

AdaTSQ: Pushing the Pareto Frontier of Diffusion Transformers via Temporal-Sensitivity Quantization
by: Zhang, Shaoqiu, et al.
Published: (2026)

SIM-CoT: Supervised Implicit Chain-of-Thought
by: Wei, Xilin, et al.
Published: (2025)

Intention Chain-of-Thought Prompting with Dynamic Routing for Code Generation
by: Li, Shen, et al.
Published: (2025)

CoTEvol: Self-Evolving Chain-of-Thoughts for Data Synthesis in Mathematical Reasoning
by: Wang, Zhuo, et al.
Published: (2026)

AdaThinkDrive: Adaptive Thinking via Reinforcement Learning for Autonomous Driving
by: Luo, Yuechen, et al.
Published: (2025)

CoT-X: An Adaptive Framework for Cross-Model Chain-of-Thought Transfer and Optimization
by: Bi, Ziqian, et al.
Published: (2025)

RULE: Reinforcement UnLEarning Achieves Forget-Retain Pareto Optimality
by: Zhang, Chenlong, et al.
Published: (2025)

Thoughts-as-Planning: Latent World Models for Chain-of-Thoughts Optimization via Reinforcement Planning
by: Liu, Dong, et al.
Published: (2026)

ExpThink: Experience-Guided Reinforcement Learning for Adaptive Chain-of-Thought Compression
by: Bian, Tingcheng, et al.
Published: (2026)

Clear Chain-of-Thought (ClearCoT)
by: Sagić, Andrija
Published: (2026)

CoT-Evo: Evolutionary Distillation of Chain-of-Thought for Scientific Reasoning
by: Feng, Kehua, et al.
Published: (2025)