:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Peng, Xiao, Geng, Xufan
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.00359
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Do LLMs Really Think Step-by-step In Implicit Reasoning?
by: Yu, Yijiong
Published: (2024)

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping
by: Wang, Haoyu, et al.
Published: (2024)

Self-Evaluating LLMs for Multi-Step Tasks: Stepwise Confidence Estimation for Failure Detection
by: Mavi, Vaibhav, et al.
Published: (2025)

From Long to Lean: Performance-aware and Adaptive Chain-of-Thought Compression via Multi-round Refinement
by: Yan, Jianzhi, et al.
Published: (2025)

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation
by: Zhang, Xiaoying, et al.
Published: (2024)

LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked
by: Phute, Mansi, et al.
Published: (2023)

Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning
by: Li, Yiwei, et al.
Published: (2024)

Step-Tagging: Toward controlling the generation of Language Reasoning Models through step monitoring
by: Belkhiter, Yannis, et al.
Published: (2025)

Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing
by: Yang, Diji, et al.
Published: (2025)

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
by: Wang, Peiyi, et al.
Published: (2023)

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models
by: Yang, Xiao-Wen, et al.
Published: (2025)

Multi-round jailbreak attack on large language models
by: Zhou, Yihua, et al.
Published: (2024)

A Lightweight Framework for Trigger-Guided LoRA-Based Self-Adaptation in LLMs
by: Wei, Jiacheng, et al.
Published: (2025)

Self-Evolved Reward Learning for LLMs
by: Huang, Chenghua, et al.
Published: (2024)

Confidence Improves Self-Consistency in LLMs
by: Taubenfeld, Amir, et al.
Published: (2025)

Self-supervised Attribute-aware Dynamic Preference Ranking Alignment
by: Yang, Hongyu, et al.
Published: (2025)

The Self-Execution Benchmark: Measuring LLMs' Attempts to Overcome Their Lack of Self-Execution
by: Ezra, Elon, et al.
Published: (2025)

Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification
by: Kumar, Adarsh, et al.
Published: (2025)

PoTPTQ: A Two-step Power-of-Two Post-training for LLMs
by: Wang, Xinyu, et al.
Published: (2025)

AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification
by: Zhang, Xuan, et al.
Published: (2025)

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
by: Tie, Guiyao, et al.
Published: (2025)

Regression-aware Inference with LLMs
by: Lukasik, Michal, et al.
Published: (2024)

Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
by: Lai, Xin, et al.
Published: (2024)

Theory of Mind and Self-Attributions of Mentality are Dissociable in LLMs
by: Kim, Junsol, et al.
Published: (2026)

Mitigating Attention Localization in Small Scale: Self-Attention Refinement via One-step Belief Propagation
by: Lee, Nakyung, et al.
Published: (2025)

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
by: Xu, Tianyang, et al.
Published: (2024)

What Defines Good Reasoning in LLMs? Dissecting Reasoning Steps with Multi-Aspect Evaluation
by: Do, Heejin, et al.
Published: (2025)

From Building Blocks to Planning: Multi-Step Spatial Reasoning in LLMs with Reinforcement Learning
by: Tahmasbi, Amir, et al.
Published: (2025)

Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
by: Tyagi, Nemika, et al.
Published: (2024)

Self-Improving Customer Review Response Generation Based on LLMs
by: Azov, Guy, et al.
Published: (2024)

Distilling Text Style Transfer With Self-Explanation From LLMs
by: Zhang, Chiyu, et al.
Published: (2024)

SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition
by: Wu, Mengsong, et al.
Published: (2025)

Cascaded Self-Evaluation Augmented Training for Lightweight Multimodal LLMs
by: Lv, Zheqi, et al.
Published: (2025)

SmartThinker: Learning to Compress and Preserve Reasoning by Step-Level Length Control
by: He, Xingyang, et al.
Published: (2025)

Defend LLMs Through Self-Consciousness
by: Huang, Boshi, et al.
Published: (2025)

Transformer-Squared: Self-adaptive LLMs
by: Sun, Qi, et al.
Published: (2025)

Confidence-aware Self-Semantic Distillation on Knowledge Graph Embedding
by: Liu, Yichen, et al.
Published: (2022)

Controlled Self-Evolution for Algorithmic Code Optimization
by: Hu, Tu, et al.
Published: (2026)

SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese
by: Xu, Liang, et al.
Published: (2024)

Tracking the Limits of Knowledge Propagation: How LLMs Fail at Multi-Step Reasoning with Conflicting Knowledge
by: Feng, Yiyang, et al.
Published: (2026)