| Main Authors: | Zeng, Qiuhai; Rajkumar, Sarvesh; Wang, Di; Gyanchandani, Narendra; Yan, Wenbo |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.18607 |
Similar Items
ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems
by: Hamad, Hassan, et al.
Published: (2025)
AIRepr: An Analyst-Inspector Framework for Evaluating Reproducibility of LLMs in Data Science
by: Zeng, Qiuhai, et al.
Published: (2025)
Hybrid CNN with Chebyshev Polynomial Expansion for Medical Image Analysis
by: Roy, Abhinav, et al.
Published: (2025)
ProKAN: Progressive Stacking of Kolmogorov-Arnold Networks for Efficient Liver Segmentation
by: Gyanchandani, Bhavesh, et al.
Published: (2024)
Advancing Parkinson's Disease Progression Prediction: Comparing Long Short-Term Memory Networks and Kolmogorov-Arnold Networks
by: Roy, Abhinav, et al.
Published: (2024)
Watermarking Language Models with Error Correcting Codes
by: Chao, Patrick, et al.
Published: (2024)
Act or Escalate? Evaluating Escalation Behavior in Automation with Language Models
by: DiSorbo, Matthew DosSantos, et al.
Published: (2026)
Select before Act: Spatially Decoupled Action Repetition for Continuous Control
by: Nie, Buqing, et al.
Published: (2025)
Numerical Error Analysis of Large Language Models
by: Budzinskiy, Stanislav, et al.
Published: (2025)
Multi-Layer GRPO: Enhancing Reasoning and Self-Correction in Large Language Models
by: Ding, Fei, et al.
Published: (2025)
Subliminal Corruption: Mechanisms, Thresholds, and Interpretability
by: Vir, Reya, et al.
Published: (2025)
ProgCo: Program Helps Self-Correction of Large Language Models
by: Song, Xiaoshuai, et al.
Published: (2025)
Synthetic Error Injection Fails to Elicit Self-Correction In Language Models
by: Wu, David X., et al.
Published: (2025)
Test-Time Iterative Error Correction for Efficient Diffusion Models
by: Zhong, Yunshan, et al.
Published: (2025)
A Functionality-Grounded Benchmark for Evaluating Web Agents in E-commerce Domains
by: Zhang, Xianren, et al.
Published: (2025)
Proximal Reliability Optimization for Reinforcement Learning
by: Patwardhan, Narendra, et al.
Published: (2019)
Evaluating Mathematical Reasoning of Large Language Models: A Focus on Error Identification and Correction
by: Li, Xiaoyuan, et al.
Published: (2024)
Train-before-Test Harmonizes Language Model Rankings
by: Zhang, Guanhua, et al.
Published: (2025)
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma
by: Keita, Mamadou K., et al.
Published: (2024)
Training with Confidence: Catching Silent Errors in Deep Learning Training with Automated Proactive Checks
by: Jiang, Yuxuan, et al.
Published: (2025)
Unified Error Correction Code Transformer with Low Complexity
by: Yan, Yongli, et al.
Published: (2024)
ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy
by: Yang, Zonghan, et al.
Published: (2024)
Rethink the Role of Neural Decoders in Quantum Error Correction
by: Yan, Ge, et al.
Published: (2026)
Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications
by: Zhang, Yanxiang, et al.
Published: (2025)
Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models
by: Chen, Sijia, et al.
Published: (2024)
ProEval: Proactive Failure Discovery and Efficient Performance Estimation for Generative AI Evaluation
by: Huang, Yizheng, et al.
Published: (2026)
A Modular Zero-Shot Pipeline for Accident Detection, Localization, and Classification in Traffic Surveillance Video
by: Thakur, Amey, et al.
Published: (2026)
Benchmarking Machine Learning Models for Quantum Error Correction
by: Zhao, Yue
Published: (2023)
M3PO: Massively Multi-Task Model-Based Policy Optimization
by: Narendra, Aditya, et al.
Published: (2025)
ProAgent: Building Proactive Cooperative Agents with Large Language Models
by: Zhang, Ceyao, et al.
Published: (2023)
Causal Reflection with Language Models
by: Aryan, Abi, et al.
Published: (2025)
Emergent Symbolic Structure in Health Foundation Models: Extraction, Alignment, and Cross-Modal Transfer
by: Katuwal, Gajendra, et al.
Published: (2026)
ActTail: Global Activation Sparsity in Large Language Models
by: Hou, Wenwen, et al.
Published: (2026)
TranSQL+: Serving Large Language Models with SQL on Low-Resource Hardware
by: Sun, Wenbo, et al.
Published: (2025)
What Makes Reasoning Invalid: Echo Reflection Mitigation for Large Language Models
by: He, Chen, et al.
Published: (2025)
Proactive Constrained Policy Optimization with Preemptive Penalty
by: Yang, Ning, et al.
Published: (2025)
Federated Learning Architectures: A Performance Evaluation with Crop Yield Prediction Application
by: Mukherjee, Anwesha, et al.
Published: (2024)
Security in the Fine-Tuning Lifecycle of Large Language Models: Threats, Defenses, Evaluation, and Future Directions
by: Li, Wenjuan, et al.
Published: (2026)
Neural Grammatical Error Correction for Romanian
by: Cotet, Teodor-Mihai, et al.
Published: (2026)
Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure
by: Di Gioia, Davide
Published: (2026)