:: Library Catalog

Bibliographic Details
Main Authors:	Huang, Yuchen; Li, Sijia; Liu, Minghao; Liu, Wei; Huang, Shijue; Fan, Zhiyuan; Chan, Hou Pong; Fung, Yi R.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.09586

Similar Items

SELF-REDRAFT: Eliciting Intrinsic Exploration-Exploitation Balance in Test-Time Scaling for Code Generation
by: Chen, Yixiang, et al.
Published: (2025)

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents
by: Liu, Jiayu, et al.
Published: (2025)

Experience-Evolving Multi-Turn Tool-Use Agent with Hybrid Episodic-Procedural Memory
by: Li, Sijia, et al.
Published: (2025)

MedEBench: Diagnosing Reliability in Text-Guided Medical Image Editing
by: Liu, Minghao, et al.
Published: (2025)

Lean4Physics: Comprehensive Reasoning Framework for College-level Physics in Lean4
by: Li, Yuxin, et al.
Published: (2025)

From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Models
by: Huang, Kung-Hsiang, et al.
Published: (2024)

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting
by: Huang, Shijue, et al.
Published: (2025)

SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
by: Fan, Chongyu, et al.
Published: (2023)

GSTM-HMU: Generative Spatio-Temporal Modeling for Human Mobility Understanding
by: Luo, Wenying, et al.
Published: (2025)

Robust Layerwise Scaling Rules by Proper Weight Decay Tuning
by: Fan, Zhiyuan, et al.
Published: (2025)

Sample Efficient Experience Replay in Non-stationary Environments
by: Duan, Tianyang, et al.
Published: (2025)

What Limits Agentic Systems Efficiency?
by: Bian, Song, et al.
Published: (2025)

SAGE: A Novelty Gate for Efficient Memory Evolution in Agentic LLMs
by: Wang, Sijia, et al.
Published: (2026)

Enhancing Molecular Property Predictions by Learning from Bond Modelling and Interactions
by: Liu, Yunqing, et al.
Published: (2026)

Agentic Critical Training
by: Liu, Weize, et al.
Published: (2026)

Verbal Process Supervision Elicits Better Coding Agents
by: Chen, Hao-Yuan, et al.
Published: (2025)

Positive Experience Reflection for Agents in Interactive Text Environments
by: Lippmann, Philip, et al.
Published: (2024)

AdaBFL: Multi-Layer Defensive Adaptive Aggregation for Byzantine-Robust Federated Learning
by: Tang, Zehui, et al.
Published: (2026)

Local-Global Multimodal Contrastive Learning for Molecular Property Prediction
by: Liu, Xiayu, et al.
Published: (2026)

Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning
by: Fan, Chongyu, et al.
Published: (2024)

Entropy Centroids as Intrinsic Rewards for Test-Time Scaling
by: Zhao, Wenshuo, et al.
Published: (2026)

Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency
by: Pal, Soumyadeep, et al.
Published: (2024)

Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation
by: Zhong, Yisheng, et al.
Published: (2026)

ProAct: Agentic Lookahead in Interactive Environments
by: Yu, Yangbin, et al.
Published: (2026)

Graph-based Confidence Calibration for Large Language Models
by: Li, Yukun, et al.
Published: (2024)

GEAR: Granularity-Adaptive Advantage Reweighting for LLM Agents via Self-Distillation
by: Li, Sijia, et al.
Published: (2026)

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
by: Dai, Weinan, et al.
Published: (2026)

CuES: A Curiosity-driven and Environment-grounded Synthesis Framework for Agentic RL
by: Mai, Shinji, et al.
Published: (2025)

Deep Frequency Derivative Learning for Non-stationary Time Series Forecasting
by: Fan, Wei, et al.
Published: (2024)

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure
by: Gao, Wei, et al.
Published: (2025)

Reasoning Model Unlearning: Forgetting Traces, Not Just Answers, While Preserving Reasoning Skills
by: Wang, Changsheng, et al.
Published: (2025)

Efficient Test-Time Scaling via Self-Calibration
by: Huang, Chengsong, et al.
Published: (2025)

ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
by: Zhang, Hengrui, et al.
Published: (2025)

FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving
by: Lin, Xiaohan, et al.
Published: (2024)

Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement
by: Sun, Chenkai, et al.
Published: (2024)

Automatic Dataset Construction (ADC): Sample Collection, Data Curation, and Beyond
by: Liu, Minghao, et al.
Published: (2024)

MedAgentGym: A Scalable Agentic Training Environment for Code-Centric Reasoning in Biomedical Data Science
by: Xu, Ran, et al.
Published: (2025)

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
by: Wang, Zhaoyang, et al.
Published: (2026)

Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
by: Liu, Yuhang, et al.
Published: (2025)

Unveiling the Lack of LVLM Robustness to Fundamental Visual Variations: Why and Path Forward
by: Fan, Zhiyuan, et al.
Published: (2025)