:: Library Catalog

Bibliographic Details
Main Authors:	Cao, Qi; Zhang, Shuhao; Zhou, Ruizhe; Zhang, Ruiyi; Qin, Peijia; Xie, Pengtao
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2601.22323

Similar Items

Send a SCOUT First: Pre-hoc Reasoning for Adaptive Detector Allocation in Prompt-Injection Defense
by: Zhang, Shuhao, et al.
Published: (2026)

DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation
by: Qin, Peijia, et al.
Published: (2026)

DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
by: Zhang, Ruiyi, et al.
Published: (2025)

BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation
by: Qin, Peijia, et al.
Published: (2024)

FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation
by: Zhang, Ruiyi, et al.
Published: (2026)

LLMs Know When They Know, but Do Not Act on It: A Metacognitive Harness for Test-time Scaling
by: Cao, Qi, et al.
Published: (2026)

ATLAS: Agentic Test-time Learning-to-Allocate Scaling
by: Qin, Peijia, et al.
Published: (2026)

DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning
by: Cao, Qi, et al.
Published: (2025)

BiLoRA: A Bi-level Optimization Framework for Overfitting-Resilient Low-Rank Adaptation of Large Pre-trained Models
by: Qiang, Rushi, et al.
Published: (2024)

AIBuildAI: An AI Agent for Automatically Building AI Models
by: Zhang, Ruiyi, et al.
Published: (2026)

SteganoBackdoor: Stealthy and Data-Efficient Backdoor Attacks on Language Models
by: Xue, Eric, et al.
Published: (2025)

TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretraining
by: Zhang, Ruiyi, et al.
Published: (2024)

DreamPRM-1.5: Unlocking the Potential of Each Instance for Multimodal Process Reward Model Training
by: Cao, Qi, et al.
Published: (2025)

BioTool: A Comprehensive Tool-Calling Dataset for Enhancing Biomedical Capabilities of Large Language Models
by: Gao, Xin, et al.
Published: (2026)

AutoLoRA: Automatically Tuning Matrix Ranks in Low-Rank Adaptation Based on Meta Learning
by: Zhang, Ruiyi, et al.
Published: (2024)

Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs
by: Gao, Xin, et al.
Published: (2025)

A Foundational Multi-Modal Model for Few-Shot Learning
by: Dang, Pengtao, et al.
Published: (2025)

SRTFD: Scalable Real-Time Fault Diagnosis through Online Continual Learning
by: Zhao, Dandan, et al.
Published: (2024)

Neural Operators for Predictor Feedback Control of Nonlinear Delay Systems
by: Bhan, Luke, et al.
Published: (2024)

PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling
by: Zhong, Ruizhe, et al.
Published: (2024)

Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization
by: Guo, Han, et al.
Published: (2024)

Scalable Complexity Control Facilitates Reasoning Ability of LLMs
by: Hang, Liangkai, et al.
Published: (2025)

AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model
by: Li, Yuchen, et al.
Published: (2024)

Beyond Classical Attention: Quantum Attention for Scalable Computation
by: Guo, Xuyang, et al.
Published: (2023)

Adaptive Overclocking: Dynamic Control of Thinking Path Length via Real-Time Reasoning Signals
by: Jiang, Shuhao, et al.
Published: (2025)

SCOPE-FE: Structured Control of Operator and Pairwise Exploration for Feature Engineering
by: Park, Minhee, et al.
Published: (2026)

SCOPE-RL: Stable and Quantitative Control of Policy Entropy in RL Post-Training
by: Wang, Chen, et al.
Published: (2025)

Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood
by: Aragam, Bryon, et al.
Published: (2024)

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
by: Shi, Ruizhe, et al.
Published: (2023)

Certificated Actor-Critic: Hierarchical Reinforcement Learning with Control Barrier Functions for Safe Navigation
by: Xie, Junjun, et al.
Published: (2025)

Efficient Generative Prediction for EHR Foundation Models: The SCOPE and REACH Estimators
by: Solo, Luke, et al.
Published: (2026)

Equivariant Spherical Transformer for Efficient Molecular Modeling
by: An, Junyi, et al.
Published: (2025)

Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption
by: Lai, Zhizheng, et al.
Published: (2024)

FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
by: Sun, Chaojie, et al.
Published: (2026)

SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
by: Zhang, Jesse, et al.
Published: (2023)

Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization
by: Feng, Fu, et al.
Published: (2026)

Near-Oracle KV Selection via Pre-hoc Sparsity for Long-Context Inference
by: Gao, Yifei, et al.
Published: (2026)

Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts
by: Somayajula, Sai Ashish, et al.
Published: (2024)

DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models
by: Zhang, Yuxuan, et al.
Published: (2024)

SCOPE: Structured Prototype-Guided Adaptation for EEG Foundation Models with Limited Labels
by: Ma, Jingying, et al.
Published: (2026)