Saved in:
| Main Authors: | Cao, Qi, Zhang, Shuhao, Zhou, Ruizhe, Zhang, Ruiyi, Qin, Peijia, Xie, Pengtao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.22323 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Send a SCOUT First: Pre-hoc Reasoning for Adaptive Detector Allocation in Prompt-Injection Defense
by: Zhang, Shuhao, et al.
Published: (2026)
by: Zhang, Shuhao, et al.
Published: (2026)
DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation
by: Qin, Peijia, et al.
Published: (2026)
by: Qin, Peijia, et al.
Published: (2026)
DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
by: Zhang, Ruiyi, et al.
Published: (2025)
by: Zhang, Ruiyi, et al.
Published: (2025)
BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation
by: Qin, Peijia, et al.
Published: (2024)
by: Qin, Peijia, et al.
Published: (2024)
FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation
by: Zhang, Ruiyi, et al.
Published: (2026)
by: Zhang, Ruiyi, et al.
Published: (2026)
LLMs Know When They Know, but Do Not Act on It: A Metacognitive Harness for Test-time Scaling
by: Cao, Qi, et al.
Published: (2026)
by: Cao, Qi, et al.
Published: (2026)
ATLAS: Agentic Test-time Learning-to-Allocate Scaling
by: Qin, Peijia, et al.
Published: (2026)
by: Qin, Peijia, et al.
Published: (2026)
DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning
by: Cao, Qi, et al.
Published: (2025)
by: Cao, Qi, et al.
Published: (2025)
BiLoRA: A Bi-level Optimization Framework for Overfitting-Resilient Low-Rank Adaptation of Large Pre-trained Models
by: Qiang, Rushi, et al.
Published: (2024)
by: Qiang, Rushi, et al.
Published: (2024)
AIBuildAI: An AI Agent for Automatically Building AI Models
by: Zhang, Ruiyi, et al.
Published: (2026)
by: Zhang, Ruiyi, et al.
Published: (2026)
SteganoBackdoor: Stealthy and Data-Efficient Backdoor Attacks on Language Models
by: Xue, Eric, et al.
Published: (2025)
by: Xue, Eric, et al.
Published: (2025)
TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretraining
by: Zhang, Ruiyi, et al.
Published: (2024)
by: Zhang, Ruiyi, et al.
Published: (2024)
DreamPRM-1.5: Unlocking the Potential of Each Instance for Multimodal Process Reward Model Training
by: Cao, Qi, et al.
Published: (2025)
by: Cao, Qi, et al.
Published: (2025)
BioTool: A Comprehensive Tool-Calling Dataset for Enhancing Biomedical Capabilities of Large Language Models
by: Gao, Xin, et al.
Published: (2026)
by: Gao, Xin, et al.
Published: (2026)
AutoLoRA: Automatically Tuning Matrix Ranks in Low-Rank Adaptation Based on Meta Learning
by: Zhang, Ruiyi, et al.
Published: (2024)
by: Zhang, Ruiyi, et al.
Published: (2024)
Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs
by: Gao, Xin, et al.
Published: (2025)
by: Gao, Xin, et al.
Published: (2025)
A Foundational Multi-Modal Model for Few-Shot Learning
by: Dang, Pengtao, et al.
Published: (2025)
by: Dang, Pengtao, et al.
Published: (2025)
SRTFD: Scalable Real-Time Fault Diagnosis through Online Continual Learning
by: Zhao, Dandan, et al.
Published: (2024)
by: Zhao, Dandan, et al.
Published: (2024)
Neural Operators for Predictor Feedback Control of Nonlinear Delay Systems
by: Bhan, Luke, et al.
Published: (2024)
by: Bhan, Luke, et al.
Published: (2024)
PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling
by: Zhong, Ruizhe, et al.
Published: (2024)
by: Zhong, Ruizhe, et al.
Published: (2024)
Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization
by: Guo, Han, et al.
Published: (2024)
by: Guo, Han, et al.
Published: (2024)
Scalable Complexity Control Facilitates Reasoning Ability of LLMs
by: Hang, Liangkai, et al.
Published: (2025)
by: Hang, Liangkai, et al.
Published: (2025)
AM-SAM: Automated Prompting and Mask Calibration for Segment Anything Model
by: Li, Yuchen, et al.
Published: (2024)
by: Li, Yuchen, et al.
Published: (2024)
Beyond Classical Attention: Quantum Attention for Scalable Computation
by: Guo, Xuyang, et al.
Published: (2023)
by: Guo, Xuyang, et al.
Published: (2023)
Adaptive Overclocking: Dynamic Control of Thinking Path Length via Real-Time Reasoning Signals
by: Jiang, Shuhao, et al.
Published: (2025)
by: Jiang, Shuhao, et al.
Published: (2025)
SCOPE-FE: Structured Control of Operator and Pairwise Exploration for Feature Engineering
by: Park, Minhee, et al.
Published: (2026)
by: Park, Minhee, et al.
Published: (2026)
SCOPE-RL: Stable and Quantitative Control of Policy Entropy in RL Post-Training
by: Wang, Chen, et al.
Published: (2025)
by: Wang, Chen, et al.
Published: (2025)
Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood
by: Aragam, Bryon, et al.
Published: (2024)
by: Aragam, Bryon, et al.
Published: (2024)
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
by: Shi, Ruizhe, et al.
Published: (2023)
by: Shi, Ruizhe, et al.
Published: (2023)
Certificated Actor-Critic: Hierarchical Reinforcement Learning with Control Barrier Functions for Safe Navigation
by: Xie, Junjun, et al.
Published: (2025)
by: Xie, Junjun, et al.
Published: (2025)
Efficient Generative Prediction for EHR Foundation Models: The SCOPE and REACH Estimators
by: Solo, Luke, et al.
Published: (2026)
by: Solo, Luke, et al.
Published: (2026)
Equivariant Spherical Transformer for Efficient Molecular Modeling
by: An, Junyi, et al.
Published: (2025)
by: An, Junyi, et al.
Published: (2025)
Efficient Privacy-Preserving KAN Inference Using Homomorphic Encryption
by: Lai, Zhizheng, et al.
Published: (2024)
by: Lai, Zhizheng, et al.
Published: (2024)
FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
by: Sun, Chaojie, et al.
Published: (2026)
by: Sun, Chaojie, et al.
Published: (2026)
SPRINT: Scalable Policy Pre-Training via Language Instruction Relabeling
by: Zhang, Jesse, et al.
Published: (2023)
by: Zhang, Jesse, et al.
Published: (2023)
Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization
by: Feng, Fu, et al.
Published: (2026)
by: Feng, Fu, et al.
Published: (2026)
Near-Oracle KV Selection via Pre-hoc Sparsity for Long-Context Inference
by: Gao, Yifei, et al.
Published: (2026)
by: Gao, Yifei, et al.
Published: (2026)
Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts
by: Somayajula, Sai Ashish, et al.
Published: (2024)
by: Somayajula, Sai Ashish, et al.
Published: (2024)
DLP-LoRA: Efficient Task-Specific LoRA Fusion with a Dynamic, Lightweight Plugin for Large Language Models
by: Zhang, Yuxuan, et al.
Published: (2024)
by: Zhang, Yuxuan, et al.
Published: (2024)
SCOPE: Structured Prototype-Guided Adaptation for EEG Foundation Models with Limited Labels
by: Ma, Jingying, et al.
Published: (2026)
by: Ma, Jingying, et al.
Published: (2026)
Similar Items
-
Send a SCOUT First: Pre-hoc Reasoning for Adaptive Detector Allocation in Prompt-Injection Defense
by: Zhang, Shuhao, et al.
Published: (2026) -
DAJ: Data-Reweighted LLM Judge for Test-Time Scaling in Code Generation
by: Qin, Peijia, et al.
Published: (2026) -
DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
by: Zhang, Ruiyi, et al.
Published: (2025) -
BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation
by: Qin, Peijia, et al.
Published: (2024) -
FunPRM: Function-as-Step Process Reward Model with Meta Reward Correction for Code Generation
by: Zhang, Ruiyi, et al.
Published: (2026)