| Main Authors: | Gu, Yuxuan, Bai, Weimin, Wang, Yifei, Luo, Weijian, Sun, He |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.15190 |
Similar Items
FastMTP: Accelerating LLM Inference with Enhanced Multi-Token Prediction
by: Cai, Yuxuan, et al.
Published: (2025)
Representation Learning of Lab Values via Masked AutoEncoders
by: Restrepo, David, et al.
Published: (2025)
SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
by: Li, Zekun, et al.
Published: (2026)
VARADE: a Variational-based AutoRegressive model for Anomaly Detection on the Edge
by: Mascolini, Alessio, et al.
Published: (2024)
Trust Region Masking for Long-Horizon LLM Reinforcement Learning
by: Li, Yingru, et al.
Published: (2025)
HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
by: Kumbong, Hermann, et al.
Published: (2025)
Permutation Equivariant Model-based Offline Reinforcement Learning for Auto-bidding
by: Mou, Zhiyu, et al.
Published: (2025)
Fast Inference of Removal-Based Node Influence
by: Li, Weikai, et al.
Published: (2024)
Unbiased Diffusion Variational Inversion via Principled Posterior Matching
by: Bai, Weimin, et al.
Published: (2026)
Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling
by: Hao, Yongchang, et al.
Published: (2026)
A Practical Introduction to Deep Reinforcement Learning
by: Sun, Yinghan, et al.
Published: (2025)
Slow-Fast Inference: Training-Free Inference Acceleration via Within-Sentence Support Stability
by: Xie, Xingyu, et al.
Published: (2026)
Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
by: Zhou, Chengmin, et al.
Published: (2025)
Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method
by: Choi, Kyuwon, et al.
Published: (2024)
Discrepancy-Aware Graph Mask Auto-Encoder
by: Zheng, Ziyu, et al.
Published: (2025)
FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies
by: He, Tianqi, et al.
Published: (2025)
Neuro-symbolic Action Masking for Deep Reinforcement Learning
by: Han, Shuai, et al.
Published: (2026)
Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning
by: Pan, Haolin, et al.
Published: (2025)
Non-Stationary Latent Auto-Regressive Bandits
by: Trella, Anna L., et al.
Published: (2024)
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
by: Luo, Weijian
Published: (2024)
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
by: Chen, Yuxuan, et al.
Published: (2024)
TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis
by: Bai, Sikai, et al.
Published: (2026)
Fast and Robust Likelihood-Guided Diffusion Posterior Sampling with Amortized Variational Inference
by: Zheng, Léon, et al.
Published: (2026)
POLO: Preference-Guided Multi-Turn Reinforcement Learning for Lead Optimization
by: Wang, Ziqing, et al.
Published: (2025)
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)
CIMAGE: Exploiting the Conditional Independence in Masked Graph Auto-encoders
by: Park, Jongwon, et al.
Published: (2025)
Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction
by: Wang, Yifei, et al.
Published: (2025)
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective
by: Zhao, Lei, et al.
Published: (2023)
CFASL: Composite Factor-Aligned Symmetry Learning for Disentanglement in Variational AutoEncoder
by: Jung, Hee-Jun, et al.
Published: (2024)
LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
by: Li, Ang, et al.
Published: (2025)
G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning
by: Guo, Xiaojun, et al.
Published: (2025)
Market Making Strategies with Reinforcement Learning
by: Vicente, Óscar Fernández
Published: (2025)
DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning
by: Wang, Yujie, et al.
Published: (2026)
Behavior Preference Regression for Offline Reinforcement Learning
by: Srinivasan, Padmanaba, et al.
Published: (2025)
What Makes a Good Diffusion Planner for Decision Making?
by: Lu, Haofei, et al.
Published: (2025)
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
by: Brantley, Kianté, et al.
Published: (2025)
Learning Probabilities of Causation with Mask-Augmented Data
by: Wang, Shuai, et al.
Published: (2025)
AutoHLS: Learning to Accelerate Design Space Exploration for HLS Designs
by: Ahmed, Md Rubel, et al.
Published: (2024)
FlashSVD v1.5: Making Low-Rank Transformers Inference Actually Fast
by: Wu, Wenhao, et al.
Published: (2026)
Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data
by: Wang, Danyang, et al.
Published: (2024)