:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gu, Yuxuan, Bai, Weimin, Wang, Yifei, Luo, Weijian, Sun, He
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.15190
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FastMTP: Accelerating LLM Inference with Enhanced Multi-Token Prediction
by: Cai, Yuxuan, et al.
Published: (2025)

Representation Learning of Lab Values via Masked AutoEncoders
by: Restrepo, David, et al.
Published: (2025)

SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration
by: Li, Zekun, et al.
Published: (2026)

VARADE: a Variational-based AutoRegressive model for Anomaly Detection on the Edge
by: Mascolini, Alessio, et al.
Published: (2024)

Trust Region Masking for Long-Horizon LLM Reinforcement Learning
by: Li, Yingru, et al.
Published: (2025)

HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
by: Kumbong, Hermann, et al.
Published: (2025)

Permutation Equivariant Model-based Offline Reinforcement Learning for Auto-bidding
by: Mou, Zhiyu, et al.
Published: (2025)

Fast Inference of Removal-Based Node Influence
by: Li, Weikai, et al.
Published: (2024)

Unbiased Diffusion Variational Inversion via Principled Posterior Matching
by: Bai, Weimin, et al.
Published: (2026)

Cactus: Accelerating Auto-Regressive Decoding with Constrained Acceptance Speculative Sampling
by: Hao, Yongchang, et al.
Published: (2026)

A Practical Introduction to Deep Reinforcement Learning
by: Sun, Yinghan, et al.
Published: (2025)

Slow-Fast Inference: Training-Free Inference Acceleration via Within-Sentence Support Stability
by: Xie, Xingyu, et al.
Published: (2026)

Combining Bayesian Inference and Reinforcement Learning for Agent Decision Making: A Review
by: Zhou, Chengmin, et al.
Published: (2025)

Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method
by: Choi, Kyuwon, et al.
Published: (2024)

Discrepancy-Aware Graph Mask Auto-Encoder
by: Zheng, Ziyu, et al.
Published: (2025)

FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies
by: He, Tianqi, et al.
Published: (2025)

Neuro-symbolic Action Masking for Deep Reinforcement Learning
by: Han, Shuai, et al.
Published: (2026)

Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning
by: Pan, Haolin, et al.
Published: (2025)

Non-Stationary Latent Auto-Regressive Bandits
by: Trella, Anna L., et al.
Published: (2024)

Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
by: Luo, Weijian
Published: (2024)

Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
by: Chen, Yuxuan, et al.
Published: (2024)

TTVS: Boosting Self-Exploring Reinforcement Learning via Test-time Variational Synthesis
by: Bai, Sikai, et al.
Published: (2026)

Fast and Robust Likelihood-Guided Diffusion Posterior Sampling with Amortized Variational Inference
by: Zheng, Léon, et al.
Published: (2026)

POLO: Preference-Guided Multi-Turn Reinforcement Learning for Lead Optimization
by: Wang, Ziqing, et al.
Published: (2025)

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)

CIMAGE: Exploiting the Conditional Independence in Masked Graph Auto-encoders
by: Park, Jongwon, et al.
Published: (2025)

Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction
by: Wang, Yifei, et al.
Published: (2025)

Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective
by: Zhao, Lei, et al.
Published: (2023)

CFASL: Composite Factor-Aligned Symmetry Learning for Disentanglement in Variational AutoEncoder
by: Jung, Hee-Jun, et al.
Published: (2024)

LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
by: Li, Ang, et al.
Published: (2025)

G1: Teaching LLMs to Reason on Graphs with Reinforcement Learning
by: Guo, Xiaojun, et al.
Published: (2025)

Market Making Strategies with Reinforcement Learning
by: Vicente, Óscar Fernández
Published: (2025)

DARTS: Distribution-Aware Active Rollout Trajectory Shaping for Accelerating LLM Reinforcement Learning
by: Wang, Yujie, et al.
Published: (2026)

Behavior Preference Regression for Offline Reinforcement Learning
by: Srinivasan, Padmanaba, et al.
Published: (2025)

What Makes a Good Diffusion Planner for Decision Making?
by: Lu, Haofei, et al.
Published: (2025)

Accelerating RL for LLM Reasoning with Optimal Advantage Regression
by: Brantley, Kianté, et al.
Published: (2025)

Learning Probabilities of Causation with Mask-Augmented Data
by: Wang, Shuai, et al.
Published: (2025)

AutoHLS: Learning to Accelerate Design Space Exploration for HLS Designs
by: Ahmed, Md Rubel, et al.
Published: (2024)

FlashSVD v1.5: Making Low-Rank Transformers Inference Actually Fast
by: Wu, Wenhao, et al.
Published: (2026)

Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data
by: Wang, Danyang, et al.
Published: (2024)