| Main Authors: | Kang, Zilin, Liao, Chonghua, Xu, Tingqiang, Xu, Huazhe |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.08549 |
Similar Items
A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control
by: Kang, Zilin, et al.
Published: (2025)
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
by: Ji, Tianying, et al.
Published: (2024)
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
by: Uehara, Masatoshi, et al.
Published: (2024)
Large Language Model Compression via the Nested Activation-Aware Decomposition
by: Lu, Jun, et al.
Published: (2025)
Fast Policy Learning for Linear Quadratic Control with Entropy Regularization
by: Guo, Xin, et al.
Published: (2023)
TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning
by: Wu, Jinyang, et al.
Published: (2025)
Rethinking Entropy Regularization in Large Reasoning Models
by: Jiang, Yuxian, et al.
Published: (2025)
Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement
by: Wen, Muning, et al.
Published: (2024)
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity
by: Li, Yun, et al.
Published: (2023)
Entropy-Regularized Process Reward Model
by: Zhang, Hanning, et al.
Published: (2024)
Boosting Entropy with Bell Box Quantization
by: Yang, Ningfeng, et al.
Published: (2026)
Internal Value Alignment in Large Language Models through Controlled Value Vector Activation
by: Jin, Haoran, et al.
Published: (2025)
Clip-Low Increases Entropy and Clip-High Decreases Entropy in Reinforcement Learning of Large Language Models
by: Park, Jaesung R., et al.
Published: (2025)
Mixture of Message Passing Experts with Routing Entropy Regularization for Node Classification
by: Chen, Xuanze, et al.
Published: (2025)
Massive Activations in Large Language Models
by: Sun, Mingjie, et al.
Published: (2024)
Entropy Reweighted Conformal Classification
by: Luo, Rui, et al.
Published: (2024)
Delta Activations: A Representation for Finetuned Large Language Models
by: Xu, Zhiqiu, et al.
Published: (2025)
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
by: Shi, Ruizhe, et al.
Published: (2023)
State Entropy Regularization for Robust Reinforcement Learning
by: Ashlag, Yonatan, et al.
Published: (2025)
Refined Analysis of Entropy-Regularized Actor-Critic
by: Labbi, Safwan, et al.
Published: (2026)
Generative Flow Networks as Entropy-Regularized RL
by: Tiapkin, Daniil, et al.
Published: (2023)
Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
by: De Santi, Riccardo, et al.
Published: (2025)
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization
by: Nrusimha, Aniruddha, et al.
Published: (2024)
DittoGym: Learning to Control Soft Shape-Shifting Robots
by: Huang, Suning, et al.
Published: (2024)
World Models with Hints of Large Language Models for Goal Achieving
by: Liu, Zeyuan, et al.
Published: (2024)
First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models
by: Ma, Chi, et al.
Published: (2024)
Facies Classification with Copula Entropy
by: Ma, Jian
Published: (2025)
Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization
by: Borzechowski, Florian, et al.
Published: (2025)
Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
by: Jhaveri, Yash, et al.
Published: (2025)
Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization
by: Liao, Junyi, et al.
Published: (2026)
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
by: Wang, Shumin, et al.
Published: (2026)
Model-Free Inference of Investor Preferences: A Relative Entropy IRL Approach
by: Xu, Chen
Published: (2026)
Universal Properties of Activation Sparsity in Modern Large Language Models
by: Szatkowski, Filip, et al.
Published: (2025)
Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations
by: Liu, Yuhao, et al.
Published: (2026)
Understanding and Preventing Entropy Collapse in RLVR with On-Policy Entropy Flow Optimization
by: Xu, Huimin, et al.
Published: (2026)
FACET: Force-Adaptive Control via Impedance Reference Tracking for Legged Robots
by: Xu, Botian, et al.
Published: (2025)
Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)
ERPPO: Entropy Regularization-based Proximal Policy Optimization
by: Lee, Changha, et al.
Published: (2026)
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
by: Luo, Yu, et al.
Published: (2024)
On the Entropy Calibration of Language Models
by: Cao, Steven, et al.
Published: (2025)