:: Library Catalog

Bibliographic Details
Main Authors:	Kang, Zilin; Liao, Chonghua; Xu, Tingqiang; Xu, Huazhe
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.08549

Similar Items

A Forget-and-Grow Strategy for Deep Reinforcement Learning Scaling in Continuous Control
by: Kang, Zilin, et al.
Published: (2025)

ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
by: Ji, Tianying, et al.
Published: (2024)

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
by: Uehara, Masatoshi, et al.
Published: (2024)

Large Language Model Compression via the Nested Activation-Aware Decomposition
by: Lu, Jun, et al.
Published: (2025)

Fast Policy Learning for Linear Quadratic Control with Entropy Regularization
by: Guo, Xin, et al.
Published: (2023)

TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning
by: Wu, Jinyang, et al.
Published: (2025)

Rethinking Entropy Regularization in Large Reasoning Models
by: Jiang, Yuxian, et al.
Published: (2025)

Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement
by: Wen, Muning, et al.
Published: (2024)

E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity
by: Li, Yun, et al.
Published: (2023)

Entropy-Regularized Process Reward Model
by: Zhang, Hanning, et al.
Published: (2024)

Boosting Entropy with Bell Box Quantization
by: Yang, Ningfeng, et al.
Published: (2026)

Internal Value Alignment in Large Language Models through Controlled Value Vector Activation
by: Jin, Haoran, et al.
Published: (2025)

Clip-Low Increases Entropy and Clip-High Decreases Entropy in Reinforcement Learning of Large Language Models
by: Park, Jaesung R., et al.
Published: (2025)

Mixture of Message Passing Experts with Routing Entropy Regularization for Node Classification
by: Chen, Xuanze, et al.
Published: (2025)

Massive Activations in Large Language Models
by: Sun, Mingjie, et al.
Published: (2024)

Entropy Reweighted Conformal Classification
by: Luo, Rui, et al.
Published: (2024)

Delta Activations: A Representation for Finetuned Large Language Models
by: Xu, Zhiqiu, et al.
Published: (2025)

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
by: Shi, Ruizhe, et al.
Published: (2023)

State Entropy Regularization for Robust Reinforcement Learning
by: Ashlag, Yonatan, et al.
Published: (2025)

Refined Analysis of Entropy-Regularized Actor-Critic
by: Labbi, Safwan, et al.
Published: (2026)

Generative Flow Networks as Entropy-Regularized RL
by: Tiapkin, Daniil, et al.
Published: (2023)

Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
by: De Santi, Riccardo, et al.
Published: (2025)

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization
by: Nrusimha, Aniruddha, et al.
Published: (2024)

DittoGym: Learning to Control Soft Shape-Shifting Robots
by: Huang, Suning, et al.
Published: (2024)

World Models with Hints of Large Language Models for Goal Achieving
by: Liu, Zeyuan, et al.
Published: (2024)

First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models
by: Ma, Chi, et al.
Published: (2024)

Facies Classification with Copula Entropy
by: Ma, Jian
Published: (2025)

Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization
by: Borzechowski, Florian, et al.
Published: (2025)

Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
by: Jhaveri, Yash, et al.
Published: (2025)

Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization
by: Liao, Junyi, et al.
Published: (2026)

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
by: Wang, Shumin, et al.
Published: (2026)

Model-Free Inference of Investor Preferences: A Relative Entropy IRL Approach
by: Xu, Chen
Published: (2026)

Universal Properties of Activation Sparsity in Modern Large Language Models
by: Szatkowski, Filip, et al.
Published: (2025)

Smoothness Adaptivity in Constant-Depth Neural Networks: Optimal Rates via Smooth Activations
by: Liu, Yuhao, et al.
Published: (2026)

Understanding and Preventing Entropy Collapse in RLVR with On-Policy Entropy Flow Optimization
by: Xu, Huimin, et al.
Published: (2026)

FACET: Force-Adaptive Control via Impedance Reference Tracking for Legged Robots
by: Xu, Botian, et al.
Published: (2025)

Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)

ERPPO: Entropy Regularization-based Proximal Policy Optimization
by: Lee, Changha, et al.
Published: (2026)

Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
by: Luo, Yu, et al.
Published: (2024)

On the Entropy Calibration of Language Models
by: Cao, Steven, et al.
Published: (2025)