| Main Author: | Xu, Chen |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.24280 |
Similar Items
Maximum Causal Entropy IRL in Mean-Field Games and GNEP Framework for Forward RL
by: Anahtarci, Berkay, et al.
Published: (2024)
FM-IRL: Flow-Matching for Reward Modeling and Policy Regularization in Reinforcement Learning
by: Wan, Zhenglin, et al.
Published: (2025)
IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)
Scouting By Reward: VLM-TO-IRL-Driven Player Selection For Esports
by: Yan, Qing, et al.
Published: (2026)
CoMI-IRL: Contrastive Multi-Intention Inverse Reinforcement Learning
by: Mone, Antonio, et al.
Published: (2026)
CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
by: Zheng, Xiaoji, et al.
Published: (2025)
TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning
by: Tomov, Momchil S., et al.
Published: (2025)
SurgIRL: Towards Life-Long Learning for Surgical Automation by Incremental Reinforcement Learning
by: Ho, Yun-Jie, et al.
Published: (2024)
Relative Entropy Estimation in Function Space: Theory and Applications to Trajectory Inference
by: Wang, Chao, et al.
Published: (2026)
FP-IRL: Fokker--Planck Inverse Reinforcement Learning -- A Physics-Constrained Approach to Markov Decision Processes
by: Huang, Chengyang, et al.
Published: (2023)
A Lecture Note on Offline RL and IRL, Part II: Foundations of Inverse Reinforcement Learning and Dynamic Discrete Choice Models
by: Kang, Enoch Hyunwook
Published: (2026)
Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time
by: Lan, Yifan, et al.
Published: (2025)
State-Free Inference of State-Space Models: The Transfer Function Approach
by: Parnichkun, Rom N., et al.
Published: (2024)
From Reward-Free Representations to Preferences: Rethinking Offline Preference-Based Reinforcement Learning
by: Yang, Jun-Jie, et al.
Published: (2026)
AMPS: Adaptive Modality Preference Steering via Functional Entropy
by: Huang, Zihan, et al.
Published: (2026)
Entropy Controllable Direct Preference Optimization
by: Omura, Motoki, et al.
Published: (2024)
Inference of Utilities and Time Preference in Sequential Decision-Making
by: Cao, Haoyang, et al.
Published: (2024)
GEM: Generative Entropy-Guided Preference Modeling for Few-shot Alignment of LLMs
by: Zhao, Yiyang, et al.
Published: (2025)
Mix- and MoE-DPO: A Variational Inference Approach to Direct Preference Optimization
by: Bohne, Jason, et al.
Published: (2025)
Fisher Random Walk: Automatic Debiasing Contextual Preference Inference for Large Language Model Evaluation
by: Zhang, Yichi, et al.
Published: (2025)
Relative Entropy Pathwise Policy Optimization
by: Voelcker, Claas, et al.
Published: (2025)
Aligning Language Models with Investor and Market Behavior for Financial Recommendations
by: Spadea, Fernando, et al.
Published: (2025)
Graph State-Space Models and Latent Relational Inference
by: Zambon, Daniele, et al.
Published: (2023)
IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models
by: Song, Haonan, et al.
Published: (2026)
Entropy-regularized Gradient Estimators for Approximate Bayesian Inference
by: Kaur, Jasmeet
Published: (2025)
Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems
by: Xia, Song, et al.
Published: (2025)
Stock Recommendations for Individual Investors: A Temporal Graph Network Approach with Mean-Variance Efficient Sampling
by: Lee, Youngbin, et al.
Published: (2024)
Variational Inference, Entropy, and Orthogonality: A Unified Theory of Mixture-of-Experts
by: Su, Ye, et al.
Published: (2026)
Benchmark of Likelihood-Free Inference Methods based on Neural and Optimal Transport Approaches
by: Aka, Samira, et al.
Published: (2026)
Label-Free Reinforcement Learning via Cross-Model Entropy
by: Gorbett, Matt, et al.
Published: (2026)
No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference
by: Mani, Pranav, et al.
Published: (2025)
A Free Probabilistic Framework for Denoising Diffusion Models: Entropy, Transport, and Reverse Processes
by: Das, Swagatam
Published: (2025)
Active Preference Inference using Language Models and Probabilistic Reasoning
by: Piriyakulkij, Wasu Top, et al.
Published: (2023)
Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints
by: Kang, Zilin, et al.
Published: (2025)
Optimizing Language Models for Human Preferences is a Causal Inference Problem
by: Lin, Victoria, et al.
Published: (2024)
Interpretable Relational Inference with LLM-Guided Symbolic Dynamics Modeling
by: Liang, Xiaoxiao, et al.
Published: (2026)
Quantum Maximum Entropy Inference and Hamiltonian Learning
by: Gao, Minbo, et al.
Published: (2024)
Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference
by: Simonds, Toby
Published: (2025)
Conditions on Preference Relations that Guarantee the Existence of Optimal Policies
by: Carr, Jonathan Colaço, et al.
Published: (2023)
SimPO: Simple Preference Optimization with a Reference-Free Reward
by: Meng, Yu, et al.
Published: (2024)