:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Xu, Chen
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2604.24280
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Maximum Causal Entropy IRL in Mean-Field Games and GNEP Framework for Forward RL
by: Anahtarci, Berkay, et al.
Published: (2024)

FM-IRL: Flow-Matching for Reward Modeling and Policy Regularization in Reinforcement Learning
by: Wan, Zhenglin, et al.
Published: (2025)

IRL for Restless Multi-Armed Bandits with Applications in Maternal and Child Health
by: Jain, Gauri, et al.
Published: (2024)

Scouting By Reward: VLM-TO-IRL-Driven Player Selection For Esports
by: Yan, Qing, et al.
Published: (2026)

CoMI-IRL: Contrastive Multi-Intention Inverse Reinforcement Learning
by: Mone, Antonio, et al.
Published: (2026)

CoIRL-AD: Collaborative-Competitive Imitation-Reinforcement Learning in Latent World Models for Autonomous Driving
by: Zheng, Xiaoji, et al.
Published: (2025)

TreeIRL: Safe Urban Driving with Tree Search and Inverse Reinforcement Learning
by: Tomov, Momchil S., et al.
Published: (2025)

SurgIRL: Towards Life-Long Learning for Surgical Automation by Incremental Reinforcement Learning
by: Ho, Yun-Jie, et al.
Published: (2024)

Relative Entropy Estimation in Function Space: Theory and Applications to Trajectory Inference
by: Wang, Chao, et al.
Published: (2026)

FP-IRL: Fokker--Planck Inverse Reinforcement Learning -- A Physics-Constrained Approach to Markov Decision Processes
by: Huang, Chengyang, et al.
Published: (2023)

A Lecture Note on Offline RL and IRL, Part II: Foundations of Inverse Reinforcement Learning and Dynamic Discrete Choice Models
by: Kang, Enoch Hyunwook
Published: (2026)

Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time
by: Lan, Yifan, et al.
Published: (2025)

State-Free Inference of State-Space Models: The Transfer Function Approach
by: Parnichkun, Rom N., et al.
Published: (2024)

From Reward-Free Representations to Preferences: Rethinking Offline Preference-Based Reinforcement Learning
by: Yang, Jun-Jie, et al.
Published: (2026)

AMPS: Adaptive Modality Preference Steering via Functional Entropy
by: Huang, Zihan, et al.
Published: (2026)

Entropy Controllable Direct Preference Optimization
by: Omura, Motoki, et al.
Published: (2024)

Inference of Utilities and Time Preference in Sequential Decision-Making
by: Cao, Haoyang, et al.
Published: (2024)

GEM: Generative Entropy-Guided Preference Modeling for Few-shot Alignment of LLMs
by: Zhao, Yiyang, et al.
Published: (2025)

Mix- and MoE-DPO: A Variational Inference Approach to Direct Preference Optimization
by: Bohne, Jason, et al.
Published: (2025)

Fisher Random Walk: Automatic Debiasing Contextual Preference Inference for Large Language Model Evaluation
by: Zhang, Yichi, et al.
Published: (2025)

Relative Entropy Pathwise Policy Optimization
by: Voelcker, Claas, et al.
Published: (2025)

Aligning Language Models with Investor and Market Behavior for Financial Recommendations
by: Spadea, Fernando, et al.
Published: (2025)

Graph State-Space Models and Latent Relational Inference
by: Zambon, Daniele, et al.
Published: (2023)

IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models
by: Song, Haonan, et al.
Published: (2026)

Entropy-regularized Gradient Estimators for Approximate Bayesian Inference
by: Kaur, Jasmeet
Published: (2025)

Theoretical Insights in Model Inversion Robustness and Conditional Entropy Maximization for Collaborative Inference Systems
by: Xia, Song, et al.
Published: (2025)

Stock Recommendations for Individual Investors: A Temporal Graph Network Approach with Mean-Variance Efficient Sampling
by: Lee, Youngbin, et al.
Published: (2024)

Variational Inference, Entropy, and Orthogonality: A Unified Theory of Mixture-of-Experts
by: Su, Ye, et al.
Published: (2026)

Benchmark of Likelihood-Free Inference Methods based on Neural and Optimal Transport Approaches
by: Aka, Samira, et al.
Published: (2026)

Label-Free Reinforcement Learning via Cross-Model Entropy
by: Gorbett, Matt, et al.
Published: (2026)

No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference
by: Mani, Pranav, et al.
Published: (2025)

A Free Probabilistic Framework for Denoising Diffusion Models: Entropy, Transport, and Reverse Processes
by: Das, Swagatam
Published: (2025)

Active Preference Inference using Language Models and Probabilistic Reasoning
by: Piriyakulkij, Wasu Top, et al.
Published: (2023)

Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints
by: Kang, Zilin, et al.
Published: (2025)

Optimizing Language Models for Human Preferences is a Causal Inference Problem
by: Lin, Victoria, et al.
Published: (2024)

Interpretable Relational Inference with LLM-Guided Symbolic Dynamics Modeling
by: Liang, Xiaoxiao, et al.
Published: (2026)

Quantum Maximum Entropy Inference and Hamiltonian Learning
by: Gao, Minbo, et al.
Published: (2024)

Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference
by: Simonds, Toby
Published: (2025)

Conditions on Preference Relations that Guarantee the Existence of Optimal Policies
by: Carr, Jonathan Colaço, et al.
Published: (2023)

SimPO: Simple Preference Optimization with a Reference-Free Reward
by: Meng, Yu, et al.
Published: (2024)