:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Shicheng, Xu, Siyuan, Qiu, Wenjie, Zhang, Hangfan, Zhu, Minghui
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2512.13837
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization
by: Mao, Yue, et al.
Published: (2026)

Federated reinforcement learning for robot motion planning with zero-shot generalization
by: Yuan, Zhenyuan, et al.
Published: (2024)

In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
by: Liu, Shicheng, et al.
Published: (2024)

Learning to summarize user information for personalized reinforcement learning from human feedback
by: Nam, Hyunji, et al.
Published: (2025)

Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
by: Xu, Siyuan, et al.
Published: (2024)

Delayed homomorphic reinforcement learning for environments with delayed feedback
by: Lee, Jongsoo, et al.
Published: (2026)

Curriculum reinforcement learning with measurable task representation learning
by: Wen, Yongyan, et al.
Published: (2026)

Explainable deep learning improves human mental models of self-driving cars
by: Kenny, Eoin M., et al.
Published: (2024)

Byzantine-resilient federated online learning for Gaussian process regression
by: Zhang, Xu, et al.
Published: (2025)

Simple Denoising Diffusion Language Models
by: Zhu, Huaisheng, et al.
Published: (2025)

Using reinforcement learning to probe the role of feedback in skill acquisition
by: Terpin, Antonio, et al.
Published: (2025)

Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback
by: Saikai, Yuji, et al.
Published: (2023)

A Bayesian latent class reinforcement learning framework to capture adaptive, feedback-driven travel behaviour
by: Sfeir, Georges, et al.
Published: (2025)

Leveraging weights signals -- Predicting and improving generalizability in reinforcement learning
by: Moulin, Olivier, et al.
Published: (2025)

Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology
by: Jiao, Yuchen, et al.
Published: (2025)

TIFeD: a Tiny Integer-based Federated learning algorithm with Direct feedback alignment
by: Colombo, Luca, et al.
Published: (2024)

Ergodicity in reinforcement learning
by: Baumann, Dominik, et al.
Published: (2026)

Automated co-design of high-performance thermodynamic cycles via graph-based hierarchical reinforcement learning
by: Li, Wenqing, et al.
Published: (2026)

Streamlined optical training of large-scale modern deep learning architectures with direct feedback alignment
by: Wang, Ziao, et al.
Published: (2024)

Using reinforcement learning to improve drone-based inference of greenhouse gas fluxes
by: van Hove, Alouette, et al.
Published: (2024)

Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
by: Mirbakhsh, Shahin, et al.
Published: (2024)

Explainable deep reinforcement learning reveals energy-efficient control strategies for turbulent drag reduction
by: Tonti, Federica, et al.
Published: (2026)

SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning
by: Li, Na, et al.
Published: (2025)

Evaluating alignment between humans and neural network representations in image-based learning tasks
by: Demircan, Can, et al.
Published: (2023)

Stop Overvaluing Multi-Agent Debate -- We Must Rethink Evaluation and Embrace Model Heterogeneity
by: Zhang, Hangfan, et al.
Published: (2025)

Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
by: Abate, Arega Getaneh, et al.
Published: (2025)

Advanced deep-reinforcement-learning methods for flow control: group-invariant and positional-encoding networks improve learning speed and quality
by: Jeon, Joongoo, et al.
Published: (2024)

Risk-averse learning with delayed feedback
by: Wang, Siyi, et al.
Published: (2024)

Experimental evaluation of offline reinforcement learning for HVAC control in buildings
by: Wang, Jun, et al.
Published: (2024)

TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning
by: Fan, Shicheng, et al.
Published: (2026)

Recommender systems and reinforcement learning for human-building interaction and context-aware support: A text mining-driven review of scientific literature
by: Zhang, Wenhao, et al.
Published: (2024)

Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)

An introduction to reinforcement learning for neuroscience
by: Jensen, Kristopher T.
Published: (2023)

Optimistic Q-learning for average reward and episodic reinforcement learning
by: Agrawal, Priyank, et al.
Published: (2024)

Ethics2vec: aligning automatic agents and human preferences
by: Bontempi, Gianluca
Published: (2025)

Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs
by: Gan, Xingwei, et al.
Published: (2026)

LIRE: listwise reward enhancement for preference alignment
by: Zhu, Mingye, et al.
Published: (2024)

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
by: Wang, Guojian, et al.
Published: (2023)

Model predictive control-based value estimation for efficient reinforcement learning
by: Wu, Qizhen, et al.
Published: (2023)

Generalized Bayesian deep reinforcement learning
by: Roy, Shreya Sinha, et al.
Published: (2024)