Saved in:
| Main Authors: | Liu, Shicheng, Xu, Siyuan, Qiu, Wenjie, Zhang, Hangfan, Zhu, Minghui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.13837 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization
by: Mao, Yue, et al.
Published: (2026)
by: Mao, Yue, et al.
Published: (2026)
Federated reinforcement learning for robot motion planning with zero-shot generalization
by: Yuan, Zhenyuan, et al.
Published: (2024)
by: Yuan, Zhenyuan, et al.
Published: (2024)
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
by: Liu, Shicheng, et al.
Published: (2024)
by: Liu, Shicheng, et al.
Published: (2024)
Learning to summarize user information for personalized reinforcement learning from human feedback
by: Nam, Hyunji, et al.
Published: (2025)
by: Nam, Hyunji, et al.
Published: (2025)
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
by: Xu, Siyuan, et al.
Published: (2024)
by: Xu, Siyuan, et al.
Published: (2024)
Delayed homomorphic reinforcement learning for environments with delayed feedback
by: Lee, Jongsoo, et al.
Published: (2026)
by: Lee, Jongsoo, et al.
Published: (2026)
Curriculum reinforcement learning with measurable task representation learning
by: Wen, Yongyan, et al.
Published: (2026)
by: Wen, Yongyan, et al.
Published: (2026)
Explainable deep learning improves human mental models of self-driving cars
by: Kenny, Eoin M., et al.
Published: (2024)
by: Kenny, Eoin M., et al.
Published: (2024)
Byzantine-resilient federated online learning for Gaussian process regression
by: Zhang, Xu, et al.
Published: (2025)
by: Zhang, Xu, et al.
Published: (2025)
Simple Denoising Diffusion Language Models
by: Zhu, Huaisheng, et al.
Published: (2025)
by: Zhu, Huaisheng, et al.
Published: (2025)
Using reinforcement learning to probe the role of feedback in skill acquisition
by: Terpin, Antonio, et al.
Published: (2025)
by: Terpin, Antonio, et al.
Published: (2025)
Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback
by: Saikai, Yuji, et al.
Published: (2023)
by: Saikai, Yuji, et al.
Published: (2023)
A Bayesian latent class reinforcement learning framework to capture adaptive, feedback-driven travel behaviour
by: Sfeir, Georges, et al.
Published: (2025)
by: Sfeir, Georges, et al.
Published: (2025)
Leveraging weights signals -- Predicting and improving generalizability in reinforcement learning
by: Moulin, Olivier, et al.
Published: (2025)
by: Moulin, Olivier, et al.
Published: (2025)
Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology
by: Jiao, Yuchen, et al.
Published: (2025)
by: Jiao, Yuchen, et al.
Published: (2025)
TIFeD: a Tiny Integer-based Federated learning algorithm with Direct feedback alignment
by: Colombo, Luca, et al.
Published: (2024)
by: Colombo, Luca, et al.
Published: (2024)
Ergodicity in reinforcement learning
by: Baumann, Dominik, et al.
Published: (2026)
by: Baumann, Dominik, et al.
Published: (2026)
Automated co-design of high-performance thermodynamic cycles via graph-based hierarchical reinforcement learning
by: Li, Wenqing, et al.
Published: (2026)
by: Li, Wenqing, et al.
Published: (2026)
Streamlined optical training of large-scale modern deep learning architectures with direct feedback alignment
by: Wang, Ziao, et al.
Published: (2024)
by: Wang, Ziao, et al.
Published: (2024)
Using reinforcement learning to improve drone-based inference of greenhouse gas fluxes
by: van Hove, Alouette, et al.
Published: (2024)
by: van Hove, Alouette, et al.
Published: (2024)
Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
by: Mirbakhsh, Shahin, et al.
Published: (2024)
by: Mirbakhsh, Shahin, et al.
Published: (2024)
Explainable deep reinforcement learning reveals energy-efficient control strategies for turbulent drag reduction
by: Tonti, Federica, et al.
Published: (2026)
by: Tonti, Federica, et al.
Published: (2026)
SHAP-Guided Kernel Actor-Critic for Explainable Reinforcement Learning
by: Li, Na, et al.
Published: (2025)
by: Li, Na, et al.
Published: (2025)
Evaluating alignment between humans and neural network representations in image-based learning tasks
by: Demircan, Can, et al.
Published: (2023)
by: Demircan, Can, et al.
Published: (2023)
Stop Overvaluing Multi-Agent Debate -- We Must Rethink Evaluation and Embrace Model Heterogeneity
by: Zhang, Hangfan, et al.
Published: (2025)
by: Zhang, Hangfan, et al.
Published: (2025)
Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
by: Abate, Arega Getaneh, et al.
Published: (2025)
by: Abate, Arega Getaneh, et al.
Published: (2025)
Advanced deep-reinforcement-learning methods for flow control: group-invariant and positional-encoding networks improve learning speed and quality
by: Jeon, Joongoo, et al.
Published: (2024)
by: Jeon, Joongoo, et al.
Published: (2024)
Risk-averse learning with delayed feedback
by: Wang, Siyi, et al.
Published: (2024)
by: Wang, Siyi, et al.
Published: (2024)
Experimental evaluation of offline reinforcement learning for HVAC control in buildings
by: Wang, Jun, et al.
Published: (2024)
by: Wang, Jun, et al.
Published: (2024)
TRACE: Trajectory Recovery for Continuous Mechanism Evolution in Causal Representation Learning
by: Fan, Shicheng, et al.
Published: (2026)
by: Fan, Shicheng, et al.
Published: (2026)
Recommender systems and reinforcement learning for human-building interaction and context-aware support: A text mining-driven review of scientific literature
by: Zhang, Wenhao, et al.
Published: (2024)
by: Zhang, Wenhao, et al.
Published: (2024)
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)
by: Lyle, Clare, et al.
Published: (2024)
An introduction to reinforcement learning for neuroscience
by: Jensen, Kristopher T.
Published: (2023)
by: Jensen, Kristopher T.
Published: (2023)
Optimistic Q-learning for average reward and episodic reinforcement learning
by: Agrawal, Priyank, et al.
Published: (2024)
by: Agrawal, Priyank, et al.
Published: (2024)
Ethics2vec: aligning automatic agents and human preferences
by: Bontempi, Gianluca
Published: (2025)
by: Bontempi, Gianluca
Published: (2025)
Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs
by: Gan, Xingwei, et al.
Published: (2026)
by: Gan, Xingwei, et al.
Published: (2026)
LIRE: listwise reward enhancement for preference alignment
by: Zhu, Mingye, et al.
Published: (2024)
by: Zhu, Mingye, et al.
Published: (2024)
Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
by: Wang, Guojian, et al.
Published: (2023)
by: Wang, Guojian, et al.
Published: (2023)
Model predictive control-based value estimation for efficient reinforcement learning
by: Wu, Qizhen, et al.
Published: (2023)
by: Wu, Qizhen, et al.
Published: (2023)
Generalized Bayesian deep reinforcement learning
by: Roy, Shreya Sinha, et al.
Published: (2024)
by: Roy, Shreya Sinha, et al.
Published: (2024)
Similar Items
-
Interactive Inverse Reinforcement Learning of Interaction Scenarios via Bi-level Optimization
by: Mao, Yue, et al.
Published: (2026) -
Federated reinforcement learning for robot motion planning with zero-shot generalization
by: Yuan, Zhenyuan, et al.
Published: (2024) -
In-Trajectory Inverse Reinforcement Learning: Learn Incrementally Before An Ongoing Trajectory Terminates
by: Liu, Shicheng, et al.
Published: (2024) -
Learning to summarize user information for personalized reinforcement learning from human feedback
by: Nam, Hyunji, et al.
Published: (2025) -
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator
by: Xu, Siyuan, et al.
Published: (2024)