Saved in:
| Main Authors: | Zhang, Jing, Fang, Linjiajie, Shi, Kexin, Wang, Wenjia, Jing, Bing-Yi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.20312 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
by: Fang, Linjiajie, et al.
Published: (2024)
by: Fang, Linjiajie, et al.
Published: (2024)
Optimistic Q-learning for average reward and episodic reinforcement learning
by: Agrawal, Priyank, et al.
Published: (2024)
by: Agrawal, Priyank, et al.
Published: (2024)
Online reinforcement learning via sparse Gaussian mixture model Q-functions
by: Vu, Minh, et al.
Published: (2025)
by: Vu, Minh, et al.
Published: (2025)
Causal prompting model-based offline reinforcement learning
by: Yu, Xuehui, et al.
Published: (2024)
by: Yu, Xuehui, et al.
Published: (2024)
Enhanced Bayesian Personalized Ranking for Robust Hard Negative Sampling in Recommender Systems
by: Shi, Kexin, et al.
Published: (2024)
by: Shi, Kexin, et al.
Published: (2024)
Regularized Q-learning
by: Lim, Han-Dong, et al.
Published: (2022)
by: Lim, Han-Dong, et al.
Published: (2022)
HyperQ-Opt: Q-learning for Hyperparameter Optimization
by: Hasan, Md. Tarek
Published: (2024)
by: Hasan, Md. Tarek
Published: (2024)
Transfer Q-learning
by: Chen, Elynn, et al.
Published: (2022)
by: Chen, Elynn, et al.
Published: (2022)
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation
by: Park, Jaehyun, et al.
Published: (2024)
by: Park, Jaehyun, et al.
Published: (2024)
Generalisation in Multitask Fitted Q-Iteration and Offline Q-learning
by: Manda, Kausthubh, et al.
Published: (2025)
by: Manda, Kausthubh, et al.
Published: (2025)
Q-learning with Posterior Sampling
by: Agrawal, Priyank, et al.
Published: (2025)
by: Agrawal, Priyank, et al.
Published: (2025)
An Index Policy Based on Sarsa and Q-learning for Heterogeneous Smart Target Tracking
by: Hao, Yuhang, et al.
Published: (2024)
by: Hao, Yuhang, et al.
Published: (2024)
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
by: Zhang, Yixuan, et al.
Published: (2024)
by: Zhang, Yixuan, et al.
Published: (2024)
Deep Double Q-learning
by: Nagarajan, Prabhat, et al.
Published: (2025)
by: Nagarajan, Prabhat, et al.
Published: (2025)
Sample Complexity of Variance-reduced Distributionally Robust Q-learning
by: Wang, Shengbo, et al.
Published: (2023)
by: Wang, Shengbo, et al.
Published: (2023)
Q-learning with Adjoint Matching
by: Li, Qiyang, et al.
Published: (2026)
by: Li, Qiyang, et al.
Published: (2026)
Gap-Dependent Bounds for Federated $Q$-learning
by: Zhang, Haochen, et al.
Published: (2025)
by: Zhang, Haochen, et al.
Published: (2025)
Q-learning as a monotone scheme
by: Yang, Lingyi
Published: (2024)
by: Yang, Lingyi
Published: (2024)
A Finite Sample Complexity Bound for Distributionally Robust Q-learning
by: Wang, Shengbo, et al.
Published: (2023)
by: Wang, Shengbo, et al.
Published: (2023)
Expert or not? assessing data quality in offline reinforcement learning
by: Asadulaev, Arip, et al.
Published: (2025)
by: Asadulaev, Arip, et al.
Published: (2025)
Experimental evaluation of offline reinforcement learning for HVAC control in buildings
by: Wang, Jun, et al.
Published: (2024)
by: Wang, Jun, et al.
Published: (2024)
Deep Q-Network (DQN) multi-agent reinforcement learning (MARL) for Stock Trading
by: Tidwell, John Christopher, et al.
Published: (2025)
by: Tidwell, John Christopher, et al.
Published: (2025)
Gaussian Approximation for Asynchronous Q-learning
by: Rubtsov, Artemy, et al.
Published: (2026)
by: Rubtsov, Artemy, et al.
Published: (2026)
MinMaxMin $Q$-learning
by: Soffair, Nitsan, et al.
Published: (2024)
by: Soffair, Nitsan, et al.
Published: (2024)
Is Q-learning an Ill-posed Problem?
by: Wissmann, Philipp, et al.
Published: (2025)
by: Wissmann, Philipp, et al.
Published: (2025)
PyCFRL: A Python library for counterfactually fair offline reinforcement learning via sequential data preprocessing
by: Zhang, Jianhan, et al.
Published: (2025)
by: Zhang, Jianhan, et al.
Published: (2025)
rQdia: Regularizing Q-Value Distributions With Image Augmentation
by: Lerman, Sam, et al.
Published: (2025)
by: Lerman, Sam, et al.
Published: (2025)
VA-learning as a more efficient alternative to Q-learning
by: Tang, Yunhao, et al.
Published: (2023)
by: Tang, Yunhao, et al.
Published: (2023)
Model predictive control-based value estimation for efficient reinforcement learning
by: Wu, Qizhen, et al.
Published: (2023)
by: Wu, Qizhen, et al.
Published: (2023)
Q-learning with temporal memory to navigate turbulence
by: Rando, Marco, et al.
Published: (2024)
by: Rando, Marco, et al.
Published: (2024)
Regularized Q-learning through Robust Averaging
by: Schmitt-Förster, Peter, et al.
Published: (2024)
by: Schmitt-Förster, Peter, et al.
Published: (2024)
Stabilizing Extreme Q-learning by Maclaurin Expansion
by: Omura, Motoki, et al.
Published: (2024)
by: Omura, Motoki, et al.
Published: (2024)
Asymptotic Analysis of Sample-averaged Q-learning
by: Panda, Saunak Kumar, et al.
Published: (2024)
by: Panda, Saunak Kumar, et al.
Published: (2024)
Tensor-Efficient High-Dimensional Q-learning
by: Wu, Junyi, et al.
Published: (2025)
by: Wu, Junyi, et al.
Published: (2025)
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
by: Zhang, Jing, et al.
Published: (2023)
by: Zhang, Jing, et al.
Published: (2023)
Periodic agent-state based Q-learning for POMDPs
by: Sinha, Amit, et al.
Published: (2024)
by: Sinha, Amit, et al.
Published: (2024)
Maximum entropy GFlowNets with soft Q-learning
by: Mohammadpour, Sobhan, et al.
Published: (2023)
by: Mohammadpour, Sobhan, et al.
Published: (2023)
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
by: Li, Haoran, et al.
Published: (2024)
by: Li, Haoran, et al.
Published: (2024)
Robust $Q$-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty
by: Neufeld, Ariel, et al.
Published: (2022)
by: Neufeld, Ariel, et al.
Published: (2022)
Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
by: Benechehab, Abdelhakim, et al.
Published: (2024)
by: Benechehab, Abdelhakim, et al.
Published: (2024)
Similar Items
-
Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning
by: Fang, Linjiajie, et al.
Published: (2024) -
Optimistic Q-learning for average reward and episodic reinforcement learning
by: Agrawal, Priyank, et al.
Published: (2024) -
Online reinforcement learning via sparse Gaussian mixture model Q-functions
by: Vu, Minh, et al.
Published: (2025) -
Causal prompting model-based offline reinforcement learning
by: Yu, Xuehui, et al.
Published: (2024) -
Enhanced Bayesian Personalized Ranking for Robust Hard Negative Sampling in Recommender Systems
by: Shi, Kexin, et al.
Published: (2024)