:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zu, Lipeng, Zhou, Hansong, Zhang, Xiaonan
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Machine Learning
Online-Zugang:	https://arxiv.org/abs/2511.03836
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Behavior-Adaptive Q-Learning: A Unifying Framework for Offline-to-Online RL
von: Zu, Lipeng, et al.
Veröffentlicht: (2025)

FedAR: Addressing Client Unavailability in Federated Learning with Local Update Approximation and Rectification
von: Jiang, Chutian, et al.
Veröffentlicht: (2024)

From Static Constraints to Dynamic Adaptation: Sample-Level Constraint Relaxation for Offline-to-Online Reinforcement Learning
von: Zu, Lipeng, et al.
Veröffentlicht: (2025)

Digi-Q: Learning Q-Value Functions for Training Device-Control Agents
von: Bai, Hao, et al.
Veröffentlicht: (2025)

The Role of Target Update Frequencies in Q-Learning
von: Weissmann, Simon, et al.
Veröffentlicht: (2026)

Expert Q-learning: Deep Reinforcement Learning with Coarse State Values from Offline Expert Examples
von: Meng, Li, et al.
Veröffentlicht: (2021)

Learning Model Successors
von: Chang, Yingshan, et al.
Veröffentlicht: (2025)

Iterated $Q$-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning
von: Vincent, Théo, et al.
Veröffentlicht: (2024)

Deep Q-Exponential Processes
von: Chang, Zhi, et al.
Veröffentlicht: (2024)

Distributionally Robust Deep Q-Learning
von: Lu, Chung I, et al.
Veröffentlicht: (2025)

On the Reduction of Variance and Overestimation of Deep Q-Learning
von: Sabry, Mohammed, et al.
Veröffentlicht: (2019)

Structured Difference-of-Q via Orthogonal Learning
von: Cao, Defu, et al.
Veröffentlicht: (2024)

Fast Adaptive Anti-Jamming Channel Access via Deep Q Learning and Coarse-Grained Spectrum Prediction
von: Zhang, Jianshu, et al.
Veröffentlicht: (2025)

Learning Successor Features the Simple Way
von: Chua, Raymond, et al.
Veröffentlicht: (2024)

SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
von: Zhang, Shuai, et al.
Veröffentlicht: (2024)

SDM-Q: Cost-Aware Staged Decision-Making for Multi-Omics Classification with Deep Q-Learning
von: Mu, Nan, et al.
Veröffentlicht: (2026)

Peng's Q($λ$) for Conservative Value Estimation in Offline Reinforcement Learning
von: Kim, Byeongchan, et al.
Veröffentlicht: (2026)

Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
von: Zhang, Jing, et al.
Veröffentlicht: (2024)

Deep Q-Learning with Gradient Target Tracking
von: Park, Bum Geun, et al.
Veröffentlicht: (2025)

VerifierQ: Enhancing LLM Test Time Compute with Q-Learning-based Verifiers
von: Qi, Jianing, et al.
Veröffentlicht: (2024)

Moment Matching Q-Learning
von: Yiyan, et al.
Veröffentlicht: (2026)

Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics
von: Grillotti, Luca, et al.
Veröffentlicht: (2024)

$β$-DQN: Improving Deep Q-Learning By Evolving the Behavior
von: Zhang, Hongming, et al.
Veröffentlicht: (2025)

Deep Double Q-learning
von: Nagarajan, Prabhat, et al.
Veröffentlicht: (2025)

Optimization and Application of Cloud-based Deep Learning Architecture for Multi-Source Data Prediction
von: Zhang, Yang, et al.
Veröffentlicht: (2024)

Adaptive Action Chunking via Multi-Chunk Q Value Estimation
von: Shin, Yongjae, et al.
Veröffentlicht: (2026)

Deep Transfer $Q$-Learning for Offline Non-Stationary Reinforcement Learning
von: Chai, Jinhang, et al.
Veröffentlicht: (2025)

Quantile Q-Learning: Revisiting Offline Extreme Q-Learning with Quantile Regression
von: Gao, Xinming, et al.
Veröffentlicht: (2025)

Residual Q-Learning: Offline and Online Policy Customization without Value
von: Li, Chenran, et al.
Veröffentlicht: (2023)

Adaptive Federated Learning Defences via Trust-Aware Deep Q-Networks
von: Palit, Vedant
Veröffentlicht: (2025)

Universal Approximation Theorem for Deep Q-Learning via FBSDE System
von: Qi, Qian
Veröffentlicht: (2025)

UAV Trajectory Optimization via Improved Noisy Deep Q-Network
von: Hengyu, Zhang, et al.
Veröffentlicht: (2026)

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
von: Hong, Joey, et al.
Veröffentlicht: (2024)

Successor-Predecessor Intrinsic Exploration
von: Yu, Changmin, et al.
Veröffentlicht: (2023)

Deep Reinforcement Learning with Spiking Q-learning
von: Chen, Ding, et al.
Veröffentlicht: (2022)

Quantitative Trading using Deep Q Learning
von: Sarkar, Soumyadip
Veröffentlicht: (2023)

Surrogate Ensemble in Expensive Multi-Objective Optimization via Deep Q-Learning
von: Wu, Yuxin, et al.
Veröffentlicht: (2026)

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
von: Jain, Arnav Kumar, et al.
Veröffentlicht: (2024)

MuonQ: Enhancing Low-Bit Muon Quantization via Directional Fidelity Optimization
von: Su, Yupeng, et al.
Veröffentlicht: (2026)

Beyond ReLU: Chebyshev-DQN for Enhanced Deep Q-Networks
von: Yazdannik, Saman, et al.
Veröffentlicht: (2025)