| Main Authors: | Han, Jiale, Dai, Xiaowu, Zhu, Yuhua |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.00520 |
Similar Items
Online Auction Design Using Distribution-Free Uncertainty Quantification with Applications to E-Commerce
by: Han, Jiale, et al.
Published: (2024)
A Robust Multi-Item Auction Design with Statistical Learning
by: Han, Jiale, et al.
Published: (2023)
Variance Reduction Based Experience Replay for Policy Optimization
by: Zheng, Hua, et al.
Published: (2026)
Incentivizing Truthful Language Models via Peer Elicitation Games
by: Chen, Baiting, et al.
Published: (2025)
On the Convergence of Experience Replay in Policy Optimization: Characterizing Bias, Variance, and Finite-Time Convergence
by: Zheng, Hua, et al.
Published: (2021)
Post-Regularization Confidence Bands for Ordinary Differential Equations
by: Dai, Xiaowu, et al.
Published: (2021)
Uncertainty Prioritized Experience Replay
by: Carrasco-Davis, Rodrigo, et al.
Published: (2025)
Revisiting Experience Replayable Conditions
by: Kobayashi, Taisuke
Published: (2024)
Auto-bidding under Return-on-Spend Constraints with Uncertainty Quantification
by: Han, Jiale, et al.
Published: (2025)
Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)
Reliability-Adjusted Prioritized Experience Replay
by: Pleiss, Leonard S., et al.
Published: (2025)
Maximum Entropy Hindsight Experience Replay
by: Crowder, Douglas C., et al.
Published: (2024)
Experience Replay with Random Reshuffling
by: Fujita, Yasuhiro
Published: (2025)
VLM-Guided Experience Replay
by: Sharony, Elad, et al.
Published: (2026)
Ranked Set Sampling-Based Multilayer Perceptron: Improving Generalization via Variance-Based Bounds
by: Li, Feijiang, et al.
Published: (2025)
Enabling On-Device Learning via Experience Replay with Efficient Dataset Condensation
by: Xu, Gelei, et al.
Published: (2024)
SPRINT: Stochastic Performative Prediction With Variance Reduction
by: Xie, Tian, et al.
Published: (2025)
A Data Envelopment Analysis Approach for Assessing Fairness in Resource Allocation: Application to Kidney Exchange Programs
by: Kaazempur-Mofrad, Ali, et al.
Published: (2024)
Non-Uniform Memory Sampling in Experience Replay
by: Krutsylo, Andrii
Published: (2025)
Efficient RL Training for LLMs with Experience Replay
by: Arnal, Charles, et al.
Published: (2026)
On the Limitation and Experience Replay for GNNs in Continual Learning
by: Su, Junwei, et al.
Published: (2023)
Fairness-aware kidney exchange and kidney paired donation
by: Zhang, Mingrui, et al.
Published: (2025)
Effect Decomposition of Functional-Output Computer Experiments via Orthogonal Additive Gaussian Processes
by: Tan, Yu, et al.
Published: (2025)
ROER: Regularized Optimal Experience Replay
by: Li, Changling, et al.
Published: (2024)
Hindsight Experience Replay Accelerates Proximal Policy Optimization
by: Crowder, Douglas C., et al.
Published: (2024)
A Tighter Convergence Proof of Reverse Experience Replay
by: Jiang, Nan, et al.
Published: (2024)
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
by: Zhu, Yiwen, et al.
Published: (2024)
Mechanism Design for Quality-Preserving LLM Advertising
by: Han, Jiale, et al.
Published: (2026)
SelfReplay: Adapting Self-Supervised Sensory Models via Adaptive Meta-Task Replay
by: Yoon, Hyungjun, et al.
Published: (2024)
Prediction-Powered Conditional Inference
by: Sui, Yang, et al.
Published: (2026)
Conformal Risk Minimization with Variance Reduction
by: Noorani, Sima, et al.
Published: (2024)
Preliminary Tests of the Anticipatory Classifier System with Hindsight Experience Replay
by: Unold, Olgierd, et al.
Published: (2026)
Catastrophic Forgetting Mitigation via Discrepancy-Weighted Experience Replay
by: Xu, Xinrun, et al.
Published: (2025)
Ctrl-Z: Controlling AI Agents via Resampling
by: Bhatt, Aryan, et al.
Published: (2025)
Mastering the Game of Go with Self-play Experience Replay
by: Liu, Jingbin, et al.
Published: (2026)
ALIGN: Aligned Delegation with Performance Guarantees for Multi-Agent LLM Reasoning
by: Zhu, Tong, et al.
Published: (2026)
Controllable Generation via Locally Constrained Resampling
by: Ahmed, Kareem, et al.
Published: (2024)
AdaER: An Adaptive Experience Replay Approach for Continual Lifelong Learning
by: Li, Xingyu, et al.
Published: (2023)
On the Reduction of Variance and Overestimation of Deep Q-Learning
by: Sabry, Mohammed, et al.
Published: (2019)
Incentivized Exploration with Stochastic Covariates: A Two-Stage Mechanism Design for Recommender System
by: Li, Yuantong, et al.
Published: (2024)