Guardat en:
| Autors principals: | Chai, Jinhang, Chen, Elynn, Yang, Lin |
|---|---|
| Format: | Preprint |
| Publicat: |
2025
|
| Matèries: | |
| Accés en línia: | https://arxiv.org/abs/2502.00534 |
| Etiquetes: |
Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!
|
Ítems similars
Deep Transfer $Q$-Learning for Offline Non-Stationary Reinforcement Learning
per: Chai, Jinhang, et al.
Publicat: (2025)
per: Chai, Jinhang, et al.
Publicat: (2025)
Low-Rank Plus Sparse Matrix Transfer Learning under Growing Representations and Ambient Dimensions
per: Chai, Jinhang, et al.
Publicat: (2026)
per: Chai, Jinhang, et al.
Publicat: (2026)
One-Step Bellman Alignment Enables Provably Efficient Transfer in Online RL
per: Chen, Elynn, et al.
Publicat: (2026)
per: Chen, Elynn, et al.
Publicat: (2026)
Structured Matrix Learning under Arbitrary Entrywise Dependence and Estimation of Markov Transition Kernel
per: Chai, Jinhang, et al.
Publicat: (2024)
per: Chai, Jinhang, et al.
Publicat: (2024)
Data-Driven Knowledge Transfer in Batch $Q^*$ Learning
per: Chen, Elynn, et al.
Publicat: (2024)
per: Chen, Elynn, et al.
Publicat: (2024)
Transfer Q-learning
per: Chen, Elynn, et al.
Publicat: (2022)
per: Chen, Elynn, et al.
Publicat: (2022)
Transfer Learning for Contextual Joint Assortment-Pricing under Cross-Market Heterogeneity
per: Chen, Elynn, et al.
Publicat: (2026)
per: Chen, Elynn, et al.
Publicat: (2026)
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes
per: Wan, Yi, et al.
Publicat: (2024)
per: Wan, Yi, et al.
Publicat: (2024)
Transition Constrained Bayesian Optimization via Markov Decision Processes
per: Folch, Jose Pablo, et al.
Publicat: (2024)
per: Folch, Jose Pablo, et al.
Publicat: (2024)
Transfer Faster, Price Smarter: Minimax Dynamic Pricing under Cross-Market Preference Shift
per: Zhang, Yi, et al.
Publicat: (2025)
per: Zhang, Yi, et al.
Publicat: (2025)
Provably Efficient Reward Transfer in Reinforcement Learning with Discrete Markov Decision Processes
per: Vora, Kevin, et al.
Publicat: (2025)
per: Vora, Kevin, et al.
Publicat: (2025)
Federated Control in Markov Decision Processes
per: Jin, Hao, et al.
Publicat: (2024)
per: Jin, Hao, et al.
Publicat: (2024)
Learning in Markov Decision Processes with Exogenous Dynamics
per: Maran, Davide, et al.
Publicat: (2026)
per: Maran, Davide, et al.
Publicat: (2026)
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
per: Bozkus, Talha, et al.
Publicat: (2024)
per: Bozkus, Talha, et al.
Publicat: (2024)
A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning
per: Banse, Adrien, et al.
Publicat: (2024)
per: Banse, Adrien, et al.
Publicat: (2024)
Prior-Aligned Meta-RL: Thompson Sampling with Learned Priors and Guarantees in Finite-Horizon MDPs
per: Zhou, Runlin, et al.
Publicat: (2025)
per: Zhou, Runlin, et al.
Publicat: (2025)
Monitored Markov Decision Processes
per: Parisi, Simone, et al.
Publicat: (2024)
per: Parisi, Simone, et al.
Publicat: (2024)
Learning Utilities from Demonstrations in Markov Decision Processes
per: Lazzati, Filippo, et al.
Publicat: (2024)
per: Lazzati, Filippo, et al.
Publicat: (2024)
A Unified Theory of Compositionality, Modularity, and Interpretability in Markov Decision Processes
per: Ringstrom, Thomas J., et al.
Publicat: (2025)
per: Ringstrom, Thomas J., et al.
Publicat: (2025)
SMART Fine-tuning Factor Augmented Neural Lasso
per: Chai, Jinhang, et al.
Publicat: (2026)
per: Chai, Jinhang, et al.
Publicat: (2026)
MATE: Solving Contextual Markov Decision Processes with Memory of Accumulated Transition Embeddings
per: Hwang, Himchan, et al.
Publicat: (2026)
per: Hwang, Himchan, et al.
Publicat: (2026)
Non-stationary and Varying-discounting Markov Decision Processes for Reinforcement Learning
per: Chen, Zhizuo, et al.
Publicat: (2025)
per: Chen, Zhizuo, et al.
Publicat: (2025)
Generalized Linear Markov Decision Process
per: Zhang, Sinian, et al.
Publicat: (2025)
per: Zhang, Sinian, et al.
Publicat: (2025)
Robust $Q$-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty
per: Neufeld, Ariel, et al.
Publicat: (2022)
per: Neufeld, Ariel, et al.
Publicat: (2022)
Horizon-Free Regret for Linear Markov Decision Processes
per: Zhang, Zihan, et al.
Publicat: (2024)
per: Zhang, Zihan, et al.
Publicat: (2024)
Learning Markov Decision Processes under Fully Bandit Feedback
per: Zhuo, Zhengjia, et al.
Publicat: (2026)
per: Zhuo, Zhengjia, et al.
Publicat: (2026)
Localized exploration in contextual dynamic pricing achieves dimension-free regret
per: Chai, Jinhang, et al.
Publicat: (2024)
per: Chai, Jinhang, et al.
Publicat: (2024)
Weakly Time-Coupled Approximation of Markov Decision Processes
per: Soheili, Negar, et al.
Publicat: (2026)
per: Soheili, Negar, et al.
Publicat: (2026)
Optimal Decision Tree Policies for Markov Decision Processes
per: Vos, Daniël, et al.
Publicat: (2023)
per: Vos, Daniël, et al.
Publicat: (2023)
Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes
per: Montenegro, Alessandro, et al.
Publicat: (2025)
per: Montenegro, Alessandro, et al.
Publicat: (2025)
Interaction-Grounded Learning for Contextual Markov Decision Processes with Personalized Feedback
per: Zhang, Mengxiao, et al.
Publicat: (2026)
per: Zhang, Mengxiao, et al.
Publicat: (2026)
Impact of Markov Decision Process Design on Sim-to-Real Reinforcement Learning
per: Krau, Tatjana, et al.
Publicat: (2026)
per: Krau, Tatjana, et al.
Publicat: (2026)
Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints
per: Stradi, Francesco Emanuele, et al.
Publicat: (2024)
per: Stradi, Francesco Emanuele, et al.
Publicat: (2024)
Policy Testing in Markov Decision Processes
per: Ariu, Kaito, et al.
Publicat: (2025)
per: Ariu, Kaito, et al.
Publicat: (2025)
Best-of-Both-Worlds for Heavy-Tailed Markov Decision Processes
per: Chen, Yu, et al.
Publicat: (2026)
per: Chen, Yu, et al.
Publicat: (2026)
Optimistic Actor-Critic with Parametric Policies for Linear Markov Decision Processes
per: Lin, Max Qiushi, et al.
Publicat: (2026)
per: Lin, Max Qiushi, et al.
Publicat: (2026)
Tensor-Fused Multi-View Graph Contrastive Learning
per: Wu, Yujia, et al.
Publicat: (2024)
per: Wu, Yujia, et al.
Publicat: (2024)
Performative Reinforcement Learning with Linear Markov Decision Process
per: Mandal, Debmalya, et al.
Publicat: (2024)
per: Mandal, Debmalya, et al.
Publicat: (2024)
High-Dimensional Tensor Discriminant Analysis: Low-Rank Discriminant Structure, Representation Synergy, and Theoretical Guarantees
per: Chen, Elynn, et al.
Publicat: (2025)
per: Chen, Elynn, et al.
Publicat: (2025)
Markov Decision Processes under External Temporal Processes
per: Ayyagari, Ranga Shaarad, et al.
Publicat: (2023)
per: Ayyagari, Ranga Shaarad, et al.
Publicat: (2023)
Ítems similars
-
Deep Transfer $Q$-Learning for Offline Non-Stationary Reinforcement Learning
per: Chai, Jinhang, et al.
Publicat: (2025) -
Low-Rank Plus Sparse Matrix Transfer Learning under Growing Representations and Ambient Dimensions
per: Chai, Jinhang, et al.
Publicat: (2026) -
One-Step Bellman Alignment Enables Provably Efficient Transfer in Online RL
per: Chen, Elynn, et al.
Publicat: (2026) -
Structured Matrix Learning under Arbitrary Entrywise Dependence and Estimation of Markov Transition Kernel
per: Chai, Jinhang, et al.
Publicat: (2024) -
Data-Driven Knowledge Transfer in Batch $Q^*$ Learning
per: Chen, Elynn, et al.
Publicat: (2024)