Guardado en:
| Autores principales: | Zhao, Runze, Yu, Yue, Wang, Ruhan, Huang, Chunfeng, Zhou, Dongruo |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2508.02103 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
por: Zhao, Runze, et al.
Publicado: (2025)
por: Zhao, Runze, et al.
Publicado: (2025)
Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
por: Wang, Ruhan, et al.
Publicado: (2024)
por: Wang, Ruhan, et al.
Publicado: (2024)
How to Provably Improve Return Conditioned Supervised Learning?
por: Liu, Zhishuai, et al.
Publicado: (2025)
por: Liu, Zhishuai, et al.
Publicado: (2025)
Dependency-aware Maximum Likelihood Estimation for Active Learning
por: Kalkanli, Beyza, et al.
Publicado: (2025)
por: Kalkanli, Beyza, et al.
Publicado: (2025)
Maximum Likelihood Reinforcement Learning
por: Tajwar, Fahim, et al.
Publicado: (2026)
por: Tajwar, Fahim, et al.
Publicado: (2026)
Federated In-Context Learning: Iterative Refinement for Improved Answer Quality
por: Wang, Ruhan, et al.
Publicado: (2025)
por: Wang, Ruhan, et al.
Publicado: (2025)
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
por: Yu, Yue, et al.
Publicado: (2025)
por: Yu, Yue, et al.
Publicado: (2025)
Provable Zero-Shot Generalization in Offline Reinforcement Learning
por: Wang, Zhiyong, et al.
Publicado: (2025)
por: Wang, Zhiyong, et al.
Publicado: (2025)
FERA: Uncertainty-Aware Federated Reasoning for Large Language Models
por: Wang, Ruhan, et al.
Publicado: (2026)
por: Wang, Ruhan, et al.
Publicado: (2026)
On Learning-Curve Monotonicity for Maximum Likelihood Estimators
por: Sellke, Mark, et al.
Publicado: (2025)
por: Sellke, Mark, et al.
Publicado: (2025)
Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
por: Li, Shangzhe, et al.
Publicado: (2025)
por: Li, Shangzhe, et al.
Publicado: (2025)
Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions
por: Li, Zhaoyi, et al.
Publicado: (2025)
por: Li, Zhaoyi, et al.
Publicado: (2025)
Offline Preference Optimization via Maximum Marginal Likelihood Estimation
por: Najafi, Saeed, et al.
Publicado: (2025)
por: Najafi, Saeed, et al.
Publicado: (2025)
Deriving the Scaled-Dot-Function via Maximum Likelihood Estimation and Maximum Entropy Approach
por: Ma, Jiyong
Publicado: (2025)
por: Ma, Jiyong
Publicado: (2025)
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
por: Wang, Zhiyong, et al.
Publicado: (2024)
por: Wang, Zhiyong, et al.
Publicado: (2024)
Targeted Maximum Likelihood Estimation for Integral Projection Models in Population Ecology
por: Zhou, Yunzhe, et al.
Publicado: (2024)
por: Zhou, Yunzhe, et al.
Publicado: (2024)
Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
por: Zheng, Kaiwen, et al.
Publicado: (2023)
por: Zheng, Kaiwen, et al.
Publicado: (2023)
HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search
por: Nguyen, Tuan Ngo, et al.
Publicado: (2024)
por: Nguyen, Tuan Ngo, et al.
Publicado: (2024)
Breaking the $\log(1/Δ_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
por: Jin, Tianyuan, et al.
Publicado: (2025)
por: Jin, Tianyuan, et al.
Publicado: (2025)
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
por: Di, Qiwei, et al.
Publicado: (2024)
por: Di, Qiwei, et al.
Publicado: (2024)
Performance of Cross-Validated Targeted Maximum Likelihood Estimation
por: Smith, Matthew J., et al.
Publicado: (2024)
por: Smith, Matthew J., et al.
Publicado: (2024)
Variational Approximated Restricted Maximum Likelihood Estimation for Spatial Data
por: Thakur, Debjoy
Publicado: (2026)
por: Thakur, Debjoy
Publicado: (2026)
Momentum SVGD-EM for Accelerated Maximum Marginal Likelihood Estimation
por: Rozzio, Adam, et al.
Publicado: (2026)
por: Rozzio, Adam, et al.
Publicado: (2026)
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
por: Yang, Chen, et al.
Publicado: (2024)
por: Yang, Chen, et al.
Publicado: (2024)
Momentum Particle Maximum Likelihood
por: Lim, Jen Ning, et al.
Publicado: (2023)
por: Lim, Jen Ning, et al.
Publicado: (2023)
Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood
por: Aragam, Bryon, et al.
Publicado: (2024)
por: Aragam, Bryon, et al.
Publicado: (2024)
RL2ML: Finite-Rollout Surrogate Objectives from Reinforcement Learning to Maximum Likelihood
por: Zheng, Yifu
Publicado: (2026)
por: Zheng, Yifu
Publicado: (2026)
Interacting Particle Langevin Algorithm for Maximum Marginal Likelihood Estimation
por: Akyildiz, Ö. Deniz, et al.
Publicado: (2023)
por: Akyildiz, Ö. Deniz, et al.
Publicado: (2023)
Maximum Likelihood Learning of Latent Dynamics Without Reconstruction
por: Hromadka, Samo, et al.
Publicado: (2025)
por: Hromadka, Samo, et al.
Publicado: (2025)
Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control
por: Lee, Grayson, et al.
Publicado: (2026)
por: Lee, Grayson, et al.
Publicado: (2026)
IMLE Policy: Fast and Sample Efficient Visuomotor Policy Learning via Implicit Maximum Likelihood Estimation
por: Rana, Krishan, et al.
Publicado: (2025)
por: Rana, Krishan, et al.
Publicado: (2025)
Preference Elicitation for Multi-objective Combinatorial Optimization with Active Learning and Maximum Likelihood Estimation
por: Defresne, Marianne, et al.
Publicado: (2025)
por: Defresne, Marianne, et al.
Publicado: (2025)
Efficient Targeted Maximum Likelihood Estimators for Two-Phase Design Problems
por: Qiu, Sky, et al.
Publicado: (2026)
por: Qiu, Sky, et al.
Publicado: (2026)
Hardness of Maximum Likelihood Learning of DPPs
por: Grigorescu, Elena, et al.
Publicado: (2022)
por: Grigorescu, Elena, et al.
Publicado: (2022)
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
por: Wang, Zhiyong, et al.
Publicado: (2024)
por: Wang, Zhiyong, et al.
Publicado: (2024)
From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation
por: Benechehab, Abdelhakim, et al.
Publicado: (2025)
por: Benechehab, Abdelhakim, et al.
Publicado: (2025)
Mitigating Instance Entanglement in Instance-Dependent Partial Label Learning
por: Zhao, Rui, et al.
Publicado: (2026)
por: Zhao, Rui, et al.
Publicado: (2026)
Foundation of Calculating Normalized Maximum Likelihood for Continuous Probability Models
por: Suzuki, Atsushi, et al.
Publicado: (2024)
por: Suzuki, Atsushi, et al.
Publicado: (2024)
Sensor Design for Accuracy-Bounded Estimation via Maximum-Entropy Likelihood Synthesis
por: Bhattacharya, Raktim
Publicado: (2026)
por: Bhattacharya, Raktim
Publicado: (2026)
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds
por: Huang, Jiayi, et al.
Publicado: (2023)
por: Huang, Jiayi, et al.
Publicado: (2023)
Ejemplares similares
-
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
por: Zhao, Runze, et al.
Publicado: (2025) -
Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
por: Wang, Ruhan, et al.
Publicado: (2024) -
How to Provably Improve Return Conditioned Supervised Learning?
por: Liu, Zhishuai, et al.
Publicado: (2025) -
Dependency-aware Maximum Likelihood Estimation for Active Learning
por: Kalkanli, Beyza, et al.
Publicado: (2025) -
Maximum Likelihood Reinforcement Learning
por: Tajwar, Fahim, et al.
Publicado: (2026)