:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Zhao, Runze, Yu, Yue, Wang, Ruhan, Huang, Chunfeng, Zhou, Dongruo
Formato:	Preprint
Publicado:	2025
Materias:	Machine Learning
Acceso en línea:	https://arxiv.org/abs/2508.02103
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
por: Zhao, Runze, et al.
Publicado: (2025)

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
por: Wang, Ruhan, et al.
Publicado: (2024)

How to Provably Improve Return Conditioned Supervised Learning?
por: Liu, Zhishuai, et al.
Publicado: (2025)

Dependency-aware Maximum Likelihood Estimation for Active Learning
por: Kalkanli, Beyza, et al.
Publicado: (2025)

Maximum Likelihood Reinforcement Learning
por: Tajwar, Fahim, et al.
Publicado: (2026)

Federated In-Context Learning: Iterative Refinement for Improved Answer Quality
por: Wang, Ruhan, et al.
Publicado: (2025)

On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
por: Yu, Yue, et al.
Publicado: (2025)

Provable Zero-Shot Generalization in Offline Reinforcement Learning
por: Wang, Zhiyong, et al.
Publicado: (2025)

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models
por: Wang, Ruhan, et al.
Publicado: (2026)

On Learning-Curve Monotonicity for Maximum Likelihood Estimators
por: Sellke, Mark, et al.
Publicado: (2025)

Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
por: Li, Shangzhe, et al.
Publicado: (2025)

Fine-Tuning Flow Matching via Maximum Likelihood Estimation of Reconstructions
por: Li, Zhaoyi, et al.
Publicado: (2025)

Offline Preference Optimization via Maximum Marginal Likelihood Estimation
por: Najafi, Saeed, et al.
Publicado: (2025)

Deriving the Scaled-Dot-Function via Maximum Likelihood Estimation and Maximum Entropy Approach
por: Ma, Jiyong
Publicado: (2025)

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
por: Wang, Zhiyong, et al.
Publicado: (2024)

Targeted Maximum Likelihood Estimation for Integral Projection Models in Population Ecology
por: Zhou, Yunzhe, et al.
Publicado: (2024)

Improved Techniques for Maximum Likelihood Estimation for Diffusion ODEs
por: Zheng, Kaiwen, et al.
Publicado: (2023)

HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search
por: Nguyen, Tuan Ngo, et al.
Publicado: (2024)

Breaking the $\log(1/Δ_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
por: Jin, Tianyuan, et al.
Publicado: (2025)

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
por: Di, Qiwei, et al.
Publicado: (2024)

Performance of Cross-Validated Targeted Maximum Likelihood Estimation
por: Smith, Matthew J., et al.
Publicado: (2024)

Variational Approximated Restricted Maximum Likelihood Estimation for Spatial Data
por: Thakur, Debjoy
Publicado: (2026)

Momentum SVGD-EM for Accelerated Maximum Marginal Likelihood Estimation
por: Rozzio, Adam, et al.
Publicado: (2026)

CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
por: Yang, Chen, et al.
Publicado: (2024)

Momentum Particle Maximum Likelihood
por: Lim, Jen Ning, et al.
Publicado: (2023)

Model-free Estimation of Latent Structure via Multiscale Nonparametric Maximum Likelihood
por: Aragam, Bryon, et al.
Publicado: (2024)

RL2ML: Finite-Rollout Surrogate Objectives from Reinforcement Learning to Maximum Likelihood
por: Zheng, Yifu
Publicado: (2026)

Interacting Particle Langevin Algorithm for Maximum Marginal Likelihood Estimation
por: Akyildiz, Ö. Deniz, et al.
Publicado: (2023)

Maximum Likelihood Learning of Latent Dynamics Without Reconstruction
por: Hromadka, Samo, et al.
Publicado: (2025)

Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control
por: Lee, Grayson, et al.
Publicado: (2026)

IMLE Policy: Fast and Sample Efficient Visuomotor Policy Learning via Implicit Maximum Likelihood Estimation
por: Rana, Krishan, et al.
Publicado: (2025)

Preference Elicitation for Multi-objective Combinatorial Optimization with Active Learning and Maximum Likelihood Estimation
por: Defresne, Marianne, et al.
Publicado: (2025)

Efficient Targeted Maximum Likelihood Estimators for Two-Phase Design Problems
por: Qiu, Sky, et al.
Publicado: (2026)

Hardness of Maximum Likelihood Learning of DPPs
por: Grigorescu, Elena, et al.
Publicado: (2022)

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
por: Wang, Zhiyong, et al.
Publicado: (2024)

From Data to Rewards: a Bilevel Optimization Perspective on Maximum Likelihood Estimation
por: Benechehab, Abdelhakim, et al.
Publicado: (2025)

Mitigating Instance Entanglement in Instance-Dependent Partial Label Learning
por: Zhao, Rui, et al.
Publicado: (2026)

Foundation of Calculating Normalized Maximum Likelihood for Continuous Probability Models
por: Suzuki, Atsushi, et al.
Publicado: (2024)

Sensor Design for Accuracy-Bounded Estimation via Maximum-Entropy Likelihood Synthesis
por: Bhattacharya, Raktim
Publicado: (2026)

Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds
por: Huang, Jiayi, et al.
Publicado: (2023)