:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zeng, Sihan, Bhatt, Sujay, Ganesh, Sumitra, Koppel, Alec
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Machine Learning Optimization and Control
Online-Zugang:	https://arxiv.org/abs/2601.16399
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Learning in Herding Mean Field Games: Single-Loop Algorithm with Finite-Time Convergence Analysis
von: Zeng, Sihan, et al.
Veröffentlicht: (2024)

Partially Observable Contextual Bandits with Linear Payoffs
von: Zeng, Sihan, et al.
Veröffentlicht: (2024)

Regularized Proportional Fairness Mechanism for Resource Allocation Without Money
von: Zeng, Sihan, et al.
Veröffentlicht: (2025)

Learning Payment-Free Resource Allocation Mechanisms
von: Zeng, Sihan, et al.
Veröffentlicht: (2023)

Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
von: Zeng, Sihan, et al.
Veröffentlicht: (2024)

Learning in Stackelberg Mean Field Games: A Non-Asymptotic Analysis
von: Zeng, Sihan, et al.
Veröffentlicht: (2025)

Approximate Equivariance in Reinforcement Learning
von: Park, Jung Yeon, et al.
Veröffentlicht: (2024)

Rethinking Neural Network Learning Rates: A Stackelberg Perspective
von: Zeng, Sihan, et al.
Veröffentlicht: (2026)

Finite-Time Complexity of Online Primal-Dual Natural Actor-Critic Algorithm for Constrained Markov Decision Processes
von: Zeng, Sihan, et al.
Veröffentlicht: (2021)

Constrained Bi-Level Optimization: Proximal Lagrangian Value function Approach and Hessian-free Algorithm
von: Yao, Wei, et al.
Veröffentlicht: (2024)

Fast Two-Time-Scale Stochastic Gradient Method with Applications in Reinforcement Learning
von: Zeng, Sihan, et al.
Veröffentlicht: (2024)

A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
von: Zeng, Sihan, et al.
Veröffentlicht: (2021)

Sharpened Lazy Incremental Quasi-Newton Method
von: Lahoti, Aakash, et al.
Veröffentlicht: (2023)

Moreau Envelope for Nonconvex Bi-Level Optimization: A Single-loop and Hessian-free Solution Strategy
von: Liu, Risheng, et al.
Veröffentlicht: (2024)

A Communication-Efficient Decentralized Actor-Critic Algorithm
von: Ren, Xiaoxing, et al.
Veröffentlicht: (2025)

Weak Convergence Analysis of Online Neural Actor-Critic Algorithms
von: Lam, Samuel Chun-Hei, et al.
Veröffentlicht: (2024)

DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty
von: Cui, Mingxuan, et al.
Veröffentlicht: (2025)

Quasi-Newton Compatible Actor-Critic for Deterministic Policies
von: Kordabad, Arash Bahari, et al.
Veröffentlicht: (2025)

UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
von: Belomestny, Denis, et al.
Veröffentlicht: (2021)

Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
von: Huang, Feihu
Veröffentlicht: (2024)

QCQP-Net: Reliably Learning Feasible Alternating Current Optimal Power Flow Solutions Under Constraints
von: Zeng, Sihan, et al.
Veröffentlicht: (2024)

Wasserstein Flow Meets Replicator Dynamics: A Mean-Field Analysis of Representation Learning in Actor-Critic
von: Zhang, Yufeng, et al.
Veröffentlicht: (2021)

CACTO-SL: Using Sobolev Learning to improve Continuous Actor-Critic with Trajectory Optimization
von: Alboni, Elisa, et al.
Veröffentlicht: (2023)

Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling
von: Parfenov, Valery, et al.
Veröffentlicht: (2026)

Control Theoretic Approach to Fine-Tuning and Transfer Learning
von: Bayram, Erkan, et al.
Veröffentlicht: (2024)

Convergence of Actor-Critic Learning for Mean Field Games and Mean Field Control in Continuous Spaces
von: Fouque, Jean-Pierre, et al.
Veröffentlicht: (2025)

ACING: Actor-Critic for Instruction Learning in Black-Box LLMs
von: Kharrat, Salma, et al.
Veröffentlicht: (2024)

Achieving $ε^{-2}$ Sample Complexity for Single-Loop Actor-Critic under Minimal Assumptions
von: Hamza, Ishaq, et al.
Veröffentlicht: (2026)

Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
von: Zhao, Hanyang, et al.
Veröffentlicht: (2025)

Stochastic Hessian Fittings with Lie Groups
von: Li, Xi-Lin
Veröffentlicht: (2024)

Tuning-Free Stochastic Optimization
von: Khaled, Ahmed, et al.
Veröffentlicht: (2024)

Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL
von: Yang, Tong, et al.
Veröffentlicht: (2025)

Gradient-Normalized Smoothness for Optimization with Approximate Hessians
von: Semenov, Andrei, et al.
Veröffentlicht: (2025)

Towards Quantifying the Hessian Structure of Neural Networks
von: Dong, Zhaorui, et al.
Veröffentlicht: (2025)

SGD with Partial Hessian for Deep Neural Networks Optimization
von: Sun, Ying, et al.
Veröffentlicht: (2024)

New Hybrid Fine-Tuning Paradigm for LLMs: Algorithm Design and Convergence Analysis Framework
von: Ma, Shaocong, et al.
Veröffentlicht: (2026)

Iterative Tuning of Nonlinear Model Predictive Control for Robotic Manufacturing Tasks
von: Ingole, Deepak, et al.
Veröffentlicht: (2025)

Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
von: Liu, Xinyu, et al.
Veröffentlicht: (2025)

Accelerated Multi-Time-Scale Stochastic Approximation: Optimal Complexity and Applications in Reinforcement Learning and Multi-Agent Games
von: Zeng, Sihan, et al.
Veröffentlicht: (2024)

A Hessian-Aware Stochastic Differential Equation for Modelling SGD
von: Li, Xiang, et al.
Veröffentlicht: (2024)