:: Library Catalog

Bibliographic Details
Main Author:	Abrahamsen, Nilin
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2512.23353

Similar Items

PROMA: Projected Microbatch Accumulation for Reference-Free Proximal Policy Updates
by: Abrahamsen, Nilin
Published: (2026)

A Kaczmarz-inspired approach to accelerate the optimization of neural network wavefunctions
by: Goldshlager, Gil, et al.
Published: (2024)

Convergence of variational Monte Carlo simulation and scale-invariant pre-training
by: Abrahamsen, Nilin, et al.
Published: (2023)

Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)

Transductive Off-policy Proximal Policy Optimization
by: Gan, Yaozhong, et al.
Published: (2024)

Equivalence of stochastic and deterministic policy gradients
by: Todorov, Emo
Published: (2025)

Policy gradient methods for ordinal policies
by: Weinberger, Simón, et al.
Published: (2025)

Bayesian policy gradient and actor-critic algorithms
by: Ghavamzadeh, Mohammad, et al.
Published: (2026)

Trainability issues in quantum policy gradients
by: Sequeira, André, et al.
Published: (2024)

$\pi2\text{vec}$: Policy Representations with Successor Features
by: Scarpellini, Gianluca, et al.
Published: (2023)

Some remarks on gradient dominance and LQR policy optimization
by: Sontag, Eduardo D.
Published: (2025)

A policy gradient approach for optimization of smooth risk measures
by: Vijayan, Nithia, et al.
Published: (2022)

A Unified Theory of Stochastic Proximal Point Methods without Smoothness
by: Richtárik, Peter, et al.
Published: (2024)

ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm
by: Wang, Hanyong, et al.
Published: (2026)

A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
by: Guin, Soumyajit, et al.
Published: (2022)

Off-policy Distributional Q($λ$): Distributional RL without Importance Sampling
by: Tang, Yunhao, et al.
Published: (2024)

Quantum natural gradient without monotonicity
by: Sasaki, Toi, et al.
Published: (2024)

Finding good policies in average-reward Markov Decision Processes without prior knowledge
by: Tuynman, Adrienne, et al.
Published: (2024)

Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
by: Vijayan, Nithia, et al.
Published: (2021)

Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients
by: Davar, Parisa, et al.
Published: (2024)

pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
by: Chen, Hansheng, et al.
Published: (2025)

The Generalized Proximity Forest
by: Shaw, Ben, et al.
Published: (2025)

Softmax gradient policy for variance minimization and risk-averse multi armed bandits
by: Turinici, Gabriel
Published: (2026)

Layerwise Proximal Replay: A Proximal Point Method for Online Continual Learning
by: Yoo, Jason, et al.
Published: (2024)

Old lamp
by: TahirNilin
Published: (2021)

Forest Proximities for Time Series
by: Shaw, Ben, et al.
Published: (2024)

Wasserstein Proximal Policy Gradient
by: Zhu, Zhaoyu, et al.
Published: (2026)

Proximal-IMH: Proximal Posterior Proposals for Independent Metropolis-Hastings with Approximate Operators
by: Chen, Youguang, et al.
Published: (2026)

Geometry and convergence of natural policy gradient methods
by: Müller, Johannes, et al.
Published: (2022)

Proximity Matters: Local Proximity Enhanced Balancing for Treatment Effect Estimation
by: Wang, Hao, et al.
Published: (2024)

Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
by: Yang, Yan, et al.
Published: (2024)

Proximal Policy Distillation
by: Spigler, Giacomo
Published: (2024)

Central Path Proximal Policy Optimization
by: Milosevic, Nikola, et al.
Published: (2025)

Proximal Iteration for Nonlinear Adaptive Lasso
by: Wycoff, Nathan, et al.
Published: (2024)

Deep deterministic policy gradient with symmetric data augmentation for lateral attitude tracking control of a fixed-wing aircraft
by: Li, Yifei, et al.
Published: (2024)

Fast training of accurate physics-informed neural networks without gradient descent
by: Datar, Chinmay, et al.
Published: (2024)

Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2022)

Independent policy gradient-based reinforcement learning for economic and reliable energy management of multi-microgrid systems
by: Hu, Junkai, et al.
Published: (2025)

Deep Gaussian Process Proximal Policy Optimization
by: van der Lende, Matthijs, et al.
Published: (2025)

Actor-Critic Pretraining for Proximal Policy Optimization
by: Kernbach, Andreas, et al.
Published: (2026)