Saved in:
| Main Authors: | Weinberger, Simón, Cugliari, Jairo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.18614 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Pr{é}diction optimale pour un mod{è}le ordinal {à} covariables fonctionnelles
by: Weinberger, Simón, et al.
Published: (2025)
by: Weinberger, Simón, et al.
Published: (2025)
Almost sure convergence rates of stochastic gradient methods under gradient domination
by: Weissmann, Simon, et al.
Published: (2024)
by: Weissmann, Simon, et al.
Published: (2024)
Equivalence of stochastic and deterministic policy gradients
by: Todorov, Emo
Published: (2025)
by: Todorov, Emo
Published: (2025)
Bayesian policy gradient and actor-critic algorithms
by: Ghavamzadeh, Mohammad, et al.
Published: (2026)
by: Ghavamzadeh, Mohammad, et al.
Published: (2026)
Geometry and convergence of natural policy gradient methods
by: Müller, Johannes, et al.
Published: (2022)
by: Müller, Johannes, et al.
Published: (2022)
Trainability issues in quantum policy gradients
by: Sequeira, André, et al.
Published: (2024)
by: Sequeira, André, et al.
Published: (2024)
Some remarks on gradient dominance and LQR policy optimization
by: Sontag, Eduardo D.
Published: (2025)
by: Sontag, Eduardo D.
Published: (2025)
ISOPO: Proximal policy gradients without pi-old
by: Abrahamsen, Nilin
Published: (2025)
by: Abrahamsen, Nilin
Published: (2025)
A policy gradient approach for optimization of smooth risk measures
by: Vijayan, Nithia, et al.
Published: (2022)
by: Vijayan, Nithia, et al.
Published: (2022)
Splitting criteria for ordinal decision trees: an experimental study
by: Ayllón-Gavilán, Rafael, et al.
Published: (2024)
by: Ayllón-Gavilán, Rafael, et al.
Published: (2024)
dlordinal: a Python package for deep ordinal classification
by: Bérchez-Moreno, Francisco, et al.
Published: (2024)
by: Bérchez-Moreno, Francisco, et al.
Published: (2024)
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2022)
by: Ding, Dongsheng, et al.
Published: (2022)
Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)
by: Tavakoli, Arash, et al.
Published: (2024)
Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems
by: Giegrich, Michael, et al.
Published: (2022)
by: Giegrich, Michael, et al.
Published: (2022)
Transductive Off-policy Proximal Policy Optimization
by: Gan, Yaozhong, et al.
Published: (2024)
by: Gan, Yaozhong, et al.
Published: (2024)
A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
by: Guin, Soumyajit, et al.
Published: (2022)
by: Guin, Soumyajit, et al.
Published: (2022)
TOC-UCO: a comprehensive repository of tabular ordinal classification datasets
by: Ayllón-Gavilán, Rafael, et al.
Published: (2025)
by: Ayllón-Gavilán, Rafael, et al.
Published: (2025)
Binomial flows: Denoising and flow matching for discrete ordinal data
by: Shenfeld, Yair, et al.
Published: (2026)
by: Shenfeld, Yair, et al.
Published: (2026)
Exploration-Exploitation Tradeoff in Universal Lossy Compression
by: Weinberger, Nir, et al.
Published: (2025)
by: Weinberger, Nir, et al.
Published: (2025)
A representation-learning game for classes of prediction tasks
by: Uzan, Neria, et al.
Published: (2024)
by: Uzan, Neria, et al.
Published: (2024)
A stochastic gradient method for trilevel optimization
by: Giovannelli, Tommaso, et al.
Published: (2025)
by: Giovannelli, Tommaso, et al.
Published: (2025)
An interpretable neural network-based non-proportional odds model for ordinal regression
by: Okuno, Akifumi, et al.
Published: (2023)
by: Okuno, Akifumi, et al.
Published: (2023)
Statistical curriculum learning: An elimination algorithm achieving an oracle risk
by: Cohen, Omer, et al.
Published: (2024)
by: Cohen, Omer, et al.
Published: (2024)
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
by: Vijayan, Nithia, et al.
Published: (2021)
by: Vijayan, Nithia, et al.
Published: (2021)
Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients
by: Davar, Parisa, et al.
Published: (2024)
by: Davar, Parisa, et al.
Published: (2024)
Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings
by: Sreekumar, Sreejith, et al.
Published: (2026)
by: Sreekumar, Sreejith, et al.
Published: (2026)
Policy Newton methods for Distortion Riskmetrics
by: Pachal, Soumen, et al.
Published: (2025)
by: Pachal, Soumen, et al.
Published: (2025)
Softmax gradient policy for variance minimization and risk-averse multi armed bandits
by: Turinici, Gabriel
Published: (2026)
by: Turinici, Gabriel
Published: (2026)
Auto-conditioned primal-dual hybrid gradient method and alternating direction method of multipliers
by: Lan, Guanghui, et al.
Published: (2024)
by: Lan, Guanghui, et al.
Published: (2024)
Conjugate gradient methods for high-dimensional GLMMs
by: Pandolfi, Andrea, et al.
Published: (2024)
by: Pandolfi, Andrea, et al.
Published: (2024)
Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method
by: Baluta, Teodora, et al.
Published: (2024)
by: Baluta, Teodora, et al.
Published: (2024)
Clustered KL-barycenter design for policy evaluation
by: Weissmann, Simon, et al.
Published: (2025)
by: Weissmann, Simon, et al.
Published: (2025)
Interpretable liquid crystal phase classification via two-by-two ordinal patterns
by: Voltarelli, Leonardo G. J. M., et al.
Published: (2026)
by: Voltarelli, Leonardo G. J. M., et al.
Published: (2026)
Analysis of On-policy Policy Gradient Methods under the Distribution Mismatch
by: Wang, Weizhen, et al.
Published: (2025)
by: Wang, Weizhen, et al.
Published: (2025)
Strongly-polynomial time and validation analysis of policy gradient methods
by: Ju, Caleb, et al.
Published: (2024)
by: Ju, Caleb, et al.
Published: (2024)
Local linear convergence of gradient methods for overparameterized Gaussian mixtures
by: Wang, Jingxing, et al.
Published: (2026)
by: Wang, Jingxing, et al.
Published: (2026)
The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits
by: Nachum, Ido, et al.
Published: (2025)
by: Nachum, Ido, et al.
Published: (2025)
Minimax Limits of k-Fold Cross-Validation via Majority
by: Nachum, Ido, et al.
Published: (2026)
by: Nachum, Ido, et al.
Published: (2026)
On Bits and Bandits: Quantifying the Regret-Information Trade-off
by: Shufaro, Itai, et al.
Published: (2024)
by: Shufaro, Itai, et al.
Published: (2024)
Which Algorithms Have Tight Generalization Bounds?
by: Gastpar, Michael, et al.
Published: (2024)
by: Gastpar, Michael, et al.
Published: (2024)
Similar Items
-
Pr{é}diction optimale pour un mod{è}le ordinal {à} covariables fonctionnelles
by: Weinberger, Simón, et al.
Published: (2025) -
Almost sure convergence rates of stochastic gradient methods under gradient domination
by: Weissmann, Simon, et al.
Published: (2024) -
Equivalence of stochastic and deterministic policy gradients
by: Todorov, Emo
Published: (2025) -
Bayesian policy gradient and actor-critic algorithms
by: Ghavamzadeh, Mohammad, et al.
Published: (2026) -
Geometry and convergence of natural policy gradient methods
by: Müller, Johannes, et al.
Published: (2022)