:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Weinberger, Simón, Cugliari, Jairo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2506.18614
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Pr{é}diction optimale pour un mod{è}le ordinal {à} covariables fonctionnelles
by: Weinberger, Simón, et al.
Published: (2025)

Almost sure convergence rates of stochastic gradient methods under gradient domination
by: Weissmann, Simon, et al.
Published: (2024)

Equivalence of stochastic and deterministic policy gradients
by: Todorov, Emo
Published: (2025)

Bayesian policy gradient and actor-critic algorithms
by: Ghavamzadeh, Mohammad, et al.
Published: (2026)

Geometry and convergence of natural policy gradient methods
by: Müller, Johannes, et al.
Published: (2022)

Trainability issues in quantum policy gradients
by: Sequeira, André, et al.
Published: (2024)

Some remarks on gradient dominance and LQR policy optimization
by: Sontag, Eduardo D.
Published: (2025)

ISOPO: Proximal policy gradients without pi-old
by: Abrahamsen, Nilin
Published: (2025)

A policy gradient approach for optimization of smooth risk measures
by: Vijayan, Nithia, et al.
Published: (2022)

Splitting criteria for ordinal decision trees: an experimental study
by: Ayllón-Gavilán, Rafael, et al.
Published: (2024)

dlordinal: a Python package for deep ordinal classification
by: Bérchez-Moreno, Francisco, et al.
Published: (2024)

Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2022)

Learning in complex action spaces without policy gradients
by: Tavakoli, Arash, et al.
Published: (2024)

Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems
by: Giegrich, Michael, et al.
Published: (2022)

Transductive Off-policy Proximal Policy Optimization
by: Gan, Yaozhong, et al.
Published: (2024)

A policy gradient approach for Finite Horizon Constrained Markov Decision Processes
by: Guin, Soumyajit, et al.
Published: (2022)

TOC-UCO: a comprehensive repository of tabular ordinal classification datasets
by: Ayllón-Gavilán, Rafael, et al.
Published: (2025)

Binomial flows: Denoising and flow matching for discrete ordinal data
by: Shenfeld, Yair, et al.
Published: (2026)

Exploration-Exploitation Tradeoff in Universal Lossy Compression
by: Weinberger, Nir, et al.
Published: (2025)

A representation-learning game for classes of prediction tasks
by: Uzan, Neria, et al.
Published: (2024)

A stochastic gradient method for trilevel optimization
by: Giovannelli, Tommaso, et al.
Published: (2025)

An interpretable neural network-based non-proportional odds model for ordinal regression
by: Okuno, Akifumi, et al.
Published: (2023)

Statistical curriculum learning: An elimination algorithm achieving an oracle risk
by: Cohen, Omer, et al.
Published: (2024)

Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
by: Vijayan, Nithia, et al.
Published: (2021)

Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradients
by: Davar, Parisa, et al.
Published: (2024)

Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings
by: Sreekumar, Sreejith, et al.
Published: (2026)

Policy Newton methods for Distortion Riskmetrics
by: Pachal, Soumen, et al.
Published: (2025)

Softmax gradient policy for variance minimization and risk-averse multi armed bandits
by: Turinici, Gabriel
Published: (2026)

Auto-conditioned primal-dual hybrid gradient method and alternating direction method of multipliers
by: Lan, Guanghui, et al.
Published: (2024)

Conjugate gradient methods for high-dimensional GLMMs
by: Pandolfi, Andrea, et al.
Published: (2024)

Unlearning in- vs. out-of-distribution data in LLMs under gradient-based method
by: Baluta, Teodora, et al.
Published: (2024)

Clustered KL-barycenter design for policy evaluation
by: Weissmann, Simon, et al.
Published: (2025)

Interpretable liquid crystal phase classification via two-by-two ordinal patterns
by: Voltarelli, Leonardo G. J. M., et al.
Published: (2026)

Analysis of On-policy Policy Gradient Methods under the Distribution Mismatch
by: Wang, Weizhen, et al.
Published: (2025)

Strongly-polynomial time and validation analysis of policy gradient methods
by: Ju, Caleb, et al.
Published: (2024)

Local linear convergence of gradient methods for overparameterized Gaussian mixtures
by: Wang, Jingxing, et al.
Published: (2026)

The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits
by: Nachum, Ido, et al.
Published: (2025)

Minimax Limits of k-Fold Cross-Validation via Majority
by: Nachum, Ido, et al.
Published: (2026)

On Bits and Bandits: Quantifying the Regret-Information Trade-off
by: Shufaro, Itai, et al.
Published: (2024)

Which Algorithms Have Tight Generalization Bounds?
by: Gastpar, Michael, et al.
Published: (2024)