| Main Authors: | Tangri, Rohan, Calliess, Jan-Peter |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.22993 |
Similar Items
End-to-End Policy Learning of a Statistical Arbitrage Autoencoder Architecture
by: Krause, Fabian, et al.
Published: (2024)
Deep Learning for Financial Time Series: A Large-Scale Benchmark of Risk-Adjusted Performance
by: Saly-Kaufmann, Adir, et al.
Published: (2026)
Uniform Convergence Beyond Glivenko-Cantelli
by: Devale, Tanmay, et al.
Published: (2025)
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization
by: Luis, Carlos E., et al.
Published: (2023)
Glivenko-Cantelli for $f$-divergence
by: Wang, Haoming, et al.
Published: (2025)
The Empirical Mean is Minimax Optimal for Local Glivenko-Cantelli
by: Cohen, Doron, et al.
Published: (2024)
Equivariant Goal Conditioned Contrastive Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2025)
PAC-Bayesian Bounds on Constrained f-Entropic Risk Measures
by: Atbir, Hind, et al.
Published: (2025)
State-wise Constrained Policy Optimization
by: Zhao, Weiye, et al.
Published: (2023)
e-COP : Episodic Constrained Optimization of Policies
by: Agnihotri, Akhil, et al.
Published: (2024)
Proactive Constrained Policy Optimization with Preemptive Penalty
by: Yang, Ning, et al.
Published: (2025)
Inference Time Policy Optimization for Offline RL with Differentiable World Models
by: Deb, Rohan, et al.
Published: (2026)
Pretrain Value, Not Reward: Decoupled Value Policy Optimization
by: Huang, Chenghua, et al.
Published: (2025)
Matrix-Valued Optimism is Matrix-Valued Augmentation: Additive Hybrid Designs for Constrained Optimization
by: Zhao, Jiayi
Published: (2026)
Constrained Group Relative Policy Optimization
by: Girgis, Roger, et al.
Published: (2026)
Equivariant Offline Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2024)
Autoregressive Policy Optimization for Constrained Allocation Tasks
by: Winkel, David, et al.
Published: (2024)
Extreme Value Policy Optimization for Safe Reinforcement Learning
by: Gao, Shiqing, et al.
Published: (2026)
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk
by: Ying, Chengyang, et al.
Published: (2022)
Concentration Bounds for Optimized Certainty Equivalent Risk Estimation
by: Ghosh, Ayon, et al.
Published: (2024)
Universal Dynamic Regret and Constraint Violation Bounds for Constrained Online Convex Optimization
by: Supantha, Subhamon, et al.
Published: (2025)
Risk-Averse Constrained Reinforcement Learning with Optimized Certainty Equivalents
by: Lee, Jane H., et al.
Published: (2025)
Massively Scaling Explicit Policy-conditioned Value Functions
by: Bohlinger, Nico, et al.
Published: (2025)
Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization
by: Niu, Yifan, et al.
Published: (2025)
Constrained Policy Optimization with Explicit Behavior Density for Offline Reinforcement Learning
by: Zhang, Jing, et al.
Published: (2023)
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
Optimal Bounds for Adversarial Constrained Online Convex Optimization
by: Ferreira, Ricardo N., et al.
Published: (2025)
Value-Free Policy Optimization via Reward Partitioning
by: Faye, Bilal, et al.
Published: (2025)
Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization
by: Wang, Mingyi, et al.
Published: (2026)
Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
by: Si, Wenwen, et al.
Published: (2025)
Incentivizing Safer Actions in Policy Optimization for Constrained Reinforcement Learning
by: Hazra, Somnath, et al.
Published: (2025)
Constrained Policy Optimization via Sampling-Based Weight-Space Projection
by: Cao, Shengfan, et al.
Published: (2025)
Co2PO: Coordinated Constrained Policy Optimization for Multi-Agent RL
by: Patel, Shrenik, et al.
Published: (2026)
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization
by: He, Longxiang, et al.
Published: (2024)
Stepwise Alignment for Constrained Language Model Policy Optimization
by: Wachi, Akifumi, et al.
Published: (2024)
Adaptive Test-Time Compute Allocation for Reasoning LLMs via Constrained Policy Optimization
by: Zhai, Zhiyuan, et al.
Published: (2026)
Data-Dependent Regret Bounds for Constrained MABs
by: Genalti, Gianmarco, et al.
Published: (2025)
Adversarial Constrained Policy Optimization: Improving Constrained Reinforcement Learning by Adapting Budgets
by: Ma, Jianmina, et al.
Published: (2024)
Projection-Free Functional Constrained Optimization for Risk Aversion and Sparsity Control
by: Cheng, Yi, et al.
Published: (2022)
Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
by: Ma, Yunchang, et al.
Published: (2025)