Saved in:
| Main Authors: | Castiglioni, Matteo, Nuara, Alessandro, Romano, Giulia, Spadaro, Giorgio, Trovò, Francesco, Gatti, Nicola |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2201.07139 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!
by: Stradi, Francesco Emanuele, et al.
Published: (2025)
by: Stradi, Francesco Emanuele, et al.
Published: (2025)
Beyond Slater's Condition in Online CMDPs with Stochastic and Adversarial Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2025)
by: Stradi, Francesco Emanuele, et al.
Published: (2025)
Learning Adversarial MDPs with Stochastic Hard Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
Multi-Armed Bandits With Best-Action Queries
by: Bacchiocchi, Francesco, et al.
Published: (2026)
by: Bacchiocchi, Francesco, et al.
Published: (2026)
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
Truly Adapting to Adversarial Constraints in Constrained MABs
by: Stradi, Francesco Emanuele, et al.
Published: (2026)
by: Stradi, Francesco Emanuele, et al.
Published: (2026)
Toward Optimal Regret in Robust Pricing: Decoupling Corruption and Time
by: Kalupahana, Kalana, et al.
Published: (2026)
by: Kalupahana, Kalana, et al.
Published: (2026)
Learning Constrained Markov Decision Processes With Non-stationary Rewards and Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)
by: Lin, Qian, et al.
Published: (2023)
LeakSealer: A Semisupervised Defense for LLMs Against Prompt Injection and Leakage Attacks
by: Panebianco, Francesco, et al.
Published: (2025)
by: Panebianco, Francesco, et al.
Published: (2025)
Online Learning under Budget and ROI Constraints via Weak Adaptivity
by: Castiglioni, Matteo, et al.
Published: (2023)
by: Castiglioni, Matteo, et al.
Published: (2023)
Beyond Hard Constraints: Budget-Conditioned Reachability For Safe Offline Reinforcement Learning
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)
by: Brahmanage, Janaka Chathuranga, et al.
Published: (2026)
Best-of-Both-Worlds Policy Optimization for CMDPs with Bandit Feedback
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
by: Stradi, Francesco Emanuele, et al.
Published: (2024)
Regret Minimization for Piecewise Linear Rewards: Contracts, Auctions, and Beyond
by: Bacchiocchi, Francesco, et al.
Published: (2025)
by: Bacchiocchi, Francesco, et al.
Published: (2025)
Learning Optimal Contracts: How to Exploit Small Action Spaces
by: Bacchiocchi, Francesco, et al.
Published: (2023)
by: Bacchiocchi, Francesco, et al.
Published: (2023)
A Best-of-Both-Worlds Algorithm for Constrained MDPs with Long-Term Constraints
by: Germano, Jacopo, et al.
Published: (2023)
by: Germano, Jacopo, et al.
Published: (2023)
Online Optimization for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2025)
by: Chemingui, Yassine, et al.
Published: (2025)
Learning Concave Bid Shading Strategies in Online Auctions via Measure-valued Proximal Optimization
by: Nodozi, Iman, et al.
Published: (2025)
by: Nodozi, Iman, et al.
Published: (2025)
Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)
by: Yao, Yihang, et al.
Published: (2023)
MicroFlow: An Efficient Rust-Based Inference Engine for TinyML
by: Carnelos, Matteo, et al.
Published: (2024)
by: Carnelos, Matteo, et al.
Published: (2024)
Information-Theoretic Safe Bayesian Optimization
by: Bottero, Alessandro G., et al.
Published: (2024)
by: Bottero, Alessandro G., et al.
Published: (2024)
VAO: Validation-Aligned Optimization for Cross-Task Generative Auto-Bidding
by: Lv, Yiqin, et al.
Published: (2025)
by: Lv, Yiqin, et al.
Published: (2025)
Delivery Optimized Discovery in Behavioral User Segmentation under Budget Constraint
by: Chopra, Harshita, et al.
Published: (2024)
by: Chopra, Harshita, et al.
Published: (2024)
Data-Dependent Regret Bounds for Constrained MABs
by: Genalti, Gianmarco, et al.
Published: (2025)
by: Genalti, Gianmarco, et al.
Published: (2025)
Markov Persuasion Processes: Learning to Persuade from Scratch
by: Bacchiocchi, Francesco, et al.
Published: (2024)
by: Bacchiocchi, Francesco, et al.
Published: (2024)
Automating the loop in traffic incident management on highway
by: Cercola, Matteo, et al.
Published: (2025)
by: Cercola, Matteo, et al.
Published: (2025)
Behaviour Policy Optimization: Provably Lower Variance Return Estimates for Off-Policy Reinforcement Learning
by: Goodall, Alexander W., et al.
Published: (2025)
by: Goodall, Alexander W., et al.
Published: (2025)
Constrained Phi-Equilibria
by: Bernasconi, Martino, et al.
Published: (2023)
by: Bernasconi, Martino, et al.
Published: (2023)
$(ε, u)$-Adaptive Regret Minimization in Heavy-Tailed Bandits
by: Genalti, Gianmarco, et al.
Published: (2023)
by: Genalti, Gianmarco, et al.
Published: (2023)
Robust Shielding for Safe Reinforcement Learning
by: Court, Edwin Hamel-De le, et al.
Published: (2026)
by: Court, Edwin Hamel-De le, et al.
Published: (2026)
SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning
by: Anisimov, Maksim, et al.
Published: (2026)
by: Anisimov, Maksim, et al.
Published: (2026)
Gym4ReaL: A Suite for Benchmarking Real-World Reinforcement Learning
by: Salaorni, Davide, et al.
Published: (2025)
by: Salaorni, Davide, et al.
Published: (2025)
SMLE: Safe Machine Learning via Embedded Overapproximation
by: Francobaldi, Matteo, et al.
Published: (2024)
by: Francobaldi, Matteo, et al.
Published: (2024)
A Survey of Constraint Formulations in Safe Reinforcement Learning
by: Wachi, Akifumi, et al.
Published: (2024)
by: Wachi, Akifumi, et al.
Published: (2024)
Certifiable Safe RLHF: Fixed-Penalty Constraint Optimization for Safer Language Models
by: Pandit, Kartik, et al.
Published: (2025)
by: Pandit, Kartik, et al.
Published: (2025)
Conditional Diffusion as Latent Constraints for Controllable Symbolic Music Generation
by: Pettenó, Matteo, et al.
Published: (2025)
by: Pettenó, Matteo, et al.
Published: (2025)
Joint Continual Learning of Local Language Models and Cloud Offloading Decisions with Budget Constraints
by: Chen, Evan, et al.
Published: (2026)
by: Chen, Evan, et al.
Published: (2026)
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2024)
by: Chemingui, Yassine, et al.
Published: (2024)
SB-TRPO: Towards Safe Reinforcement Learning with Hard Constraints
by: Wagner, Dominik, et al.
Published: (2025)
by: Wagner, Dominik, et al.
Published: (2025)
Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints
by: Low, Siow Meng, et al.
Published: (2024)
by: Low, Siow Meng, et al.
Published: (2024)
Similar Items
-
No-Regret Learning Under Adversarial Resource Constraints: A Spending Plan Is All You Need!
by: Stradi, Francesco Emanuele, et al.
Published: (2025) -
Beyond Slater's Condition in Online CMDPs with Stochastic and Adversarial Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2025) -
Learning Adversarial MDPs with Stochastic Hard Constraints
by: Stradi, Francesco Emanuele, et al.
Published: (2024) -
Multi-Armed Bandits With Best-Action Queries
by: Bacchiocchi, Francesco, et al.
Published: (2026) -
Optimal Strong Regret and Violation in Constrained MDPs via Policy Optimization
by: Stradi, Francesco Emanuele, et al.
Published: (2024)