Saved in:
| Main Authors: | Shabadi, Guruprerana, Mallik, Kaushik |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.02151 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Risk-Sensitive Agent Compositions
by: Shabadi, Guruprerana, et al.
Published: (2025)
by: Shabadi, Guruprerana, et al.
Published: (2025)
Programmatic Reinforcement Learning: Navigating Gridworlds
by: Shabadi, Guruprerana, et al.
Published: (2024)
by: Shabadi, Guruprerana, et al.
Published: (2024)
Do We Need Frontier Models to Verify Mathematical Proofs?
by: Naik, Aaditya, et al.
Published: (2026)
by: Naik, Aaditya, et al.
Published: (2026)
Optimization Modulo Integer Linear-Exponential Programs
by: Hitarth, S, et al.
Published: (2025)
by: Hitarth, S, et al.
Published: (2025)
Auction-Based Scheduling
by: Avni, Guy, et al.
Published: (2023)
by: Avni, Guy, et al.
Published: (2023)
Learning in Budgeted Auctions with Spacing Objectives
by: Fikioris, Giannis, et al.
Published: (2024)
by: Fikioris, Giannis, et al.
Published: (2024)
Automated Deterministic Auction Design with Objective Decomposition
by: Duan, Zhijian, et al.
Published: (2024)
by: Duan, Zhijian, et al.
Published: (2024)
Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning
by: Zhang, Chubin, et al.
Published: (2025)
by: Zhang, Chubin, et al.
Published: (2025)
Glitches in Decision Tree Ensemble Models
by: Chandra, Satyankar, et al.
Published: (2025)
by: Chandra, Satyankar, et al.
Published: (2025)
Monitoring of Static Fairness
by: Henzinger, Thomas A., et al.
Published: (2025)
by: Henzinger, Thomas A., et al.
Published: (2025)
Online Adaptation for Enhancing Imitation Learning Policies
by: Malato, Federico, et al.
Published: (2024)
by: Malato, Federico, et al.
Published: (2024)
PolicyEvolve: Evolving Programmatic Policies by LLMs for multi-player games via Population-Based Training
by: Lv, Mingrui, et al.
Published: (2025)
by: Lv, Mingrui, et al.
Published: (2025)
Efficient Dynamic Shielding for Parametric Safety Specifications
by: Corsi, Davide, et al.
Published: (2025)
by: Corsi, Davide, et al.
Published: (2025)
Co-Evolving Policy Distillation
by: Gu, Naibin, et al.
Published: (2026)
by: Gu, Naibin, et al.
Published: (2026)
Online SLA Decomposition: Enabling Real-Time Adaptation to Evolving Network Systems
by: Hsu, Cyril Shih-Huan, et al.
Published: (2024)
by: Hsu, Cyril Shih-Huan, et al.
Published: (2024)
Breaking Determinism: Stochastic Modeling for Reliable Off-Policy Evaluation in Ad Auctions
by: Yeom, Hongseon, et al.
Published: (2025)
by: Yeom, Hongseon, et al.
Published: (2025)
Online Combinatorial Allocations and Auctions with Few Samples
by: Dütting, Paul, et al.
Published: (2024)
by: Dütting, Paul, et al.
Published: (2024)
Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
by: Zhang, Qixin, et al.
Published: (2025)
by: Zhang, Qixin, et al.
Published: (2025)
OASIS: Online Activation Subspace Learning for Memory-Efficient Training
by: Choudhary, Sakshi, et al.
Published: (2026)
by: Choudhary, Sakshi, et al.
Published: (2026)
Off-Policy Evaluation and Counterfactual Methods in Dynamic Auction Environments
by: Guha, Ritam, et al.
Published: (2025)
by: Guha, Ritam, et al.
Published: (2025)
Channel Estimation by Infinite Width Convolutional Networks
by: Mallik, Mohammed, et al.
Published: (2025)
by: Mallik, Mohammed, et al.
Published: (2025)
Online Causal Inference for Advertising in Real-Time Bidding Auctions
by: Waisman, Caio, et al.
Published: (2019)
by: Waisman, Caio, et al.
Published: (2019)
Improved Online Learning Algorithms for CTR Prediction in Ad Auctions
by: Feng, Zhe, et al.
Published: (2024)
by: Feng, Zhe, et al.
Published: (2024)
Optimizing Online Advertising with Multi-Armed Bandits: Mitigating the Cold Start Problem under Auction Dynamics
by: Soboleva, Anastasiia, et al.
Published: (2025)
by: Soboleva, Anastasiia, et al.
Published: (2025)
Evolving Restricted Boltzmann Machine-Kohonen Network for Online Clustering
by: Senthilnath, J., et al.
Published: (2024)
by: Senthilnath, J., et al.
Published: (2024)
Flow-Based Policy for Online Reinforcement Learning
by: Lv, Lei, et al.
Published: (2025)
by: Lv, Lei, et al.
Published: (2025)
MANGO: Meta-Adaptive Network Gradient Optimization for Online Continual Learning
by: Awasthi, Ankita, et al.
Published: (2026)
by: Awasthi, Ankita, et al.
Published: (2026)
ELENA: Epigenetic Learning through Evolved Neural Adaptation
by: Kriuk, Boris, et al.
Published: (2025)
by: Kriuk, Boris, et al.
Published: (2025)
Multi-Objective $\textit{min-max}$ Online Convex Optimization
by: Vaze, Rahul, et al.
Published: (2025)
by: Vaze, Rahul, et al.
Published: (2025)
SNPL: Simultaneous Policy Learning and Evaluation for Safe Multi-Objective Policy Improvement
by: Cho, Brian, et al.
Published: (2025)
by: Cho, Brian, et al.
Published: (2025)
Causal and Federated Multimodal Learning for Cardiovascular Risk Prediction under Heterogeneous Populations
by: Kaushik, Rohit, et al.
Published: (2026)
by: Kaushik, Rohit, et al.
Published: (2026)
Learning Control Policies for Variable Objectives from Offline Data
by: Weber, Marc, et al.
Published: (2023)
by: Weber, Marc, et al.
Published: (2023)
Online Auction Design Using Distribution-Free Uncertainty Quantification with Applications to E-Commerce
by: Han, Jiale, et al.
Published: (2024)
by: Han, Jiale, et al.
Published: (2024)
Information-Consistent Language Model Recommendations through Group Relative Policy Optimization
by: Prabhune, Sonal, et al.
Published: (2025)
by: Prabhune, Sonal, et al.
Published: (2025)
Evolved Sample Weights for Bias Mitigation: Effectiveness Depends on the Fairness Objective
by: Saini, Anil K., et al.
Published: (2025)
by: Saini, Anil K., et al.
Published: (2025)
Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
by: Wu, Ruihan, et al.
Published: (2024)
by: Wu, Ruihan, et al.
Published: (2024)
An Offline Adaptation Framework for Constrained Multi-Objective Reinforcement Learning
by: Lin, Qian, et al.
Published: (2024)
by: Lin, Qian, et al.
Published: (2024)
One-Way Policy Optimization for Self-Evolving LLMs
by: Yang, Shuo, et al.
Published: (2026)
by: Yang, Shuo, et al.
Published: (2026)
Robust Multi-Objective Preference Alignment with Online DPO
by: Gupta, Raghav, et al.
Published: (2025)
by: Gupta, Raghav, et al.
Published: (2025)
InvEvolve: Evolving White-Box Inventory Policies via Large Language Models with Performance Guarantees
by: Huang, Chenyu, et al.
Published: (2026)
by: Huang, Chenyu, et al.
Published: (2026)
Similar Items
-
Risk-Sensitive Agent Compositions
by: Shabadi, Guruprerana, et al.
Published: (2025) -
Programmatic Reinforcement Learning: Navigating Gridworlds
by: Shabadi, Guruprerana, et al.
Published: (2024) -
Do We Need Frontier Models to Verify Mathematical Proofs?
by: Naik, Aaditya, et al.
Published: (2026) -
Optimization Modulo Integer Linear-Exponential Programs
by: Hitarth, S, et al.
Published: (2025) -
Auction-Based Scheduling
by: Avni, Guy, et al.
Published: (2023)