Saved in:
Bibliographic Details
Main Author: Cerulli, Giovanni
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.12235
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910213062262784
author Cerulli, Giovanni
author_facet Cerulli, Giovanni
contents We study optimal policy learning under combined budget and minimum coverage constraints. We show that the problem admits a knapsack-type structure and that the optimal policy can be characterized by an affine threshold rule involving both budget and coverage shadow prices. We establish that the linear programming relaxation of the combinatorial solution has an O(1) integrality gap, implying asymptotic equivalence with the optimal discrete allocation. Building on this result, we analyze two implementable approaches: a Greedy-Lagrangian (GLC) and a rank-and-cut (RC) algorithm. We show that the GLC closely approximates the optimal solution and achieves near-optimal performance in finite samples. By contrast, RC is approximately optimal whenever the coverage constraint is slack or costs are homogeneous, while misallocation arises only when cost heterogeneity interacts with a binding coverage constraint. Monte Carlo evidence supports these findings.
format Preprint
id arxiv_https___arxiv_org_abs_2605_12235
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Optimal Policy Learning under Budget and Coverage Constraints
Cerulli, Giovanni
Machine Learning
We study optimal policy learning under combined budget and minimum coverage constraints. We show that the problem admits a knapsack-type structure and that the optimal policy can be characterized by an affine threshold rule involving both budget and coverage shadow prices. We establish that the linear programming relaxation of the combinatorial solution has an O(1) integrality gap, implying asymptotic equivalence with the optimal discrete allocation. Building on this result, we analyze two implementable approaches: a Greedy-Lagrangian (GLC) and a rank-and-cut (RC) algorithm. We show that the GLC closely approximates the optimal solution and achieves near-optimal performance in finite samples. By contrast, RC is approximately optimal whenever the coverage constraint is slack or costs are homogeneous, while misallocation arises only when cost heterogeneity interacts with a binding coverage constraint. Monte Carlo evidence supports these findings.
title Optimal Policy Learning under Budget and Coverage Constraints
topic Machine Learning
url https://arxiv.org/abs/2605.12235