Saved in:
Bibliographic Details
Main Authors: Yang, Jiaqi, Liang, Enming, Su, Zicheng, Zou, Zhichao, Zhen, Peng, Guo, Jiecheng, Ma, Wanjing, An, Kun
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.01874
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916687131967488
author Yang, Jiaqi
Liang, Enming
Su, Zicheng
Zou, Zhichao
Zhen, Peng
Guo, Jiecheng
Ma, Wanjing
An, Kun
author_facet Yang, Jiaqi
Liang, Enming
Su, Zicheng
Zou, Zhichao
Zhen, Peng
Guo, Jiecheng
Ma, Wanjing
An, Kun
contents Decision-focused learning (DFL) offers an end-to-end approach to the predict-then-optimize (PO) framework by training predictive models directly on decision loss (DL), enhancing decision-making performance within PO contexts. However, the implementation of DFL poses distinct challenges. Primarily, DL can result in deviation from the physical significance of the predictions under limited data. Additionally, some predictive models are non-differentiable or black-box, which cannot be adjusted using gradient-based methods. To tackle the above challenges, we propose a novel framework, Decision-Focused Fine-tuning (DFF), which embeds the DFL module into the PO pipeline via a novel bias correction module. DFF is formulated as a constrained optimization problem that maintains the proximity of the DL-enhanced model to the original predictive model within a defined trust region. We theoretically prove that DFF strictly confines prediction bias within a predetermined upper bound, even with limited datasets, thereby substantially reducing prediction shifts caused by DL under limited data. Furthermore, the bias correction module can be integrated into diverse predictive models, enhancing adaptability to a broad range of PO tasks. Extensive evaluations on synthetic and real-world datasets, including network flow, portfolio optimization, and resource allocation problems with different predictive models, demonstrate that DFF not only improves decision performance but also adheres to fine-tuning constraints, showcasing robust adaptability across various scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2501_01874
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle DFF: Decision-Focused Fine-tuning for Smarter Predict-then-Optimize with Limited Data
Yang, Jiaqi
Liang, Enming
Su, Zicheng
Zou, Zhichao
Zhen, Peng
Guo, Jiecheng
Ma, Wanjing
An, Kun
Machine Learning
Decision-focused learning (DFL) offers an end-to-end approach to the predict-then-optimize (PO) framework by training predictive models directly on decision loss (DL), enhancing decision-making performance within PO contexts. However, the implementation of DFL poses distinct challenges. Primarily, DL can result in deviation from the physical significance of the predictions under limited data. Additionally, some predictive models are non-differentiable or black-box, which cannot be adjusted using gradient-based methods. To tackle the above challenges, we propose a novel framework, Decision-Focused Fine-tuning (DFF), which embeds the DFL module into the PO pipeline via a novel bias correction module. DFF is formulated as a constrained optimization problem that maintains the proximity of the DL-enhanced model to the original predictive model within a defined trust region. We theoretically prove that DFF strictly confines prediction bias within a predetermined upper bound, even with limited datasets, thereby substantially reducing prediction shifts caused by DL under limited data. Furthermore, the bias correction module can be integrated into diverse predictive models, enhancing adaptability to a broad range of PO tasks. Extensive evaluations on synthetic and real-world datasets, including network flow, portfolio optimization, and resource allocation problems with different predictive models, demonstrate that DFF not only improves decision performance but also adheres to fine-tuning constraints, showcasing robust adaptability across various scenarios.
title DFF: Decision-Focused Fine-tuning for Smarter Predict-then-Optimize with Limited Data
topic Machine Learning
url https://arxiv.org/abs/2501.01874