Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Furman, Oleksii, Marszałek, Patryk, Masłowski, Jan, Gaiński, Piotr, Zięba, Maciej, Śmieja, Marek
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.17244
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917281903149056
author	Furman, Oleksii Marszałek, Patryk Masłowski, Jan Gaiński, Piotr Zięba, Maciej Śmieja, Marek
author_facet	Furman, Oleksii Marszałek, Patryk Masłowski, Jan Gaiński, Piotr Zięba, Maciej Śmieja, Marek
contents	Counterfactual explanations (CFs) provide human-interpretable insights into model's predictions by identifying minimal changes to input features that would alter the model's output. However, existing methods struggle to generate multiple high-quality explanations that (1) affect only a small portion of the features, (2) can be applied to tabular data with heterogeneous features, and (3) are consistent with the user-defined constraints. We propose CounterFlowNet, a generative approach that formulates CF generation as sequential feature modification using conditional Generative Flow Networks (GFlowNet). CounterFlowNet is trained to sample CFs proportionally to a user-specified reward function that can encode key CF desiderata: validity, sparsity, proximity and plausibility, encouraging high-quality explanations. The sequential formulation yields highly sparse edits, while a unified action space seamlessly supports continuous and categorical features. Moreover, actionability constraints, such as immutability and monotonicity of features, can be enforced at inference time via action masking, without retraining. Experiments on eight datasets under two evaluation protocols demonstrate that CounterFlowNet achieves superior trade-offs between validity, sparsity, plausibility, and diversity with full satisfaction of the given constraints.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_17244
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	CounterFlowNet: From Minimal Changes to Meaningful Counterfactual Explanations Furman, Oleksii Marszałek, Patryk Masłowski, Jan Gaiński, Piotr Zięba, Maciej Śmieja, Marek Machine Learning Counterfactual explanations (CFs) provide human-interpretable insights into model's predictions by identifying minimal changes to input features that would alter the model's output. However, existing methods struggle to generate multiple high-quality explanations that (1) affect only a small portion of the features, (2) can be applied to tabular data with heterogeneous features, and (3) are consistent with the user-defined constraints. We propose CounterFlowNet, a generative approach that formulates CF generation as sequential feature modification using conditional Generative Flow Networks (GFlowNet). CounterFlowNet is trained to sample CFs proportionally to a user-specified reward function that can encode key CF desiderata: validity, sparsity, proximity and plausibility, encouraging high-quality explanations. The sequential formulation yields highly sparse edits, while a unified action space seamlessly supports continuous and categorical features. Moreover, actionability constraints, such as immutability and monotonicity of features, can be enforced at inference time via action masking, without retraining. Experiments on eight datasets under two evaluation protocols demonstrate that CounterFlowNet achieves superior trade-offs between validity, sparsity, plausibility, and diversity with full satisfaction of the given constraints.
title	CounterFlowNet: From Minimal Changes to Meaningful Counterfactual Explanations
topic	Machine Learning
url	https://arxiv.org/abs/2602.17244

Similar Items