Saved in:
Bibliographic Details
Main Authors: Liu, Yiping, Zhang, Mengxiao, Liu, Jiamou, Yang, Song
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.08284
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915148117049344
author Liu, Yiping
Zhang, Mengxiao
Liu, Jiamou
Yang, Song
author_facet Liu, Yiping
Zhang, Mengxiao
Liu, Jiamou
Yang, Song
contents Machine learning (ML) models have become essential tools in various scenarios. Their effectiveness, however, hinges on a substantial volume of data for satisfactory performance. Model marketplaces have thus emerged as crucial platforms bridging model consumers seeking ML solutions and data owners possessing valuable data. These marketplaces leverage model trading mechanisms to properly incentive data owners to contribute their data, and return a well performing ML model to the model consumers. However, existing model trading mechanisms often assume the data owners are willing to share their data before being paid, which is not reasonable in real world. Given that, we propose a novel mechanism, named Structural Importance based Model Trading (SIMT) mechanism, that assesses the data importance and compensates data owners accordingly without disclosing the data. Specifically, SIMT procures feature and label data from data owners according to their structural importance, and then trains a graph neural network for model consumers. Theoretically, SIMT ensures incentive compatible, individual rational and budget feasible. The experiments on five popular datasets validate that SIMT consistently outperforms vanilla baselines by up to $40\%$ in both MacroF1 and MicroF1.
format Preprint
id arxiv_https___arxiv_org_abs_2502_08284
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Data Pricing for Graph Neural Networks without Pre-purchased Inspection
Liu, Yiping
Zhang, Mengxiao
Liu, Jiamou
Yang, Song
Computer Science and Game Theory
Machine Learning
Machine learning (ML) models have become essential tools in various scenarios. Their effectiveness, however, hinges on a substantial volume of data for satisfactory performance. Model marketplaces have thus emerged as crucial platforms bridging model consumers seeking ML solutions and data owners possessing valuable data. These marketplaces leverage model trading mechanisms to properly incentive data owners to contribute their data, and return a well performing ML model to the model consumers. However, existing model trading mechanisms often assume the data owners are willing to share their data before being paid, which is not reasonable in real world. Given that, we propose a novel mechanism, named Structural Importance based Model Trading (SIMT) mechanism, that assesses the data importance and compensates data owners accordingly without disclosing the data. Specifically, SIMT procures feature and label data from data owners according to their structural importance, and then trains a graph neural network for model consumers. Theoretically, SIMT ensures incentive compatible, individual rational and budget feasible. The experiments on five popular datasets validate that SIMT consistently outperforms vanilla baselines by up to $40\%$ in both MacroF1 and MicroF1.
title Data Pricing for Graph Neural Networks without Pre-purchased Inspection
topic Computer Science and Game Theory
Machine Learning
url https://arxiv.org/abs/2502.08284