Marc-запись: :: Library Catalog

Сохранить в:

Библиографические подробности
Главный автор:	Shikuri, Yuta
Формат:	Preprint
Опубликовано:	2024
Предметы:	Machine Learning
Online-ссылка:	https://arxiv.org/abs/2412.17455
Метки:	Добавить метку Нет меток, Требуется 1-ая метка записи!

_version_	1866914001326178304
author	Shikuri, Yuta
author_facet	Shikuri, Yuta
contents	Gaussian process regression is a powerful Bayesian nonlinear regression method. Recent research has enabled the capture of many types of observations using non-Gaussian likelihoods. To deal with various tasks in spatial modeling, we benefit from this development. Difficulties still arise when we can only access summarized data consisting of representative features, summary statistics, and data point counts. Such situations frequently occur primarily due to concerns about confidentiality and management costs associated with spatial data. This study tackles learning and inference using only summarized data within the framework of Gaussian process regression. To address this challenge, we analyze the approximation errors in the marginal likelihood and posterior distribution that arise from utilizing representative features. We also introduce the concept of sample quasi-likelihood, which facilitates learning and inference using only summarized data. Non-Gaussian likelihoods satisfying certain assumptions can be captured by specifying a variance function that characterizes a sample quasi-likelihood function. Theoretical and experimental results demonstrate that the approximation performance is influenced by the granularity of summarized data relative to the length scale of covariance functions. Experiments on a real-world dataset highlight the practicality of our method for spatial modeling.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_17455
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Learning from Summarized Data: Gaussian Process Regression with Sample Quasi-Likelihood Shikuri, Yuta Machine Learning Gaussian process regression is a powerful Bayesian nonlinear regression method. Recent research has enabled the capture of many types of observations using non-Gaussian likelihoods. To deal with various tasks in spatial modeling, we benefit from this development. Difficulties still arise when we can only access summarized data consisting of representative features, summary statistics, and data point counts. Such situations frequently occur primarily due to concerns about confidentiality and management costs associated with spatial data. This study tackles learning and inference using only summarized data within the framework of Gaussian process regression. To address this challenge, we analyze the approximation errors in the marginal likelihood and posterior distribution that arise from utilizing representative features. We also introduce the concept of sample quasi-likelihood, which facilitates learning and inference using only summarized data. Non-Gaussian likelihoods satisfying certain assumptions can be captured by specifying a variance function that characterizes a sample quasi-likelihood function. Theoretical and experimental results demonstrate that the approximation performance is influenced by the granularity of summarized data relative to the length scale of covariance functions. Experiments on a real-world dataset highlight the practicality of our method for spatial modeling.
title	Learning from Summarized Data: Gaussian Process Regression with Sample Quasi-Likelihood
topic	Machine Learning
url	https://arxiv.org/abs/2412.17455

Схожие документы