Salvato in:
Dettagli Bibliografici
Autori principali: Scherer, Maximilian, Bernard, Jürgen, Schreck, Tobias
Natura: Dataset Open Access
Lingua:en
Pubblicazione: PANGAEA 2011
Soggetti:
Accesso online:https://doi.org/10.1594/PANGAEA.756307
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1867170911725027328
author Scherer, Maximilian
Bernard, Jürgen
Schreck, Tobias
author_facet Scherer, Maximilian
Bernard, Jürgen
Schreck, Tobias
collection Datos científicos de ciencias marinas y ambientales
contents Increasing amounts of data is collected in most areas of research and application. The degree to which this data can be accessed, analyzed, and retrieved, is a decisive in obtaining progress in fields such as scientific research or industrial production. We present a novel methodology supporting content-based retrieval and exploratory search in repositories of multivariate research data. In particular, our methods are able to describe two-dimensional functional dependencies in research data, e.g. the relationship between ination and unemployment in economics. Our basic idea is to use feature vectors based on the goodness-of-fit of a set of regression models to describe the data mathematically. We denote this approach Regressional Features and use it for content-based search and, since our approach motivates an intuitive definition of interestingness, for exploring the most interesting data. We apply our method on considerable real-world research datasets, showing the usefulness of our approach for user-centered access to research data in a Digital Library system.
format Dataset Open Access
id pangaea_https___doi_org_10_1594_PANGAEA_756307
institution PANGAEA
language en
publishDate 2011
publisher PANGAEA
record_format pangaea
spellingShingle Reference list of sources used for two experimental data files dataBSRN and dataMixed
Scherer, Maximilian
Bernard, Jürgen
Schreck, Tobias

Increasing amounts of data is collected in most areas of research and application. The degree to which this data can be accessed, analyzed, and retrieved, is a decisive in obtaining progress in fields such as scientific research or industrial production. We present a novel methodology supporting content-based retrieval and exploratory search in repositories of multivariate research data. In particular, our methods are able to describe two-dimensional functional dependencies in research data, e.g. the relationship between ination and unemployment in economics. Our basic idea is to use feature vectors based on the goodness-of-fit of a set of regression models to describe the data mathematically. We denote this approach Regressional Features and use it for content-based search and, since our approach motivates an intuitive definition of interestingness, for exploring the most interesting data. We apply our method on considerable real-world research datasets, showing the usefulness of our approach for user-centered access to research data in a Digital Library system.
title Reference list of sources used for two experimental data files dataBSRN and dataMixed
topic
url https://doi.org/10.1594/PANGAEA.756307