Saved in:
Bibliographic Details
Main Authors: Dennig, Frederik L., Polk, Tom, Lin, Zudi, Schreck, Tobias, Pfister, Hanspeter, Behrisch, Michael
Format: Preprint
Published: 2019
Subjects:
Online Access:https://arxiv.org/abs/1907.12489
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910445009371136
author Dennig, Frederik L.
Polk, Tom
Lin, Zudi
Schreck, Tobias
Pfister, Hanspeter
Behrisch, Michael
author_facet Dennig, Frederik L.
Polk, Tom
Lin, Zudi
Schreck, Tobias
Pfister, Hanspeter
Behrisch, Michael
contents The detection of interesting patterns in large high-dimensional datasets is difficult because of their dimensionality and pattern complexity. Therefore, analysts require automated support for the extraction of relevant patterns. In this paper, we present FDive, a visual active learning system that helps to create visually explorable relevance models, assisted by learning a pattern-based similarity. We use a small set of user-provided labels to rank similarity measures, consisting of feature descriptor and distance function combinations, by their ability to distinguish relevant from irrelevant data. Based on the best-ranked similarity measure, the system calculates an interactive Self-Organizing Map-based relevance model, which classifies data according to the cluster affiliation. It also automatically prompts further relevance feedback to improve its accuracy. Uncertain areas, especially near the decision boundaries, are highlighted and can be refined by the user. We evaluate our approach by comparison to state-of-the-art feature selection techniques and demonstrate the usefulness of our approach by a case study classifying electron microscopy images of brain cells. The results show that FDive enhances both the quality and understanding of relevance models and can thus lead to new insights for brain research.
format Preprint
id arxiv_https___arxiv_org_abs_1907_12489
institution arXiv
publishDate 2019
record_format arxiv
spellingShingle FDive: Learning Relevance Models using Pattern-based Similarity Measures
Dennig, Frederik L.
Polk, Tom
Lin, Zudi
Schreck, Tobias
Pfister, Hanspeter
Behrisch, Michael
Machine Learning
The detection of interesting patterns in large high-dimensional datasets is difficult because of their dimensionality and pattern complexity. Therefore, analysts require automated support for the extraction of relevant patterns. In this paper, we present FDive, a visual active learning system that helps to create visually explorable relevance models, assisted by learning a pattern-based similarity. We use a small set of user-provided labels to rank similarity measures, consisting of feature descriptor and distance function combinations, by their ability to distinguish relevant from irrelevant data. Based on the best-ranked similarity measure, the system calculates an interactive Self-Organizing Map-based relevance model, which classifies data according to the cluster affiliation. It also automatically prompts further relevance feedback to improve its accuracy. Uncertain areas, especially near the decision boundaries, are highlighted and can be refined by the user. We evaluate our approach by comparison to state-of-the-art feature selection techniques and demonstrate the usefulness of our approach by a case study classifying electron microscopy images of brain cells. The results show that FDive enhances both the quality and understanding of relevance models and can thus lead to new insights for brain research.
title FDive: Learning Relevance Models using Pattern-based Similarity Measures
topic Machine Learning
url https://arxiv.org/abs/1907.12489