Saved in:
Bibliographic Details
Main Authors: Song, Mooho, Lee, Jay-Yoon
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.01647
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916363700797440
author Song, Mooho
Lee, Jay-Yoon
author_facet Song, Mooho
Lee, Jay-Yoon
contents Neural networks (NN) perform well in diverse tasks, but sometimes produce nonsensical results to humans. Most NN models "solely" learn from (input, output) pairs, occasionally conflicting with human knowledge. Many studies indicate injecting human knowledge by reducing output constraints during training can improve model performance and reduce constraint violations. While there have been several attempts to compare different existing algorithms under the same programming framework, nonetheless, there has been no previous work that categorizes learning algorithms with output constraints in a unified manner. Our contributions are as follows: (1) We categorize the previous studies based on three axes: type of constraint loss used (e.g. probabilistic soft logic, REINFORCE), exploration strategy of constraint-violating examples, and integration mechanism of learning signals from main task and constraint. (2) We propose new algorithms to integrate the information of main task and constraint injection, inspired by continual-learning algorithms. (3) Furthermore, we propose the $Hβ$-score as a metric for considering the main task metric and constraint violation simultaneously. To provide a thorough analysis, we examine all the algorithms on three NLP tasks: natural language inference (NLI), synthetic transduction examples (STE), and semantic role labeling (SRL). We explore and reveal the key factors of various algorithms associated with achieving high $Hβ$-scores.
format Preprint
id arxiv_https___arxiv_org_abs_2406_01647
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints
Song, Mooho
Lee, Jay-Yoon
Machine Learning
Artificial Intelligence
Neural networks (NN) perform well in diverse tasks, but sometimes produce nonsensical results to humans. Most NN models "solely" learn from (input, output) pairs, occasionally conflicting with human knowledge. Many studies indicate injecting human knowledge by reducing output constraints during training can improve model performance and reduce constraint violations. While there have been several attempts to compare different existing algorithms under the same programming framework, nonetheless, there has been no previous work that categorizes learning algorithms with output constraints in a unified manner. Our contributions are as follows: (1) We categorize the previous studies based on three axes: type of constraint loss used (e.g. probabilistic soft logic, REINFORCE), exploration strategy of constraint-violating examples, and integration mechanism of learning signals from main task and constraint. (2) We propose new algorithms to integrate the information of main task and constraint injection, inspired by continual-learning algorithms. (3) Furthermore, we propose the $Hβ$-score as a metric for considering the main task metric and constraint violation simultaneously. To provide a thorough analysis, we examine all the algorithms on three NLP tasks: natural language inference (NLI), synthetic transduction examples (STE), and semantic role labeling (SRL). We explore and reveal the key factors of various algorithms associated with achieving high $Hβ$-scores.
title An Analysis under a Unified Fomulation of Learning Algorithms with Output Constraints
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2406.01647