Saved in:
Bibliographic Details
Main Authors: Terven, Juan, Cordova-Esparza, Diana M., Ramirez-Pedraza, Alfonso, Chavez-Urbiola, Edgar A., Romero-Gonzalez, Julio A.
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2307.02694
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912322746843136
author Terven, Juan
Cordova-Esparza, Diana M.
Ramirez-Pedraza, Alfonso
Chavez-Urbiola, Edgar A.
Romero-Gonzalez, Julio A.
author_facet Terven, Juan
Cordova-Esparza, Diana M.
Ramirez-Pedraza, Alfonso
Chavez-Urbiola, Edgar A.
Romero-Gonzalez, Julio A.
contents This paper presents a comprehensive review of loss functions and performance metrics in deep learning, highlighting key developments and practical insights across diverse application areas. We begin by outlining fundamental considerations in classic tasks such as regression and classification, then extend our analysis to specialized domains like computer vision and natural language processing including retrieval-augmented generation. In each setting, we systematically examine how different loss functions and evaluation metrics can be paired to address task-specific challenges such as class imbalance, outliers, and sequence-level optimization. Key contributions of this work include: (1) a unified framework for understanding how losses and metrics align with different learning objectives, (2) an in-depth discussion of multi-loss setups that balance competing goals, and (3) new insights into specialized metrics used to evaluate modern applications like retrieval-augmented generation, where faithfulness and context relevance are pivotal. Along the way, we highlight best practices for selecting or combining losses and metrics based on empirical behaviors and domain constraints. Finally, we identify open problems and promising directions, including the automation of loss-function search and the development of robust, interpretable evaluation measures for increasingly complex deep learning tasks. Our review aims to equip researchers and practitioners with clearer guidance in designing effective training pipelines and reliable model assessments for a wide spectrum of real-world applications.
format Preprint
id arxiv_https___arxiv_org_abs_2307_02694
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Loss Functions and Metrics in Deep Learning
Terven, Juan
Cordova-Esparza, Diana M.
Ramirez-Pedraza, Alfonso
Chavez-Urbiola, Edgar A.
Romero-Gonzalez, Julio A.
Machine Learning
Artificial Intelligence
Computer Vision and Pattern Recognition
I.4.0
This paper presents a comprehensive review of loss functions and performance metrics in deep learning, highlighting key developments and practical insights across diverse application areas. We begin by outlining fundamental considerations in classic tasks such as regression and classification, then extend our analysis to specialized domains like computer vision and natural language processing including retrieval-augmented generation. In each setting, we systematically examine how different loss functions and evaluation metrics can be paired to address task-specific challenges such as class imbalance, outliers, and sequence-level optimization. Key contributions of this work include: (1) a unified framework for understanding how losses and metrics align with different learning objectives, (2) an in-depth discussion of multi-loss setups that balance competing goals, and (3) new insights into specialized metrics used to evaluate modern applications like retrieval-augmented generation, where faithfulness and context relevance are pivotal. Along the way, we highlight best practices for selecting or combining losses and metrics based on empirical behaviors and domain constraints. Finally, we identify open problems and promising directions, including the automation of loss-function search and the development of robust, interpretable evaluation measures for increasingly complex deep learning tasks. Our review aims to equip researchers and practitioners with clearer guidance in designing effective training pipelines and reliable model assessments for a wide spectrum of real-world applications.
title Loss Functions and Metrics in Deep Learning
topic Machine Learning
Artificial Intelligence
Computer Vision and Pattern Recognition
I.4.0
url https://arxiv.org/abs/2307.02694