Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Terven, Juan, Cordova-Esparza, Diana M., Ramirez-Pedraza, Alfonso, Chavez-Urbiola, Edgar A., Romero-Gonzalez, Julio A.
Format:	Preprint
Published:	2023
Subjects:	Machine Learning Artificial Intelligence Computer Vision and Pattern Recognition I.4.0
Online Access:	https://arxiv.org/abs/2307.02694
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912322746843136
author	Terven, Juan Cordova-Esparza, Diana M. Ramirez-Pedraza, Alfonso Chavez-Urbiola, Edgar A. Romero-Gonzalez, Julio A.
author_facet	Terven, Juan Cordova-Esparza, Diana M. Ramirez-Pedraza, Alfonso Chavez-Urbiola, Edgar A. Romero-Gonzalez, Julio A.
contents	This paper presents a comprehensive review of loss functions and performance metrics in deep learning, highlighting key developments and practical insights across diverse application areas. We begin by outlining fundamental considerations in classic tasks such as regression and classification, then extend our analysis to specialized domains like computer vision and natural language processing including retrieval-augmented generation. In each setting, we systematically examine how different loss functions and evaluation metrics can be paired to address task-specific challenges such as class imbalance, outliers, and sequence-level optimization. Key contributions of this work include: (1) a unified framework for understanding how losses and metrics align with different learning objectives, (2) an in-depth discussion of multi-loss setups that balance competing goals, and (3) new insights into specialized metrics used to evaluate modern applications like retrieval-augmented generation, where faithfulness and context relevance are pivotal. Along the way, we highlight best practices for selecting or combining losses and metrics based on empirical behaviors and domain constraints. Finally, we identify open problems and promising directions, including the automation of loss-function search and the development of robust, interpretable evaluation measures for increasingly complex deep learning tasks. Our review aims to equip researchers and practitioners with clearer guidance in designing effective training pipelines and reliable model assessments for a wide spectrum of real-world applications.
format	Preprint
id	arxiv_https___arxiv_org_abs_2307_02694
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Loss Functions and Metrics in Deep Learning Terven, Juan Cordova-Esparza, Diana M. Ramirez-Pedraza, Alfonso Chavez-Urbiola, Edgar A. Romero-Gonzalez, Julio A. Machine Learning Artificial Intelligence Computer Vision and Pattern Recognition I.4.0 This paper presents a comprehensive review of loss functions and performance metrics in deep learning, highlighting key developments and practical insights across diverse application areas. We begin by outlining fundamental considerations in classic tasks such as regression and classification, then extend our analysis to specialized domains like computer vision and natural language processing including retrieval-augmented generation. In each setting, we systematically examine how different loss functions and evaluation metrics can be paired to address task-specific challenges such as class imbalance, outliers, and sequence-level optimization. Key contributions of this work include: (1) a unified framework for understanding how losses and metrics align with different learning objectives, (2) an in-depth discussion of multi-loss setups that balance competing goals, and (3) new insights into specialized metrics used to evaluate modern applications like retrieval-augmented generation, where faithfulness and context relevance are pivotal. Along the way, we highlight best practices for selecting or combining losses and metrics based on empirical behaviors and domain constraints. Finally, we identify open problems and promising directions, including the automation of loss-function search and the development of robust, interpretable evaluation measures for increasingly complex deep learning tasks. Our review aims to equip researchers and practitioners with clearer guidance in designing effective training pipelines and reliable model assessments for a wide spectrum of real-world applications.
title	Loss Functions and Metrics in Deep Learning
topic	Machine Learning Artificial Intelligence Computer Vision and Pattern Recognition I.4.0
url	https://arxiv.org/abs/2307.02694

Similar Items