Saved in:
Bibliographic Details
Main Authors: Farabi, Ahsan, Khandaker, Israt, Shanto, Ibrahim Khalil, Minhaz, Md Abdul Ahad, Zaman, Tanisha
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.03066
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914072674435072
author Farabi, Ahsan
Khandaker, Israt
Shanto, Ibrahim Khalil
Minhaz, Md Abdul Ahad
Zaman, Tanisha
author_facet Farabi, Ahsan
Khandaker, Israt
Shanto, Ibrahim Khalil
Minhaz, Md Abdul Ahad
Zaman, Tanisha
contents Facial Emotion Recognition (FER) is a key task in affective computing, enabling applications in human-computer interaction, e-learning, healthcare, and safety systems. Despite advances in deep learning, FER remains challenging due to occlusions, illumination and pose variations, subtle intra-class differences, and dataset imbalance that hinders recognition of minority emotions. We present InsideOut, a reproducible FER framework built on EfficientNetV2-S with transfer learning, strong data augmentation, and imbalance-aware optimization. The approach standardizes FER2013 images, applies stratified splitting and augmentation, and fine-tunes a lightweight classification head with class-weighted loss to address skewed distributions. InsideOut achieves 62.8% accuracy with a macro averaged F1 of 0.590 on FER2013, showing competitive results compared to conventional CNN baselines. The novelty lies in demonstrating that efficient architectures, combined with tailored imbalance handling, can provide practical, transparent, and reproducible FER solutions.
format Preprint
id arxiv_https___arxiv_org_abs_2510_03066
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle InsideOut: An EfficientNetV2-S Based Deep Learning Framework for Robust Multi-Class Facial Emotion Recognition
Farabi, Ahsan
Khandaker, Israt
Shanto, Ibrahim Khalil
Minhaz, Md Abdul Ahad
Zaman, Tanisha
Computer Vision and Pattern Recognition
Facial Emotion Recognition (FER) is a key task in affective computing, enabling applications in human-computer interaction, e-learning, healthcare, and safety systems. Despite advances in deep learning, FER remains challenging due to occlusions, illumination and pose variations, subtle intra-class differences, and dataset imbalance that hinders recognition of minority emotions. We present InsideOut, a reproducible FER framework built on EfficientNetV2-S with transfer learning, strong data augmentation, and imbalance-aware optimization. The approach standardizes FER2013 images, applies stratified splitting and augmentation, and fine-tunes a lightweight classification head with class-weighted loss to address skewed distributions. InsideOut achieves 62.8% accuracy with a macro averaged F1 of 0.590 on FER2013, showing competitive results compared to conventional CNN baselines. The novelty lies in demonstrating that efficient architectures, combined with tailored imbalance handling, can provide practical, transparent, and reproducible FER solutions.
title InsideOut: An EfficientNetV2-S Based Deep Learning Framework for Robust Multi-Class Facial Emotion Recognition
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2510.03066