ARCHIVES

Original Article

Deep Learning Based Facial Emotion Recognition System

Kiran Kumar Raja1 P Sanjay Kumar2 Ch Venkata Gowtham3
1 Assistant Professor, Department of Computer Science and Engineering, Vignan’s Foundation for Science, Technology and Research, Andhra Pradesh, India. 2 3 Department of Computer Science and Engineering, Vignan’s Foundation for Science, Technology and Research, Andhra Pradesh, India.

Published Online: May-August 2026

Pages: 466-478

Abstract

Facial Emotion Recognition (FER) constitutes a fundamental building block of affective computing and human-computer interaction, equipping machines with the capacity to interpret and respond to human emotional states in real time. Even with considerable advances in deep learning, contemporary FER systems continue to face difficulties related to cross-dataset gen-eralization, ambiguity among visually similar emotion categories, and severe class imbalance inherent in curated laboratory datasets. This study proposes a domain-adapted deep Convolutional Neural Network (CNN) trained entirely from scratch on a unified corpus assembled from the Extended Cohn-Kanade (CK+) and Karolinska Directed Emotional Faces (KDEF) benchmarks. The merged dataset comprises 6,530 labeled grayscale facial images at 48×48 pixel resolution, partitioned into 5,224 training samples and 1,306 test samples distributed across ten emotion categories. The proposed network consists of four progressively expanding convolutional blocks, each incorporating paired 3×3 convolution layers, batch normalization, ReLU activation, 2×2 max-pooling, and spatial dropout, culminating in global average pooling followed by two fully connected layers with a ten-way softmax output. The model is optimized over 40 epochs using the Adam algorithm (η = 10−3) with ReduceL ROnPlateau scheduling and early stopping. On the held-out test partition, the proposed model attains a peak accuracy of 90.51% and a macro-averaged F1-score of 0.92, outperforming a fine-tuned MobileNetV2 transfer-learning baseline (88.06%, macro F1 = 0.90) while requiring 36% fewer parameters (1.44 M versus 2.26 M). Comprehensive per-class precision, recall, F1-score, and confusion-matrix analyses confirm the benefits of domain-adapted architectural decisions over generic ImageNet-pretrained backbones. Supplementary mathematical derivations encompassing receptive-field growth, computational complexity, information-theoretic loss bounds, and regularization analysis further substantiate the proposed framework.

Related Articles

2026

Artificial Intelligence in Learning and Teaching

2026

Admin Assist: An AI – Driven Configuration and Orchestration for Enterprise Application

2026

Enhancing Blood Group Identification using pigeon inspired optimization: An Innovative Approach

2026

Eco-Genius: Power Up Smart, Power Down Waste

2026

Crowd-Sourced Disaster Response and Rescue Assistant

2026

Unveiling Deepfake Detection Using Vision Transformers: A Survey and Experimental Study

Share Article

X
LinkedIn
Facebook
WhatsApp

Or copy link

https://www.indjcst.com/archives/10.59256/indjcst.20260502053

*Instagram doesn't support direct link sharing from web. Copy the link and share it in your Instagram story or post.