What question did this study set out to answer?

The study aims to investigate the effectiveness of morphological convolutional neural networks in enhancing facial expression recognition.

April 17, 2026 · Open Access

Morphological Convolutional Neural Network for Efficient Facial Expression Recognition

Key Points

The study aims to investigate the effectiveness of morphological convolutional neural networks in enhancing facial expression recognition.
Developed a morphological convolutional neural network architecture incorporating morphological operations and CNN layers.
Created a facial expression dataset combining multiple sources with 3684 images across seven expression classes.
Applied subject-independent data splitting with 10-fold cross-validation for evaluation.
The MCNN1 model achieved an average accuracy of 88.16%; the best MCNN2 variant reached 88.7%.
Compared to baseline models, the MCNN shows competitive performance, with MobileNetV2 at 88.27% and VGG19 at 87.58%.
Compared to baseline models, the proposed model reduced inference latency by 21% and GPU memory usage by 64%.
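The subject-independent 10-fold protocol mentioned in the key points can be illustrated with a minimal sketch. The summary does not say how the authors built their folds, so the round-robin assignment, the function name `subject_independent_folds`, and the toy subject IDs below are all assumptions made for illustration. The only property shown is the one the protocol requires: every image of a given subject lands on the same side of each split.

```python
import random


def subject_independent_folds(subject_ids, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs in which no subject spans both sides.

    subject_ids[i] is the subject label of image i (hypothetical input format).
    """
    subjects = sorted(set(subject_ids))
    if len(subjects) < k:
        raise ValueError("need at least k distinct subjects")
    rng = random.Random(seed)
    rng.shuffle(subjects)
    # Assign whole subjects (not individual images) to folds, round-robin.
    fold_of = {s: i % k for i, s in enumerate(subjects)}
    for fold in range(k):
        test_idx = [i for i, s in enumerate(subject_ids) if fold_of[s] == fold]
        train_idx = [i for i, s in enumerate(subject_ids) if fold_of[s] != fold]
        yield train_idx, test_idx


# Toy data (not from the paper): 30 made-up subjects with 4 images each.
subject_ids = [f"s{n:02d}" for n in range(30) for _ in range(4)]
folds = list(subject_independent_folds(subject_ids, k=10))
```

In practice a library utility such as scikit-learn's `GroupKFold` serves the same purpose; the hand-rolled version here only makes the grouping step explicit.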

Abstract

This study proposes a morphological convolutional neural network (MCNN) architecture that integrates morphological operations with CNN layers for facial expression recognition (FER). Conventional CNN-based FER models primarily rely on appearance features and may be sensitive to illumination and demographic variations. This work investigates whether morphological structural representations provide complementary information to convolutional features. A multi-source and multi-ethnic FER dataset was constructed by combining CK+, JAFFE, KDEF, TFEID, and a newly collected Indonesian Facial Expression dataset, resulting in 3684 images from 326 subjects across seven expression classes. Subject-independent data splitting with 10-fold cross-validation was applied to ensure reliable evaluation. Experimental results show that the proposed MCNN1 model achieves an average accuracy of 88.16%, while the best MCNN2 variant achieves 88.7%, demonstrating competitive performance compared to MobileNetV2 (88.27%), VGG19 (87.58%), and the morphological baseline MNN (50.73%). The proposed model also demonstrates improved computational efficiency, achieving lower inference latency (21%) and reduced GPU memory usage (64%) compared to baseline models. These results indicate that integrating morphological representations into convolutional architectures provides a modest but consistent improvement in FER performance while enhancing generalization and efficiency under heterogeneous data conditions.
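For readers unfamiliar with the morphological operations the abstract refers to, the sketch below shows the classic grayscale dilation, erosion, and morphological gradient with a flat square structuring element. It is not the authors' MCNN layer: the abstract does not describe how the operations are parameterised or combined with convolutions, so the function names, the 3x3 window, and the toy image are illustrative assumptions only.

```python
def _morph(image, size, pick):
    """Apply `pick` (max or min) over a size x size window, clipped at borders."""
    h, w = len(image), len(image[0])
    r = size // 2
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [
                image[j][i]
                for j in range(max(0, y - r), min(h, y + r + 1))
                for i in range(max(0, x - r), min(w, x + r + 1))
            ]
            row.append(pick(window))
        out.append(row)
    return out


def dilate(image, size=3):
    # Dilation: each pixel becomes the maximum of its neighbourhood.
    return _morph(image, size, max)


def erode(image, size=3):
    # Erosion: each pixel becomes the minimum of its neighbourhood.
    return _morph(image, size, min)


def gradient(image, size=3):
    # Morphological gradient: dilation minus erosion, which highlights edges.
    d, e = dilate(image, size), erode(image, size)
    return [[a - b for a, b in zip(dr, er)] for dr, er in zip(d, e)]


# Toy 5x5 image (hypothetical) with a single bright pixel in the centre.
image = [
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 9, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
]
dilated = dilate(image)
eroded = erode(image)
edges = gradient(image)
```

On this toy input, dilation spreads the bright pixel into a 3x3 block and erosion removes it entirely. This is the kind of structural (shape) information, as opposed to appearance information, that the abstract contrasts with ordinary convolutional features.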

Authors

Robert

Gunadarma University

Sarifuddin Madenda

Suryadi Harmanto

Journals

Journal of Imaging

Institutions

Centre National de la Recherche Scientifique

Université de Bourgogne

Gunadarma University
