Deep Learning-Based Multiclass Classification of Oral Lesions with Stratified Augmentation
| dc.Affiliation | October University for modern sciences and Arts MSA | |
| dc.contributor.author | Joy Naoum | |
| dc.contributor.author | Revana Salama | |
| dc.contributor.author | Ali Hamdi | |
| dc.date.accessioned | 2026-06-02T07:36:07Z | |
| dc.date.issued | 2026-05-01 | |
| dc.description | SJR 2025 0.119 Q4 H-Index 40 Subject Area and Category: Computer Science Computer Networks and Communications Computer Science Applications Information Systems Engineering Electrical and Electronic Engineering Media Technology | |
| dc.description.abstract | Oral cancer is frequently diagnosed at later stages due to its similarity to other lesions. Existing research on computer-aided diagnosis has made progress using deep learning; however, most approaches remain limited by small, imbalanced datasets and a dependence on single-modality features, which restricts model generalization in real-world clinical settings. To address these limitations, this study proposes a novel data-augmentation–driven multimodal feature-fusion framework integrated within a (Vision Recognition)VR-assisted oral cancer recognition system. Our method combines extensive data-centric augmentation with fused clinical and image-based representations to enhance model robustness and reduce diagnostic ambiguity. Using a stratified training pipeline and an EfficientNetV2-B1 backbone, the system improves feature diversity, mitigates imbalance, and strengthens the learned multimodal embedding. Experimental evaluation demonstrates that the proposed framework achieves an overall accuracy of 82.57% on 2 classes, 65.13% on 3 classes, and 54.97% on 4 classes, outperforming traditional single-stream CNN models. These results highlight the effectiveness of multimodal feature fusion combined with strategic augmentation for reliable early oral cancer lesion recognition and serve as a foundation for immersive VR-based clinical decision-support tools. | |
| dc.description.uri | https://www.scimagojr.com/journalsearch.php?q=21100975545&tip=sid&clean=0 | |
| dc.identifier.citation | Naoum, J., Salama, R., & Hamdi, A. (2026). Deep Learning-Based Multiclass Classification of Oral Lesions with Stratified Augmentation. Lecture Notes on Data Engineering and Communications Technologies, 200–209. https://doi.org/10.1007/978-3-032-23035-5_19 | |
| dc.identifier.doi | https://doi.org/10.1007/978-3-032-23035-5_19 | |
| dc.identifier.other | https://doi.org/10.1007/978-3-032-23035-5_19 | |
| dc.identifier.uri | https://repository.msa.edu.eg/handle/123456789/6776 | |
| dc.language.iso | en_US | |
| dc.publisher | Springer International Publishing AG | |
| dc.relation.ispartofseries | Lecture Notes on Data Engineering and Communications Technologies ; Volume 293 , Pages 200 - 209 | |
| dc.subject | Data-augmentation | |
| dc.subject | Deep learning | |
| dc.subject | Multimodal | |
| dc.subject | Oral cancer | |
| dc.subject | Stratified-sampling | |
| dc.subject | Vision recognition | |
| dc.title | Deep Learning-Based Multiclass Classification of Oral Lesions with Stratified Augmentation | |
| dc.type | Book chapter |
