Cross-Language Approach for Quranic QA

dc.AffiliationOctober University for modern sciences and Arts MSA
dc.contributor.authorIslam Oshallah
dc.contributor.authorMohamed Basem
dc.contributor.authorAli Hamdi
dc.contributor.authorAmmar Mohammed
dc.date.accessioned2025-11-04T09:04:53Z
dc.date.issued2025-10-01
dc.descriptionSJR 2024 0.166 Q4 H-Index 48
dc.description.abstractQuestion answering systems face critical limitations in languages with limited resources and scarce data, making the development of robust models especially challenging. The Quranic QA system holds significant importance as it facilitates a deeper understanding of the Quran, a Holy text for over a billion people worldwide. However, these systems face unique challenges, including the linguistic disparity between questions written in Modern Standard Arabic and answers found in Quranic verses written in Classical Arabic, and the small size of existing datasets, which further restricts model performance. To address these challenges, we adopt a cross-language approach by (1) Dataset Augmentation: expanding and enriching the dataset through machine translation to convert Arabic questions into English, paraphrasing questions to create linguistic diversity, and retrieving answers from an English translation of the Quran to align with multilingual training requirements; and (2) Language Model Fine-Tuning: utilizing pre-trained models such as BERT-Medium, RoBERTa-Base, DeBERTa-v3-Base, ELECTRA-Large, Flan-T5, Bloom, and Falcon to address the specific requirements of Quranic QA. Experimental results demonstrate that this cross-language approach significantly improves model performance, with RoBERTa-Base achieving the highest MAP@10 (0.34) and MRR (0.52), while DeBERTa-v3-Base excels in Recall@10 (0.50) and Precision@10 (0.24). These findings underscore the effectiveness of cross-language strategies in overcoming linguistic barriers and advancing Quranic QA systems.
dc.description.urihttps://www.scimagojr.com/journalsearch.php?q=21100901469&tip=sid&clean=0
dc.identifier.citationOshallah, I., Basem, M., Hamdi, A., & Mohammed, A. (2025b). Cross-Language approach for Quranic QA. arXiv (Cornell University). https://doi.org/10.48550/arxiv.2501.17449
dc.identifier.doihttps://doi.org/10.48550/arXiv.2501.17449
dc.identifier.otherhttps://doi.org/10.48550/arXiv.2501.17449
dc.identifier.urihttps://repository.msa.edu.eg/handle/123456789/6585
dc.language.isoen_US
dc.publisherSpringer International Publishing AG
dc.relation.ispartofseriesLecture Notes in Networks and Systems ; Volume 1416 LNNS , Pages 385 - 396
dc.subjectClassical arabic
dc.subjectDataset expansion
dc.subjectFine-tuning
dc.subjectModern standard arabic
dc.subjectPassage retrieval
dc.subjectQuran question answering
dc.titleCross-Language Approach for Quranic QA
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2501.17449v1.pdf
Size:
368.38 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
51 B
Format:
Item-specific license agreed upon to submission
Description: