Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems
| dc.Affiliation | October University for Modern Sciences and Arts (MSA) | |
| dc.contributor.author | Baraa Hikal | |
| dc.contributor.author | Ahmed Nasreldin | |
| dc.contributor.author | Ali Hamdi | |
| dc.contributor.author | Ammar Mohammed | |
| dc.date.accessioned | 2025-11-04T08:05:48Z | |
| dc.date.issued | 2025-10-01 | |
| dc.description | SJR 2024: 0.166; Q4; H-Index: 48 | |
| dc.description.abstract | Hallucination detection in text generation remains a persistent challenge for natural language processing (NLP) systems, frequently resulting in unreliable outputs in applications such as machine translation and definition modeling. Existing methods struggle with data scarcity and the limitations of unlabeled datasets, as highlighted by the SHROOM shared task at SemEval-2024. In this work, we propose a novel framework to address these challenges, introducing DeepSeek Few-shot Optimization to enhance weak label generation through iterative prompt engineering. By restructuring data to align with instruct generative models, we obtained high-quality annotations that considerably improved the performance of downstream models. We further fine-tuned the Mistral-7B-Instruct-v0.3 model on these optimized annotations, enabling it to accurately detect hallucinations in resource-limited settings. Combining this fine-tuned model with ensemble learning strategies, our approach achieved 85.5% accuracy on the test set, setting a new benchmark for the SHROOM task. This study demonstrates the effectiveness of data restructuring, few-shot optimization, and fine-tuning in building scalable and robust hallucination detection frameworks for resource-constrained NLP systems. | |
| dc.description.uri | https://www.scimagojr.com/journalsearch.php?q=21100901469&tip=sid&clean=0 | |
| dc.identifier.citation | Hikal, B., Nasreldin, A., Hamdi, A., & Mohammed, A. (2025). Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems. In Lecture notes in networks and systems (pp. 169–179). https://doi.org/10.1007/978-981-96-6441-2_16 | |
| dc.identifier.doi | https://doi.org/10.1007/978-981-96-6441-2_16 | |
| dc.identifier.other | https://doi.org/10.1007/978-981-96-6441-2_16 | |
| dc.identifier.uri | https://repository.msa.edu.eg/handle/123456789/6583 | |
| dc.language.iso | en_US | |
| dc.publisher | Springer International Publishing AG | |
| dc.relation.ispartofseries | Lecture Notes in Networks and Systems; Volume 1416 LNNS, Pages 169–179 | |
| dc.subject | Few shot | |
| dc.subject | Generative NLP | |
| dc.subject | Hallucination detection | |
| dc.subject | Prompt engineering | |
| dc.subject | Transformer architectures | |
| dc.subject | Weak labeling | |
| dc.title | Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems | |
| dc.type | Article |
