Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems
Publisher
Springer International Publishing AG
Series Info
Lecture Notes in Networks and Systems; Volume 1416 LNNS, Pages 169–179
Abstract
Hallucination detection in text generation remains a persistent challenge for natural language processing (NLP) systems, and undetected hallucinations frequently result in unreliable outputs in applications such as machine translation and definition modeling. Existing methods are hampered by data scarcity and by the limitations of unlabeled datasets, as highlighted by the SHROOM shared task at SemEval-2024. In this work, we propose a novel framework to address these challenges, introducing DeepSeek Few-shot Optimization to enhance weak label generation through iterative prompt engineering. By restructuring the data to align with instruction-tuned generative models, we obtained high-quality annotations that considerably improved the performance of downstream models. We further fine-tuned the Mistral-7B-Instruct-v0.3 model on these optimized annotations, enabling it to accurately detect hallucinations in resource-limited settings. Combining this fine-tuned model with ensemble learning strategies, our approach achieved 85.5% accuracy on the test set, setting a new benchmark for the SHROOM task. This study demonstrates the effectiveness of data restructuring, few-shot optimization, and fine-tuning in building scalable and robust hallucination detection frameworks for resource-constrained NLP systems.
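As an illustration only, two steps named in the abstract (few-shot prompting for weak labels, and ensembling of predictions) might be sketched as follows. This is a minimal sketch and not the authors' implementation: the prompt wording, the field names `source`, `hypothesis`, and `label`, the label strings, and the helper names `build_few_shot_prompt` and `majority_vote` are all assumptions, and simple majority voting is used as a stand-in because the abstract does not specify the ensemble strategy.

```python
from collections import Counter


def build_few_shot_prompt(examples, source, hypothesis):
    """Format labeled examples plus one unlabeled query as an instruction prompt.

    The wording and field names are hypothetical, chosen for illustration.
    """
    lines = [
        "Decide whether the hypothesis is a hallucination with respect to the source.",
        "Answer with one label: Hallucination or Not Hallucination.",
        "",
    ]
    # Each few-shot example shows the model a fully labeled case.
    for ex in examples:
        lines += [
            f"Source: {ex['source']}",
            f"Hypothesis: {ex['hypothesis']}",
            f"Label: {ex['label']}",
            "",
        ]
    # The query is left unlabeled so the model completes the final "Label:".
    lines += [f"Source: {source}", f"Hypothesis: {hypothesis}", "Label:"]
    return "\n".join(lines)


def majority_vote(predictions):
    """Return the most common label; ties resolve to the label seen first."""
    return Counter(predictions).most_common(1)[0][0]


if __name__ == "__main__":
    shots = [
        {
            "source": "The cat sat on the mat.",
            "hypothesis": "A dog sat on the mat.",
            "label": "Hallucination",
        }
    ]
    print(build_few_shot_prompt(shots, "It rained all day.", "It rained."))
    # Combine labels from several (hypothetical) models for one item.
    print(majority_vote(["Hallucination", "Not Hallucination", "Hallucination"]))
```

In such a setup, the prompt would be sent to a generative model to obtain a weak label for each unlabeled item, and the vote would combine the outputs of several detectors for the same item.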
Description
SJR 2024
0.166
Q4
H-Index
48
Citation
Hikal, B., Nasreldin, A., Hamdi, A., & Mohammed, A. (2025). Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems. In Lecture Notes in Networks and Systems (pp. 169–179). https://doi.org/10.1007/978-981-96-6441-2_16
