Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

Springer International Publishing AG

Series Info

Lecture Notes in Networks and Systems ; Volume 1416 LNNS , Pages 169 - 179

Orcid

Abstract

Hallucination detection in text generation remains an ongoing struggle for natural language processing (NLP) systems, frequently resulting in unreliable outputs in applications such as machine translation and definition modeling. Existing methods struggle with data scarcity and the limitations of unlabeled datasets, as highlighted by the SHROOM shared task at SemEval-2024. In this work, we propose a novel framework to address these challenges, introducing DeepSeek Few-shot Optimization to enhance weak label generation through iterative prompt engineering. We achieved high-quality annotations that considerably enhanced the performance of downstream models by restructuring data to align with instruct generative models. We further fine-tuned the Mistral-7B-Instruct-v0.3 model on these optimized annotations, enabling it to accurately detect hallucinations in resource-limited settings. Combining this fine-tuned model with ensemble learning strategies, our approach achieved 85.5% accuracy on the test set, setting a new benchmark for the SHROOM task. This study demonstrates the effectiveness of data restructuring, few-shot optimization, and fine-tuning in building scalable and robust hallucination detection frameworks for resource-constrained NLP systems.

Description

SJR 2024 0.166 Q4 H-Index 48

Citation

Hikal, B., Nasreldin, A., Hamdi, A., & Mohammed, A. (2025). Few-Shot Optimized Framework for hallucination Detection in Resource-Limited NLP Systems. In Lecture notes in networks and systems (pp. 169–179). https://doi.org/10.1007/978-981-96-6441-2_16

Endorsement

Review

Supplemented By

Referenced By