Retrieval Augmented Generation Based LLM Evaluation For Protocol State Machine Inference With Chain-of-Thought Reasoning

dc.AffiliationOctober University for modern sciences and Arts MSA
dc.contributor.authorYoussef Maklad
dc.contributor.authorFares Wael
dc.contributor.authorWael Elsersy
dc.contributor.authorAli Hamdi
dc.date.accessioned2025-11-04T07:38:43Z
dc.date.issued2025-10-01
dc.descriptionSJR 2024 0.166 Q4 H-Index 48
dc.description.abstractThis paper presents a novel approach to evaluate the efficiency of a RAG-based agentic Large Language Model (LLM) architecture in network packet seed generation for network protocol fuzzing. Enhanced by chain-of-thought (COT) prompting techniques, the proposed approach focuses on the improvement of the seeds’ structural quality in order to guide protocol fuzzing frameworks through a wide exploration of the protocol state space. Our method leverages RAG and text embeddings in two stages. In the first stage, the agent dynamically refers to the Request For Comments (RFC) documents knowledge base for answering queries regarding the protocol Finite State Machine (FSM), then it iteratively reasons through the retrieved knowledge, for output refinement and proper seed placement. In the second stage, we evaluate the response structure quality of the agent’s output, based on metrics as BLEU, ROUGE, and Word Error Rate (WER) by comparing the generated packets against the ground truth packets. Our experiments demonstrate significant improvements of up to 18.19%, 14.81%, and 23.45% in BLEU, ROUGE, and WER, respectively, over baseline models. These results confirm the potential of such approach, improving LLM-based protocol fuzzing frameworks for the identification of hidden vulnerabilities.
dc.description.urihttps://www.scimagojr.com/journalsearch.php?q=21100901469&tip=sid&clean=0
dc.identifier.citationMaklad, Y., Wael, F., Elsersy, W., & Hamdi, A. (2025). Retrieval Augmented Generation Based LLM Evaluation for Protocol State Machine Inference with Chain-of-Thought Reasoning. In Lecture notes in networks and systems (pp. 313–323). https://doi.org/10.1007/978-981-96-6441-2_27
dc.identifier.doihttps://doi.org/10.1007/978-981-96-6441-2_27
dc.identifier.otherhttps://doi.org/10.1007/978-981-96-6441-2_27
dc.identifier.urihttps://repository.msa.edu.eg/handle/123456789/6582
dc.language.isoen_US
dc.publisherSpringer International Publishing AG
dc.relation.ispartofseriesLecture Notes in Networks and Systems ; Volume 1416 LNNS , Pages 313 - 323
dc.subjectFinite-state-machine
dc.subjectInitial seeds
dc.subjectLarge language models
dc.subjectProtocol fuzzing
dc.subjectRetrieval augmented generation
dc.subjectReverse engineering
dc.titleRetrieval Augmented Generation Based LLM Evaluation For Protocol State Machine Inference With Chain-of-Thought Reasoning
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2502.15727v2.pdf
Size:
507.87 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
51 B
Format:
Item-specific license agreed upon to submission
Description: