FuzzDistill: Intelligent Fuzzing Target Selection using Compile-Time Analysis and Machine Learning

Abstract

Fuzz testing is a fundamental technique for identifying vulnerabilities in software systems. However, the process can be slow and resource-intensive, especially on large codebases. In this work, I present FuzzDistill, an approach that uses compile-time data and machine learning to refine the selection of fuzzing targets. By analyzing compile-time information, such as function call graph features, loop information, and memory operations, FuzzDistill identifies high-priority areas of the codebase that are more likely to contain vulnerabilities. I demonstrate the efficacy of my approach through experiments conducted on real-world software.
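
As a rough illustration of the idea of ranking functions by compile-time features, and not the actual FuzzDistill model or feature set, the sketch below scores a few functions and returns the highest-ranked ones. The feature names, the example functions, and the hand-picked weights are all assumptions made for this example; in the real pipeline a trained model would take the place of the fixed weights.

```python
from dataclasses import dataclass


@dataclass
class FunctionFeatures:
    """Hypothetical per-function features gathered at compile time."""
    name: str
    call_graph_degree: int  # number of callers plus callees (assumed feature)
    loop_count: int         # number of loops in the function (assumed feature)
    memory_ops: int         # loads, stores, and allocations (assumed feature)


# Illustrative weights only; a trained model would replace this table.
WEIGHTS = {"call_graph_degree": 0.2, "loop_count": 0.3, "memory_ops": 0.5}


def score(f: FunctionFeatures) -> float:
    """Weighted sum of the features; higher means higher fuzzing priority."""
    return (
        WEIGHTS["call_graph_degree"] * f.call_graph_degree
        + WEIGHTS["loop_count"] * f.loop_count
        + WEIGHTS["memory_ops"] * f.memory_ops
    )


def rank_targets(functions: list[FunctionFeatures], top_k: int = 2) -> list[str]:
    """Return the names of the top_k functions, highest score first."""
    ranked = sorted(functions, key=score, reverse=True)
    return [f.name for f in ranked[:top_k]]


if __name__ == "__main__":
    # Made-up functions and feature values, for demonstration only.
    candidates = [
        FunctionFeatures("parse_header", 4, 3, 12),
        FunctionFeatures("print_usage", 1, 0, 1),
        FunctionFeatures("copy_buffer", 2, 1, 20),
    ]
    print(rank_targets(candidates))
```

With these made-up values, the memory-heavy functions rank first, which reflects only the weights chosen here and says nothing about the results reported in the paper.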

Modules

Module	Description	GitHub	License
FuzzDistillCC	Compiler back-end for feature extraction	github/Saket-Upadhyay/FuzzDistillCC	CC BY
FuzzDistillML	Model training component	github/Saket-Upadhyay/FuzzDistillML	CC BY
FuzzDistillWeb	Prediction front-end	github/Saket-Upadhyay/FuzzDistillWeb	MIT

Paper

What	Where
PDF	arXiv/pdf/2412.08100
arXiv	arXiv/2412.08100
Presentation	Presentation.pdf

Cite

If you utilize this project or any portion thereof, please ensure proper citation of the following work:

@misc{upadhyay2024fuzzdistillintelligentfuzzingtarget,
      title={FuzzDistill: Intelligent Fuzzing Target Selection using Compile-Time Analysis and Machine Learning}, 
      author={Saket Upadhyay},
      year={2024},
      eprint={2412.08100},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2412.08100}, 
}

Acknowledgement

This project was facilitated by the opportunities provided in the graduate class "CS 6501: Software Security Testing", led by Dr. Jack Davidson, Dr. Jason Hiser, and Dr. Anh Nguyen-Tuong at the University of Virginia. I would like to express my sincere gratitude to all the instructors for creating an environment that fostered exploration and innovation, enabling me to delve into the intersection of fuzz testing and machine learning. The guidance and support received throughout the course were instrumental in shaping this project.

Not peer-reviewed; I mean, my peers did review it though, so idk :D
