Saved in:
Bibliographic Details
Main Authors: Zhou, Xin, Zhang, Ting, Lo, David
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.15468
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917576994455552
author Zhou, Xin
Zhang, Ting
Lo, David
author_facet Zhou, Xin
Zhang, Ting
Lo, David
contents Previous learning-based vulnerability detection methods relied on either medium-sized pre-trained models or smaller neural networks from scratch. Recent advancements in Large Pre-Trained Language Models (LLMs) have showcased remarkable few-shot learning capabilities in various tasks. However, the effectiveness of LLMs in detecting software vulnerabilities is largely unexplored. This paper aims to bridge this gap by exploring how LLMs perform with various prompts, particularly focusing on two state-of-the-art LLMs: GPT-3.5 and GPT-4. Our experimental results showed that GPT-3.5 achieves competitive performance with the prior state-of-the-art vulnerability detection approach and GPT-4 consistently outperformed the state-of-the-art.
format Preprint
id arxiv_https___arxiv_org_abs_2401_15468
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Large Language Model for Vulnerability Detection: Emerging Results and Future Directions
Zhou, Xin
Zhang, Ting
Lo, David
Software Engineering
Previous learning-based vulnerability detection methods relied on either medium-sized pre-trained models or smaller neural networks from scratch. Recent advancements in Large Pre-Trained Language Models (LLMs) have showcased remarkable few-shot learning capabilities in various tasks. However, the effectiveness of LLMs in detecting software vulnerabilities is largely unexplored. This paper aims to bridge this gap by exploring how LLMs perform with various prompts, particularly focusing on two state-of-the-art LLMs: GPT-3.5 and GPT-4. Our experimental results showed that GPT-3.5 achieves competitive performance with the prior state-of-the-art vulnerability detection approach and GPT-4 consistently outperformed the state-of-the-art.
title Large Language Model for Vulnerability Detection: Emerging Results and Future Directions
topic Software Engineering
url https://arxiv.org/abs/2401.15468