Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lin, Jiongliang, Guo, Yiwen, Chen, Hao
Format:	Preprint
Published:	2024
Subjects:	Cryptography and Security Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2404.13402
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929321908633600
author	Lin, Jiongliang Guo, Yiwen Chen, Hao
author_facet	Lin, Jiongliang Guo, Yiwen Chen, Hao
contents	Intrusion detection is a long standing and crucial problem in security. A system capable of detecting intrusions automatically is on great demand in enterprise security solutions. Existing solutions rely heavily on hand-crafted rules designed by security operators, which suffer from high false negative rates and poor generalization ability to new, zero-day attacks at scale. AI and machine learning offer promising solutions to address the issues, by inspecting abnormal user behaviors intelligently and automatically from data. However, existing learning-based intrusion detection systems in the literature are mostly designed for small data, and they lack the ability to leverage the power of big data in cloud environments. In this paper, we target at this problem and introduce an intrusion detection system which incorporates large-scale pre-training, so as to train a large language model based on tens of millions of command lines for AI-based intrusion detection. Experiments performed on 30 million training samples and 10 million test samples verify the effectiveness of our solution.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_13402
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Intrusion Detection at Scale with the Assistance of a Command-line Language Model Lin, Jiongliang Guo, Yiwen Chen, Hao Cryptography and Security Artificial Intelligence Computation and Language Intrusion detection is a long standing and crucial problem in security. A system capable of detecting intrusions automatically is on great demand in enterprise security solutions. Existing solutions rely heavily on hand-crafted rules designed by security operators, which suffer from high false negative rates and poor generalization ability to new, zero-day attacks at scale. AI and machine learning offer promising solutions to address the issues, by inspecting abnormal user behaviors intelligently and automatically from data. However, existing learning-based intrusion detection systems in the literature are mostly designed for small data, and they lack the ability to leverage the power of big data in cloud environments. In this paper, we target at this problem and introduce an intrusion detection system which incorporates large-scale pre-training, so as to train a large language model based on tens of millions of command lines for AI-based intrusion detection. Experiments performed on 30 million training samples and 10 million test samples verify the effectiveness of our solution.
title	Intrusion Detection at Scale with the Assistance of a Command-line Language Model
topic	Cryptography and Security Artificial Intelligence Computation and Language
url	https://arxiv.org/abs/2404.13402

Similar Items