Bibliographic Details
Main Authors: Timko, Daniel, Rahman, Muhammad Lutfor
Format: Preprint
Published: 2024
Subjects: Cryptography and Security
Online Access: https://arxiv.org/abs/2402.18430
author Timko, Daniel
Rahman, Muhammad Lutfor
author_facet Timko, Daniel
Rahman, Muhammad Lutfor
contents While smishing (SMS phishing) attacks have become one of the most common types of social engineering attacks, relevant smishing datasets are lacking. One of the biggest challenges in smishing prevention is the availability of fresh smishing datasets. Additionally, as time passes, smishing campaigns are shut down and crucial information related to the attacks is lost. Given the changing nature of smishing attacks, both researchers and engineers need a consistent flow of new smishing examples to create effective defenses. In this paper, we present the community-sourced smishing dataset from smishtank.com. It provides a wealth of information relevant to combating smishing attacks through the breakdown and analysis of smishing samples at the point of submission. As our contribution, we provide a corpus of 1090 smishing samples that have been publicly submitted through the site. Each message includes information relating to the sender, message body, and any brands referenced in the message. Additionally, when a URL is found, we provide additional information on the domain, VirusTotal results, and a characterization of the URL. Through open access to fresh smishing data, we empower academia and industry to create robust defenses against this evolving threat.
format Preprint
id arxiv_https___arxiv_org_abs_2402_18430
institution arXiv
publishDate 2024
record_format arxiv
title Smishing Dataset I: Phishing SMS Dataset from Smishtank.com
topic Cryptography and Security
url https://arxiv.org/abs/2402.18430