Salvato in:
Dettagli Bibliografici
Autore principale: Gaudance Stanslaus Tesha
Natura: Recurso digital
Lingua:
Pubblicazione: Zenodo 2026
Soggetti:
Accesso online:https://doi.org/10.5281/zenodo.18738549
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866901569431142400
author Gaudance Stanslaus Tesha
author_facet Gaudance Stanslaus Tesha
contents <p><span lang="EN-US">In recent years, cybersecurity threats have become increasingly prevalent worldwide, with scam messages especially those delivered via text and online platforms emerging as a major concern. These messages aim to trick recipients into disclosing personal or financial information, leading to identity theft, privacy infringements, and financial loss. In Tanzania, the rapid growth of mobile communication technologies and mobile money services has intensified this threat. Kiswahili, being the dominant language in digital communication, adds a unique dimension to the problem. This study develops Natural Language Processing (NLP)-driven Machine Learning (ML) models to detect scam messages written in Kiswahili. A labeled dataset was constructed and models such as Support Vector Machine (SVM), Random Forest (RF), Bi-LSTM, and Swahili-BERT were evaluated. Results indicate that Swahili-BERT achieved the highest detection accuracy. The study highlights the importance of context-aware, language-specific NLP tools and recommends real-time, user-friendly, and privacy-compliant deployment strategies.</span></p>
format Recurso digital
id zenodo_https___doi_org_10_5281_zenodo_18738549
institution Zenodo
language
publishDate 2026
publisher Zenodo
record_format zenodo
spellingShingle AUTOMATED DETECTION OF SWAHILI SCAM MESSAGES IN TANZANIA USING NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING MODELS
Gaudance Stanslaus Tesha
Cybersecurity, Scam Detection, Kiswahili NLP, Machine Learning, Swahili-BERT, Mobile Money Fraud, Low-Resource Languages.
<p><span lang="EN-US">In recent years, cybersecurity threats have become increasingly prevalent worldwide, with scam messages especially those delivered via text and online platforms emerging as a major concern. These messages aim to trick recipients into disclosing personal or financial information, leading to identity theft, privacy infringements, and financial loss. In Tanzania, the rapid growth of mobile communication technologies and mobile money services has intensified this threat. Kiswahili, being the dominant language in digital communication, adds a unique dimension to the problem. This study develops Natural Language Processing (NLP)-driven Machine Learning (ML) models to detect scam messages written in Kiswahili. A labeled dataset was constructed and models such as Support Vector Machine (SVM), Random Forest (RF), Bi-LSTM, and Swahili-BERT were evaluated. Results indicate that Swahili-BERT achieved the highest detection accuracy. The study highlights the importance of context-aware, language-specific NLP tools and recommends real-time, user-friendly, and privacy-compliant deployment strategies.</span></p>
title AUTOMATED DETECTION OF SWAHILI SCAM MESSAGES IN TANZANIA USING NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING MODELS
topic Cybersecurity, Scam Detection, Kiswahili NLP, Machine Learning, Swahili-BERT, Mobile Money Fraud, Low-Resource Languages.
url https://doi.org/10.5281/zenodo.18738549