Saved in:
Bibliographic Details
Main Authors: Liang, Zhanbo, Guo, Jie, Qiu, Weidong, Huang, Zheng, Li, Shujun
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2311.15917
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912295778516992
author Liang, Zhanbo
Guo, Jie
Qiu, Weidong
Huang, Zheng
Li, Shujun
author_facet Liang, Zhanbo
Guo, Jie
Qiu, Weidong
Huang, Zheng
Li, Shujun
contents With the rise of Web 2.0 platforms such as online social media, people's private information, such as their location, occupation and even family information, is often inadvertently disclosed through online discussions. Therefore, it is important to detect such unwanted privacy disclosures to help alert people affected and the online platform. In this paper, privacy disclosure detection is modeled as a multi-label text classification (MLTC) problem, and a new privacy disclosure detection model is proposed to construct an MLTC classifier for detecting online privacy disclosures. This classifier takes an online post as the input and outputs multiple labels, each reflecting a possible privacy disclosure. The proposed presentation method combines three different sources of information, the input text itself, the label-to-text correlation and the label-to-label correlation. A double-attention mechanism is used to combine the first two sources of information, and a graph convolutional network (GCN) is employed to extract the third source of information that is then used to help fuse features extracted from the first two sources of information. Our extensive experimental results, obtained on a public dataset of privacy-disclosing posts on Twitter, demonstrated that our proposed privacy disclosure detection method significantly and consistently outperformed other state-of-the-art methods in terms of all key performance indicators.
format Preprint
id arxiv_https___arxiv_org_abs_2311_15917
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle When Graph Convolution Meets Double Attention: Online Privacy Disclosure Detection with Multi-Label Text Classification
Liang, Zhanbo
Guo, Jie
Qiu, Weidong
Huang, Zheng
Li, Shujun
Cryptography and Security
With the rise of Web 2.0 platforms such as online social media, people's private information, such as their location, occupation and even family information, is often inadvertently disclosed through online discussions. Therefore, it is important to detect such unwanted privacy disclosures to help alert people affected and the online platform. In this paper, privacy disclosure detection is modeled as a multi-label text classification (MLTC) problem, and a new privacy disclosure detection model is proposed to construct an MLTC classifier for detecting online privacy disclosures. This classifier takes an online post as the input and outputs multiple labels, each reflecting a possible privacy disclosure. The proposed presentation method combines three different sources of information, the input text itself, the label-to-text correlation and the label-to-label correlation. A double-attention mechanism is used to combine the first two sources of information, and a graph convolutional network (GCN) is employed to extract the third source of information that is then used to help fuse features extracted from the first two sources of information. Our extensive experimental results, obtained on a public dataset of privacy-disclosing posts on Twitter, demonstrated that our proposed privacy disclosure detection method significantly and consistently outperformed other state-of-the-art methods in terms of all key performance indicators.
title When Graph Convolution Meets Double Attention: Online Privacy Disclosure Detection with Multi-Label Text Classification
topic Cryptography and Security
url https://arxiv.org/abs/2311.15917