Saved in:
Bibliographic Details
Main Authors: D'Elia, Marco, Finocchi, Irene, Patrignani, Maurizio
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.23948
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915823190278144
author D'Elia, Marco
Finocchi, Irene
Patrignani, Maurizio
author_facet D'Elia, Marco
Finocchi, Irene
Patrignani, Maurizio
contents Natural Language Processing (NLP) provides highly effective tools for interpreting and handling human language, offering a broad spectrum of applications. In this paper, we address a classic combinatorial problem -- finding graph partitions with high modularity -- by applying NLP techniques that compute term frequency and inverse document frequency (TF-IDF) alongside machine learning clustering algorithms. We present a new framework, called Clique-TF-IDF, designed for graph partitioning, a task that holds significant relevance across various network analysis contexts. This approach uses dense substructures of the graph, specifically maximal cliques, to represent each vertex in terms of the cliques it is part of, in a manner akin to term-document matrices. Experiments show that Clique-TF-IDF yields results that are comparable to or outperform the current state-of-the-art algorithms, whether or not the number of partitions is known in advance. Although this framework emphasizes on cliques and partitioning, it can be extended to devise AI-driven solutions for a variety of challenging combinatorial problems that can leverage efficiently enumerable substructures.
format Preprint
id arxiv_https___arxiv_org_abs_2602_23948
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle High-Modularity Graph Partitioning Through NLP Techniques and Maximal Clique Enumeration
D'Elia, Marco
Finocchi, Irene
Patrignani, Maurizio
Social and Information Networks
Natural Language Processing (NLP) provides highly effective tools for interpreting and handling human language, offering a broad spectrum of applications. In this paper, we address a classic combinatorial problem -- finding graph partitions with high modularity -- by applying NLP techniques that compute term frequency and inverse document frequency (TF-IDF) alongside machine learning clustering algorithms. We present a new framework, called Clique-TF-IDF, designed for graph partitioning, a task that holds significant relevance across various network analysis contexts. This approach uses dense substructures of the graph, specifically maximal cliques, to represent each vertex in terms of the cliques it is part of, in a manner akin to term-document matrices. Experiments show that Clique-TF-IDF yields results that are comparable to or outperform the current state-of-the-art algorithms, whether or not the number of partitions is known in advance. Although this framework emphasizes on cliques and partitioning, it can be extended to devise AI-driven solutions for a variety of challenging combinatorial problems that can leverage efficiently enumerable substructures.
title High-Modularity Graph Partitioning Through NLP Techniques and Maximal Clique Enumeration
topic Social and Information Networks
url https://arxiv.org/abs/2602.23948