Saved in:
Bibliographic Details
Main Authors: Marques, Miguel, Fernandes, Ana Luísa, Pacheco, Ana Filipa, Rebouças, Rute, Cantante, Inês, Isidro, José, Cunha, Luís Filipe, Jorge, Alípio, Guimarães, Nuno, Nunes, Sérgio, Leal, António, Silvano, Purificação, Campos, Ricardo
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.16607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914337570947072
author Marques, Miguel
Fernandes, Ana Luísa
Pacheco, Ana Filipa
Rebouças, Rute
Cantante, Inês
Isidro, José
Cunha, Luís Filipe
Jorge, Alípio
Guimarães, Nuno
Nunes, Sérgio
Leal, António
Silvano, Purificação
Campos, Ricardo
author_facet Marques, Miguel
Fernandes, Ana Luísa
Pacheco, Ana Filipa
Rebouças, Rute
Cantante, Inês
Isidro, José
Cunha, Luís Filipe
Jorge, Alípio
Guimarães, Nuno
Nunes, Sérgio
Leal, António
Silvano, Purificação
Campos, Ricardo
contents Municipal meeting minutes are formal records documenting the discussions and decisions of local government, yet their content is often lengthy, dense, and difficult for citizens to navigate. Automatic summarization can help address this challenge by producing concise summaries for each discussion subject. Despite its potential, research on summarizing discussion subjects in municipal meeting minutes remains largely unexplored, especially in low-resource languages, where the inherent complexity of these documents adds further challenges. A major bottleneck is the scarcity of datasets containing high-quality, manually crafted summaries, which limits the development and evaluation of effective summarization models for this domain. In this paper, we present CitiLink-Summ, a new corpus of European Portuguese municipal meeting minutes, comprising 100 documents and 2,322 manually hand-written summaries, each corresponding to a distinct discussion subject. Leveraging this dataset, we establish baseline results for automatic summarization in this domain, employing state-of-the-art generative models (e.g., BART, PRIMERA) as well as large language models (LLMs), evaluated with both lexical and semantic metrics such as ROUGE, BLEU, METEOR, and BERTScore. CitiLink-Summ provides the first benchmark for municipal-domain summarization in European Portuguese, offering a valuable resource for advancing NLP research on complex administrative texts.
format Preprint
id arxiv_https___arxiv_org_abs_2602_16607
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes
Marques, Miguel
Fernandes, Ana Luísa
Pacheco, Ana Filipa
Rebouças, Rute
Cantante, Inês
Isidro, José
Cunha, Luís Filipe
Jorge, Alípio
Guimarães, Nuno
Nunes, Sérgio
Leal, António
Silvano, Purificação
Campos, Ricardo
Computation and Language
Municipal meeting minutes are formal records documenting the discussions and decisions of local government, yet their content is often lengthy, dense, and difficult for citizens to navigate. Automatic summarization can help address this challenge by producing concise summaries for each discussion subject. Despite its potential, research on summarizing discussion subjects in municipal meeting minutes remains largely unexplored, especially in low-resource languages, where the inherent complexity of these documents adds further challenges. A major bottleneck is the scarcity of datasets containing high-quality, manually crafted summaries, which limits the development and evaluation of effective summarization models for this domain. In this paper, we present CitiLink-Summ, a new corpus of European Portuguese municipal meeting minutes, comprising 100 documents and 2,322 manually hand-written summaries, each corresponding to a distinct discussion subject. Leveraging this dataset, we establish baseline results for automatic summarization in this domain, employing state-of-the-art generative models (e.g., BART, PRIMERA) as well as large language models (LLMs), evaluated with both lexical and semantic metrics such as ROUGE, BLEU, METEOR, and BERTScore. CitiLink-Summ provides the first benchmark for municipal-domain summarization in European Portuguese, offering a valuable resource for advancing NLP research on complex administrative texts.
title CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes
topic Computation and Language
url https://arxiv.org/abs/2602.16607