:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Steffes, Bianca, Wiedemann, Nils Torben, Gratz, Alexander, Hochreither, Pamela, Meyer, Jana Elina, Schilke, Katharina Luise
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.05947
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ATLAS: Improving Lay Summarisation with Attribute-based Control
by: Zhang, Zhihao, et al.
Published: (2024)

Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation
by: Cunningham, Eoghan, et al.
Published: (2026)

A Critical Look at Meta-evaluating Summarisation Evaluation Metrics
by: Dai, Xiang, et al.
Published: (2024)

MARS: Multilingual Aspect-centric Review Summarisation
by: Mukku, Sandeep Sricharan, et al.
Published: (2024)

Leveraging Entailment Judgements in Cross-Lingual Summarisation
by: Zhang, Huajian, et al.
Published: (2024)

Log Summarisation for Defect Evolution Analysis
by: Dolga, Rares, et al.
Published: (2024)

Simple and Effective Baselines for Code Summarisation Evaluation
by: Robinson, Jade, et al.
Published: (2025)

Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
by: Zhang, Huajian, et al.
Published: (2024)

Textual Summarisation of Large Sets: Towards a General Approach
by: Kuptavanich, Kittipitch, et al.
Published: (2024)

When Bigger Isn't Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation
by: Huang, Nannan, et al.
Published: (2026)

Faithful Summarisation under Disagreement via Belief-Level Aggregation
by: Aghaebe, Favour Yahdii, et al.
Published: (2026)

Enhancing Long Document Long Form Summarisation with Self-Planning
by: Du, Xiaotang, et al.
Published: (2025)

M2DS: Multilingual Dataset for Multi-document Summarisation
by: Hewapathirana, Kushan, et al.
Published: (2024)

REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities
by: Pugachev, Alexander, et al.
Published: (2025)

Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
by: Li, Hao, et al.
Published: (2024)

LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
by: Bishop, Jennifer A, et al.
Published: (2023)

AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMs
by: El-Haj, Mo, et al.
Published: (2025)

REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
by: Huang, Nannan, et al.
Published: (2025)

Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond
by: Goldsack, Tomas, et al.
Published: (2025)

Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation
by: Huijzer, Willem, et al.
Published: (2025)

Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions
by: Huang, Nannan, et al.
Published: (2025)

Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates
by: Kostikova, Aida, et al.
Published: (2022)

Enhancing Human Evaluation in Machine Translation with Comparative Judgment
by: Song, Yixiao, et al.
Published: (2025)

Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models
by: Allu, Uday, et al.
Published: (2024)

FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation
by: Huy, Hung Nguyen, et al.
Published: (2026)

Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles
by: Touileb, Samia, et al.
Published: (2025)

Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias
by: Huang, Nannan, et al.
Published: (2024)

PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning
by: Rieger, Jonas, et al.
Published: (2024)

Benchmarking LLM-based Relevance Judgment Methods
by: Arabzadeh, Negar, et al.
Published: (2025)

Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes
by: Frei, Johann, et al.
Published: (2025)

Mitigating Hallucinations in Zero-Shot Scientific Summarisation: A Pilot Study
by: Jaaouine, Imane, et al.
Published: (2025)

Machine Learning Information Retrieval and Summarisation to Support Systematic Review on Outcomes Based Contracting
by: Bilal, Iman Munire, et al.
Published: (2024)

Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments
by: Ye, Bingyang, et al.
Published: (2026)

Evaluating and Optimizing Educational Content with Large Language Model Judgments
by: He-Yueya, Joy, et al.
Published: (2024)

Real Images, Worse Judgments: Evaluating Vision-Language Models on Concreteness and Imagery
by: Jiang, Yifan, et al.
Published: (2026)

Harmonising the Clinical Melody: Tuning Large Language Models for Hospital Course Summarisation in Clinical Coding
by: Bi, Bokang, et al.
Published: (2024)

Persona Prompting as a Lens on LLM Social Reasoning
by: Yang, Jing, et al.
Published: (2026)

MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs
by: Ren, Libo, et al.
Published: (2025)

Evaluating the Correctness of Inference Patterns Used by LLMs for Judgment
by: Chen, Lu, et al.
Published: (2024)

Improving LLM-as-a-Judge Inference with the Judgment Distribution
by: Wang, Victor, et al.
Published: (2025)