Saved in:
| Main Authors: | Steffes, Bianca, Wiedemann, Nils Torben, Gratz, Alexander, Hochreither, Pamela, Meyer, Jana Elina, Schilke, Katharina Luise |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.05947 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ATLAS: Improving Lay Summarisation with Attribute-based Control
by: Zhang, Zhihao, et al.
Published: (2024)
by: Zhang, Zhihao, et al.
Published: (2024)
Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation
by: Cunningham, Eoghan, et al.
Published: (2026)
by: Cunningham, Eoghan, et al.
Published: (2026)
A Critical Look at Meta-evaluating Summarisation Evaluation Metrics
by: Dai, Xiang, et al.
Published: (2024)
by: Dai, Xiang, et al.
Published: (2024)
MARS: Multilingual Aspect-centric Review Summarisation
by: Mukku, Sandeep Sricharan, et al.
Published: (2024)
by: Mukku, Sandeep Sricharan, et al.
Published: (2024)
Leveraging Entailment Judgements in Cross-Lingual Summarisation
by: Zhang, Huajian, et al.
Published: (2024)
by: Zhang, Huajian, et al.
Published: (2024)
Log Summarisation for Defect Evolution Analysis
by: Dolga, Rares, et al.
Published: (2024)
by: Dolga, Rares, et al.
Published: (2024)
Simple and Effective Baselines for Code Summarisation Evaluation
by: Robinson, Jade, et al.
Published: (2025)
by: Robinson, Jade, et al.
Published: (2025)
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
by: Zhang, Huajian, et al.
Published: (2024)
by: Zhang, Huajian, et al.
Published: (2024)
Textual Summarisation of Large Sets: Towards a General Approach
by: Kuptavanich, Kittipitch, et al.
Published: (2024)
by: Kuptavanich, Kittipitch, et al.
Published: (2024)
When Bigger Isn't Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation
by: Huang, Nannan, et al.
Published: (2026)
by: Huang, Nannan, et al.
Published: (2026)
Faithful Summarisation under Disagreement via Belief-Level Aggregation
by: Aghaebe, Favour Yahdii, et al.
Published: (2026)
by: Aghaebe, Favour Yahdii, et al.
Published: (2026)
Enhancing Long Document Long Form Summarisation with Self-Planning
by: Du, Xiaotang, et al.
Published: (2025)
by: Du, Xiaotang, et al.
Published: (2025)
M2DS: Multilingual Dataset for Multi-document Summarisation
by: Hewapathirana, Kushan, et al.
Published: (2024)
by: Hewapathirana, Kushan, et al.
Published: (2024)
REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities
by: Pugachev, Alexander, et al.
Published: (2025)
by: Pugachev, Alexander, et al.
Published: (2025)
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
by: Li, Hao, et al.
Published: (2024)
by: Li, Hao, et al.
Published: (2024)
LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
by: Bishop, Jennifer A, et al.
Published: (2023)
by: Bishop, Jennifer A, et al.
Published: (2023)
AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMs
by: El-Haj, Mo, et al.
Published: (2025)
by: El-Haj, Mo, et al.
Published: (2025)
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
by: Huang, Nannan, et al.
Published: (2025)
by: Huang, Nannan, et al.
Published: (2025)
Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond
by: Goldsack, Tomas, et al.
Published: (2025)
by: Goldsack, Tomas, et al.
Published: (2025)
Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation
by: Huijzer, Willem, et al.
Published: (2025)
by: Huijzer, Willem, et al.
Published: (2025)
Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions
by: Huang, Nannan, et al.
Published: (2025)
by: Huang, Nannan, et al.
Published: (2025)
Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates
by: Kostikova, Aida, et al.
Published: (2022)
by: Kostikova, Aida, et al.
Published: (2022)
Enhancing Human Evaluation in Machine Translation with Comparative Judgment
by: Song, Yixiao, et al.
Published: (2025)
by: Song, Yixiao, et al.
Published: (2025)
Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models
by: Allu, Uday, et al.
Published: (2024)
by: Allu, Uday, et al.
Published: (2024)
FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation
by: Huy, Hung Nguyen, et al.
Published: (2026)
by: Huy, Hung Nguyen, et al.
Published: (2026)
Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles
by: Touileb, Samia, et al.
Published: (2025)
by: Touileb, Samia, et al.
Published: (2025)
Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias
by: Huang, Nannan, et al.
Published: (2024)
by: Huang, Nannan, et al.
Published: (2024)
PETapter: Leveraging PET-style classification heads for modular few-shot parameter-efficient fine-tuning
by: Rieger, Jonas, et al.
Published: (2024)
by: Rieger, Jonas, et al.
Published: (2024)
Benchmarking LLM-based Relevance Judgment Methods
by: Arabzadeh, Negar, et al.
Published: (2025)
by: Arabzadeh, Negar, et al.
Published: (2025)
Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes
by: Frei, Johann, et al.
Published: (2025)
by: Frei, Johann, et al.
Published: (2025)
Mitigating Hallucinations in Zero-Shot Scientific Summarisation: A Pilot Study
by: Jaaouine, Imane, et al.
Published: (2025)
by: Jaaouine, Imane, et al.
Published: (2025)
Machine Learning Information Retrieval and Summarisation to Support Systematic Review on Outcomes Based Contracting
by: Bilal, Iman Munire, et al.
Published: (2024)
by: Bilal, Iman Munire, et al.
Published: (2024)
Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments
by: Ye, Bingyang, et al.
Published: (2026)
by: Ye, Bingyang, et al.
Published: (2026)
Evaluating and Optimizing Educational Content with Large Language Model Judgments
by: He-Yueya, Joy, et al.
Published: (2024)
by: He-Yueya, Joy, et al.
Published: (2024)
Real Images, Worse Judgments: Evaluating Vision-Language Models on Concreteness and Imagery
by: Jiang, Yifan, et al.
Published: (2026)
by: Jiang, Yifan, et al.
Published: (2026)
Harmonising the Clinical Melody: Tuning Large Language Models for Hospital Course Summarisation in Clinical Coding
by: Bi, Bokang, et al.
Published: (2024)
by: Bi, Bokang, et al.
Published: (2024)
Persona Prompting as a Lens on LLM Social Reasoning
by: Yang, Jing, et al.
Published: (2026)
by: Yang, Jing, et al.
Published: (2026)
MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs
by: Ren, Libo, et al.
Published: (2025)
by: Ren, Libo, et al.
Published: (2025)
Evaluating the Correctness of Inference Patterns Used by LLMs for Judgment
by: Chen, Lu, et al.
Published: (2024)
by: Chen, Lu, et al.
Published: (2024)
Improving LLM-as-a-Judge Inference with the Judgment Distribution
by: Wang, Victor, et al.
Published: (2025)
by: Wang, Victor, et al.
Published: (2025)
Similar Items
-
ATLAS: Improving Lay Summarisation with Attribute-based Control
by: Zhang, Zhihao, et al.
Published: (2024) -
Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation
by: Cunningham, Eoghan, et al.
Published: (2026) -
A Critical Look at Meta-evaluating Summarisation Evaluation Metrics
by: Dai, Xiang, et al.
Published: (2024) -
MARS: Multilingual Aspect-centric Review Summarisation
by: Mukku, Sandeep Sricharan, et al.
Published: (2024) -
Leveraging Entailment Judgements in Cross-Lingual Summarisation
by: Zhang, Huajian, et al.
Published: (2024)