:: Library Catalog

Bibliographic Details
Main Authors:	Dai, Xiang; Karimi, Sarvnaz; Fang, Biaoyan
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2409.19507

Similar Items

Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques
by: Dai, Xiang, et al.
Published: (2024)

MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction
by: Dai, Xiang, et al.
Published: (2024)

CAIRNS: Balancing Readability and Scientific Accuracy in Climate Adaptation Question Answering
by: Kong, Liangji, et al.
Published: (2025)

Can AI Extract Antecedent Factors of Human Trust in AI? An Application of Information Extraction for Scientific Literature in Behavioural and Computer Sciences
by: McGrath, Melanie, et al.
Published: (2024)

CSIRO-LT at SemEval-2025 Task 11: Adapting LLMs for Emotion Recognition for Multiple Languages
by: Chen, Jiyu, et al.
Published: (2025)

A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice
by: Opitz, Juri
Published: (2024)

Summarisation of German Judgments in conjunction with a Class-based Evaluation
by: Steffes, Bianca, et al.
Published: (2025)

Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation
by: Cunningham, Eoghan, et al.
Published: (2026)

Contextual Metric Meta-Evaluation by Measuring Local Metric Accuracy
by: Deviyani, Athiya, et al.
Published: (2025)

Controlling Distributional Bias in Multi-Round LLM Generation via KL-Optimized Fine-Tuning
by: Jiang, Yanbei, et al.
Published: (2026)

MARS: Multilingual Aspect-centric Review Summarisation
by: Mukku, Sandeep Sricharan, et al.
Published: (2024)

Leveraging Entailment Judgements in Cross-Lingual Summarisation
by: Zhang, Huajian, et al.
Published: (2024)

Log Summarisation for Defect Evolution Analysis
by: Dolga, Rares, et al.
Published: (2024)

Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
by: Zhang, Huajian, et al.
Published: (2024)

Simple and Effective Baselines for Code Summarisation Evaluation
by: Robinson, Jade, et al.
Published: (2025)

When Bigger Isn't Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation
by: Huang, Nannan, et al.
Published: (2026)

ATLAS: Improving Lay Summarisation with Attribute-based Control
by: Zhang, Zhihao, et al.
Published: (2024)

Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
by: Li, Hao, et al.
Published: (2024)

Textual Summarisation of Large Sets: Towards a General Approach
by: Kuptavanich, Kittipitch, et al.
Published: (2024)

M2DS: Multilingual Dataset for Multi-document Summarisation
by: Hewapathirana, Kushan, et al.
Published: (2024)

Faithful Summarisation under Disagreement via Belief-Level Aggregation
by: Aghaebe, Favour Yahdii, et al.
Published: (2026)

Enhancing Long Document Long Form Summarisation with Self-Planning
by: Du, Xiaotang, et al.
Published: (2025)

LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
by: Bishop, Jennifer A, et al.
Published: (2023)

AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMs
by: El-Haj, Mo, et al.
Published: (2025)

REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
by: Huang, Nannan, et al.
Published: (2025)

Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth
by: Gur-Arieh, Yoav, et al.
Published: (2026)

FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation
by: Huy, Hung Nguyen, et al.
Published: (2026)

Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles
by: Touileb, Samia, et al.
Published: (2025)

Machine Learning Information Retrieval and Summarisation to Support Systematic Review on Outcomes Based Contracting
by: Bilal, Iman Munire, et al.
Published: (2024)

Dynamic Meta-Metrics: Source-Sentence Conditioned Weighting for MT Evaluation
by: Zhang, Luke, et al.
Published: (2026)

Mind the Style Gap: Meta-Evaluation of Style and Attribute Transfer Metrics
by: Pauli, Amalie Brogaard, et al.
Published: (2025)

Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond
by: Goldsack, Tomas, et al.
Published: (2025)

Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation
by: Huijzer, Willem, et al.
Published: (2025)

Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions
by: Huang, Nannan, et al.
Published: (2025)

Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
by: Ramprasad, Sanjana, et al.
Published: (2024)

Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias
by: Huang, Nannan, et al.
Published: (2024)

Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!
by: Perrella, Stefano, et al.
Published: (2024)

Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models
by: Allu, Uday, et al.
Published: (2024)

LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation
by: Eigler, Lukáš, et al.
Published: (2026)

KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models
by: Gul, Haji, et al.
Published: (2025)