Saved in:
| Main Authors: | Dai, Xiang, Karimi, Sarvnaz, Fang, Biaoyan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.19507 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques
by: Dai, Xiang, et al.
Published: (2024)
by: Dai, Xiang, et al.
Published: (2024)
MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction
by: Dai, Xiang, et al.
Published: (2024)
by: Dai, Xiang, et al.
Published: (2024)
CAIRNS: Balancing Readability and Scientific Accuracy in Climate Adaptation Question Answering
by: Kong, Liangji, et al.
Published: (2025)
by: Kong, Liangji, et al.
Published: (2025)
Can AI Extract Antecedent Factors of Human Trust in AI? An Application of Information Extraction for Scientific Literature in Behavioural and Computer Sciences
by: McGrath, Melanie, et al.
Published: (2024)
by: McGrath, Melanie, et al.
Published: (2024)
CSIRO-LT at SemEval-2025 Task 11: Adapting LLMs for Emotion Recognition for Multiple Languages
by: Chen, Jiyu, et al.
Published: (2025)
by: Chen, Jiyu, et al.
Published: (2025)
A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice
by: Opitz, Juri
Published: (2024)
by: Opitz, Juri
Published: (2024)
Summarisation of German Judgments in conjunction with a Class-based Evaluation
by: Steffes, Bianca, et al.
Published: (2025)
by: Steffes, Bianca, et al.
Published: (2025)
Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation
by: Cunningham, Eoghan, et al.
Published: (2026)
by: Cunningham, Eoghan, et al.
Published: (2026)
Contextual Metric Meta-Evaluation by Measuring Local Metric Accuracy
by: Deviyani, Athiya, et al.
Published: (2025)
by: Deviyani, Athiya, et al.
Published: (2025)
Controlling Distributional Bias in Multi-Round LLM Generation via KL-Optimized Fine-Tuning
by: Jiang, Yanbei, et al.
Published: (2026)
by: Jiang, Yanbei, et al.
Published: (2026)
MARS: Multilingual Aspect-centric Review Summarisation
by: Mukku, Sandeep Sricharan, et al.
Published: (2024)
by: Mukku, Sandeep Sricharan, et al.
Published: (2024)
Leveraging Entailment Judgements in Cross-Lingual Summarisation
by: Zhang, Huajian, et al.
Published: (2024)
by: Zhang, Huajian, et al.
Published: (2024)
Log Summarisation for Defect Evolution Analysis
by: Dolga, Rares, et al.
Published: (2024)
by: Dolga, Rares, et al.
Published: (2024)
Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks
by: Zhang, Huajian, et al.
Published: (2024)
by: Zhang, Huajian, et al.
Published: (2024)
Simple and Effective Baselines for Code Summarisation Evaluation
by: Robinson, Jade, et al.
Published: (2025)
by: Robinson, Jade, et al.
Published: (2025)
When Bigger Isn't Better: A Comprehensive Fairness Evaluation of Political Bias in Multi-News Summarisation
by: Huang, Nannan, et al.
Published: (2026)
by: Huang, Nannan, et al.
Published: (2026)
ATLAS: Improving Lay Summarisation with Attribute-based Control
by: Zhang, Zhihao, et al.
Published: (2024)
by: Zhang, Zhihao, et al.
Published: (2024)
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
by: Li, Hao, et al.
Published: (2024)
by: Li, Hao, et al.
Published: (2024)
Textual Summarisation of Large Sets: Towards a General Approach
by: Kuptavanich, Kittipitch, et al.
Published: (2024)
by: Kuptavanich, Kittipitch, et al.
Published: (2024)
M2DS: Multilingual Dataset for Multi-document Summarisation
by: Hewapathirana, Kushan, et al.
Published: (2024)
by: Hewapathirana, Kushan, et al.
Published: (2024)
Faithful Summarisation under Disagreement via Belief-Level Aggregation
by: Aghaebe, Favour Yahdii, et al.
Published: (2026)
by: Aghaebe, Favour Yahdii, et al.
Published: (2026)
Enhancing Long Document Long Form Summarisation with Self-Planning
by: Du, Xiaotang, et al.
Published: (2025)
by: Du, Xiaotang, et al.
Published: (2025)
LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation
by: Bishop, Jennifer A, et al.
Published: (2023)
by: Bishop, Jennifer A, et al.
Published: (2023)
AraFinNews: Arabic Financial Summarisation with Domain-Adapted LLMs
by: El-Haj, Mo, et al.
Published: (2025)
by: El-Haj, Mo, et al.
Published: (2025)
REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
by: Huang, Nannan, et al.
Published: (2025)
by: Huang, Nannan, et al.
Published: (2025)
Faithfulness Metrics Don't Measure Faithfulness: A Meta-Evaluation with Ground Truth
by: Gur-Arieh, Yoav, et al.
Published: (2026)
by: Gur-Arieh, Yoav, et al.
Published: (2026)
FreeTxt-Vi: A Benchmarked Vietnamese-English Toolkit for Segmentation, Sentiment, and Summarisation
by: Huy, Hung Nguyen, et al.
Published: (2026)
by: Huy, Hung Nguyen, et al.
Published: (2026)
Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles
by: Touileb, Samia, et al.
Published: (2025)
by: Touileb, Samia, et al.
Published: (2025)
Machine Learning Information Retrieval and Summarisation to Support Systematic Review on Outcomes Based Contracting
by: Bilal, Iman Munire, et al.
Published: (2024)
by: Bilal, Iman Munire, et al.
Published: (2024)
Dynamic Meta-Metrics: Source-Sentence Conditioned Weighting for MT Evaluation
by: Zhang, Luke, et al.
Published: (2026)
by: Zhang, Luke, et al.
Published: (2026)
Mind the Style Gap: Meta-Evaluation of Style and Attribute Transfer Metrics
by: Pauli, Amalie Brogaard, et al.
Published: (2025)
by: Pauli, Amalie Brogaard, et al.
Published: (2025)
Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond
by: Goldsack, Tomas, et al.
Published: (2025)
by: Goldsack, Tomas, et al.
Published: (2025)
Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation
by: Huijzer, Willem, et al.
Published: (2025)
by: Huijzer, Willem, et al.
Published: (2025)
Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions
by: Huang, Nannan, et al.
Published: (2025)
by: Huang, Nannan, et al.
Published: (2025)
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
by: Ramprasad, Sanjana, et al.
Published: (2024)
by: Ramprasad, Sanjana, et al.
Published: (2024)
Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias
by: Huang, Nannan, et al.
Published: (2024)
by: Huang, Nannan, et al.
Published: (2024)
Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!
by: Perrella, Stefano, et al.
Published: (2024)
by: Perrella, Stefano, et al.
Published: (2024)
Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models
by: Allu, Uday, et al.
Published: (2024)
by: Allu, Uday, et al.
Published: (2024)
LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation
by: Eigler, Lukáš, et al.
Published: (2026)
by: Eigler, Lukáš, et al.
Published: (2026)
KG-EDAS: A Meta-Metric Framework for Evaluating Knowledge Graph Completion Models
by: Gul, Haji, et al.
Published: (2025)
by: Gul, Haji, et al.
Published: (2025)
Similar Items
-
Identifying Health Risks from Family History: A Survey of Natural Language Processing Techniques
by: Dai, Xiang, et al.
Published: (2024) -
MultiADE: A Multi-domain Benchmark for Adverse Drug Event Extraction
by: Dai, Xiang, et al.
Published: (2024) -
CAIRNS: Balancing Readability and Scientific Accuracy in Climate Adaptation Question Answering
by: Kong, Liangji, et al.
Published: (2025) -
Can AI Extract Antecedent Factors of Human Trust in AI? An Application of Information Extraction for Scientific Literature in Behavioural and Computer Sciences
by: McGrath, Melanie, et al.
Published: (2024) -
CSIRO-LT at SemEval-2025 Task 11: Adapting LLMs for Emotion Recognition for Multiple Languages
by: Chen, Jiyu, et al.
Published: (2025)