:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zhang, Huajian, Xu, Yumo, Perez-Beltrachini, Laura
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Computation and Language
Online-Zugang:	https://arxiv.org/abs/2402.17630
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Leveraging Entailment Judgements in Cross-Lingual Summarisation
von: Zhang, Huajian, et al.
Veröffentlicht: (2024)

Enhancing Long Document Long Form Summarisation with Self-Planning
von: Du, Xiaotang, et al.
Veröffentlicht: (2025)

Uncertainty Quantification in Retrieval Augmented Question Answering
von: Perez-Beltrachini, Laura, et al.
Veröffentlicht: (2025)

Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models
von: Wu, Zhengxuan, et al.
Veröffentlicht: (2024)

Faithful Summarisation under Disagreement via Belief-Level Aggregation
von: Aghaebe, Favour Yahdii, et al.
Veröffentlicht: (2026)

Reliable Fine-Grained Evaluation of Natural Language Math Proofs
von: Ma, Wenjie, et al.
Veröffentlicht: (2025)

Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics
von: Zhang, Weijia, et al.
Veröffentlicht: (2024)

Atomic-SNLI: Fine-Grained Natural Language Inference through Atomic Fact Decomposition
von: Huang, Minghui
Veröffentlicht: (2026)

SuperCLUE-Fin: Graded Fine-Grained Analysis of Chinese LLMs on Diverse Financial Tasks and Applications
von: Xu, Liang, et al.
Veröffentlicht: (2024)

MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs
von: Liu, Gabrielle Kaili-May, et al.
Veröffentlicht: (2025)

Learning to Refine with Fine-Grained Natural Language Feedback
von: Wadhwa, Manya, et al.
Veröffentlicht: (2024)

A Critical Look at Meta-evaluating Summarisation Evaluation Metrics
von: Dai, Xiang, et al.
Veröffentlicht: (2024)

Summarisation of German Judgments in conjunction with a Class-based Evaluation
von: Steffes, Bianca, et al.
Veröffentlicht: (2025)

Evaluating LLM-Driven Summarisation of Parliamentary Debates with Computational Argumentation
von: Cunningham, Eoghan, et al.
Veröffentlicht: (2026)

When Scale Meets Diversity: Evaluating Language Models on Fine-Grained Multilingual Claim Verification
von: Shcharbakova, Hanna, et al.
Veröffentlicht: (2025)

Less Is More? Examining Fairness in Pruned Large Language Models for Summarising Opinions
von: Huang, Nannan, et al.
Veröffentlicht: (2025)

Self-Critique and Refinement for Faithful Natural Language Explanations
von: Wang, Yingming, et al.
Veröffentlicht: (2025)

ATLAS: Improving Lay Summarisation with Attribute-based Control
von: Zhang, Zhihao, et al.
Veröffentlicht: (2024)

Harmonising the Clinical Melody: Tuning Large Language Models for Hospital Course Summarisation in Clinical Coding
von: Bi, Bokang, et al.
Veröffentlicht: (2024)

Generate, Discriminate, Evolve: Enhancing Context Faithfulness via Fine-Grained Sentence-Level Self-Evolution
von: Li, Kun, et al.
Veröffentlicht: (2025)

MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
von: Sadat, Mobashir, et al.
Veröffentlicht: (2024)

MARS: Multilingual Aspect-centric Review Summarisation
von: Mukku, Sandeep Sricharan, et al.
Veröffentlicht: (2024)

Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond
von: Goldsack, Tomas, et al.
Veröffentlicht: (2025)

Log Summarisation for Defect Evolution Analysis
von: Dolga, Rares, et al.
Veröffentlicht: (2024)

Simple and Effective Baselines for Code Summarisation Evaluation
von: Robinson, Jade, et al.
Veröffentlicht: (2025)

Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models
von: Allu, Uday, et al.
Veröffentlicht: (2024)

Faithful Model Evaluation for Model-Based Metrics
von: Goyal, Palash, et al.
Veröffentlicht: (2023)

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
von: Hong, Giwon, et al.
Veröffentlicht: (2024)

REFER: Mitigating Bias in Opinion Summarisation via Frequency Framed Prompting
von: Huang, Nannan, et al.
Veröffentlicht: (2025)

KPEval: Towards Fine-Grained Semantic-Based Keyphrase Evaluation
von: Wu, Di, et al.
Veröffentlicht: (2023)

Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization using Large Language Models
von: Zhang, Weijia, et al.
Veröffentlicht: (2024)

Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation
von: Zheng, Shunfan, et al.
Veröffentlicht: (2025)

Taiyi: A Bilingual Fine-Tuned Large Language Model for Diverse Biomedical Tasks
von: Luo, Ling, et al.
Veröffentlicht: (2023)

FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models
von: Weng, Zixuan, et al.
Veröffentlicht: (2026)

New Faithfulness-Centric Interpretability Paradigms for Natural Language Processing
von: Madsen, Andreas
Veröffentlicht: (2024)

Fine-Grained Evaluation of Large Vision-Language Models in Autonomous Driving
von: Li, Yue, et al.
Veröffentlicht: (2025)

LNE-Blocking: An Efficient Framework for Contamination Mitigation Evaluation on Large Language Models
von: Hou, Ruijie, et al.
Veröffentlicht: (2025)

C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation
von: Zhang, Xu, et al.
Veröffentlicht: (2025)

ReFF: Reinforcing Format Faithfulness in Language Models across Varied Tasks
von: Yao, Jiashu, et al.
Veröffentlicht: (2024)

Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data
von: Gu, Zihui, et al.
Veröffentlicht: (2024)