| Main Authors: | Ye, Yuxuan, Simpson, Edwin, Rodriguez, Raul Santos |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2409.15090 |
Similar Items
Teaching Language Models to Check Grounded Claim Factuality with Human Test-Taking Strategies
by: Ye, Yuxuan, et al.
Published: (2026)
mFACE: Multilingual Summarization with Factual Consistency Evaluation
by: Aharoni, Roee, et al.
Published: (2022)
Zero-shot Factual Consistency Evaluation Across Domains
by: Agarwal, Raunak
Published: (2024)
Grounded Visual Factualization: Factual Anchor-Based Finetuning for Enhancing MLLM Factual Consistency
by: Morbiato, Filippo, et al.
Published: (2025)
Less is More for Improving Automatic Evaluation of Factual Consistency
by: Wang, Tong, et al.
Published: (2024)
An Extensive Evaluation of Factual Consistency in Large Language Models for Data-to-Text Generation
by: Mahapatra, Joy, et al.
Published: (2024)
BanglaSummEval: Reference-Free Factual Consistency Evaluation for Bangla Summarization
by: Rafid, Ahmed, et al.
Published: (2026)
FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document
by: Yang, Joonho, et al.
Published: (2024)
SIFiD: Reassess Summary Factual Inconsistency Detection with LLM
by: Yang, Jiuding, et al.
Published: (2024)
MedFactEval and MedAgentBrief: A Framework and Workflow for Generating and Evaluating Factual Clinical Summaries
by: Grolleau, François, et al.
Published: (2025)
Enhancing Factuality through Consensus and Consistency in Summarization Using Minimum Bayes Risk Decoding
by: Soetedjo, Riza Setiawan, et al.
Published: (2026)
Face4RAG: Factual Consistency Evaluation for Retrieval Augmented Generation in Chinese
by: Xu, Yunqi, et al.
Published: (2024)
Self-Consistent Decoding for More Factual Open Responses
by: Malon, Christopher, et al.
Published: (2024)
Factual Consistency of Multilingual Pretrained Language Models
by: Fierro, Constanza, et al.
Published: (2022)
Adapting AlignScore Mertic for Factual Consistency Evaluation of Text in Russian: A Student Abstract
by: Zimin, Mikhail, et al.
Published: (2025)
PlainQAFact: Retrieval-augmented Factual Consistency Evaluation Metric for Biomedical Plain Language Summarization
by: You, Zhiwen, et al.
Published: (2025)
Automated essay scoring in Arabic: a dataset and analysis of a BERT-based system
by: Ghazawi, Rayed, et al.
Published: (2024)
How well can LLMs Grade Essays in Arabic?
by: Ghazawi, Rayed, et al.
Published: (2025)
Learning to Generate Answers with Citations via Factual Consistency Models
by: Aly, Rami, et al.
Published: (2024)
Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models
by: She, Shuaijie, et al.
Published: (2023)
Identifying Factual Inconsistencies in Summaries: Grounding LLM Inference via Task Taxonomy
by: Xu, Liyan, et al.
Published: (2024)
Temporally Consistent Factuality Probing for Large Language Models
by: Bajpai, Ashutosh, et al.
Published: (2024)
ReFEree: Reference-Free and Fine-Grained Method for Evaluating Factual Consistency in Real-World Code Summarization
by: Bae, Suyoung, et al.
Published: (2026)
Do Automatic Factuality Metrics Measure Factuality? A Critical Evaluation
by: Ramprasad, Sanjana, et al.
Published: (2024)
FactEHR: A Dataset for Evaluating Factuality in Clinical Notes Using LLMs
by: Munnangi, Monica, et al.
Published: (2024)
Improving Factual Consistency of News Summarization by Contrastive Preference Optimization
by: Feng, Huawen, et al.
Published: (2023)
SummExecEdit: A Factual Consistency Benchmark in Summarization with Executable Edits
by: Thorat, Onkar, et al.
Published: (2024)
UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing
by: Yang, Yijun, et al.
Published: (2024)
Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form Text Generation
by: Samarinas, Chris, et al.
Published: (2025)
Towards Effective Extraction and Evaluation of Factual Claims
by: Metropolitansky, Dasha, et al.
Published: (2025)
Semantic Consistency-Based Uncertainty Quantification for Factuality in Radiology Report Generation
by: Wang, Chenyu, et al.
Published: (2024)
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models
by: Qi, Jirui, et al.
Published: (2023)
Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation
by: Zhang, Xiaoying, et al.
Published: (2024)
Stress Testing Factual Consistency Metrics for Long-Document Summarization
by: Mujahid, Zain Muhammad, et al.
Published: (2025)
ConsistencyAI: A Benchmark to Assess LLMs' Factual Consistency When Responding to Different Demographic Groups
by: Banyas, Peter, et al.
Published: (2025)
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
by: Yang, Jiayu, et al.
Published: (2025)
Fine-grained and Explainable Factuality Evaluation for Multimodal Summarization
by: Zhang, Yue, et al.
Published: (2024)
DecMetrics: Structured Claim Decomposition Scoring for Factually Consistent LLM Outputs
by: Huang, Minghui
Published: (2025)
AlignCheck: a Semantic Open-Domain Metric for Factual Consistency Assessment
by: Aghaebrahimian, Ahmad
Published: (2025)
Transferable and Efficient Non-Factual Content Detection via Probe Training with Offline Consistency Checking
by: Zhang, Xiaokang, et al.
Published: (2024)