:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zarharan, Majid, Wullschleger, Pascal, Kia, Babak Behkam, Pilehvar, Mohammad Taher, Foster, Jennifer
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2405.09454
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FoodTaxo: Generating Food Taxonomies with Large Language Models
by: Wullschleger, Pascal, et al.
Published: (2025)

FarExStance: Explainable Stance Detection for Farsi
by: Zarharan, Majid, et al.
Published: (2024)

Reference-Free Evaluation of Taxonomies
by: Wullschleger, Pascal, et al.
Published: (2025)

Gender Encoding Patterns in Pretrained Language Model Representations
by: Zakizadeh, Mahdi, et al.
Published: (2025)

Exploring State Tracking Capabilities of Large Language Models
by: Rezaee, Kiamehr, et al.
Published: (2025)

Blind Men and the Elephant: Diverse Perspectives on Gender Stereotypes in Benchmark Datasets
by: Zakizadeh, Mahdi, et al.
Published: (2025)

RepMatch: Quantifying Cross-Instance Similarities in Representation Space
by: Modarres, Mohammad Reza, et al.
Published: (2024)

NormXLogit: The Head-on-Top Never Lies
by: Abbasi, Sina, et al.
Published: (2024)

Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking
by: Warren, Greta, et al.
Published: (2025)

Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs
by: Akbar-Tajari, Mohammad, et al.
Published: (2025)

Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
by: Chen, Jingxuan, et al.
Published: (2026)

Synthia: Scalable Grounded Persona Generation from Social Media Data
by: Rahimzadeh, Vahid, et al.
Published: (2025)

TrendFact: A Benchmark for Explainable Hotspot Perception in Fact-Checking with Natural Language Explanation
by: Zhang, Xiaocheng, et al.
Published: (2024)

Tell Me Why: Designing an Explainable LLM-based Dialogue System for Student Problem Behavior Diagnosis
by: Fan, Zhilin, et al.
Published: (2026)

Give Me More Details: Improving Fact-Checking with Latent Retrieval
by: Hu, Xuming, et al.
Published: (2023)

ChartCheck: Explainable Fact-Checking over Real-World Chart Images
by: Akhtar, Mubashara, et al.
Published: (2023)

Generative Large Language Models in Automated Fact-Checking: A Survey
by: Vykopal, Ivan, et al.
Published: (2024)

Automated Fact-Checking of Climate Change Claims with Large Language Models
by: Leippold, Markus, et al.
Published: (2024)

Large Language Models for Multilingual Previously Fact-Checked Claim Detection
by: Vykopal, Ivan, et al.
Published: (2025)

RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict
by: Zeng, Yirong, et al.
Published: (2024)

MedFact: Benchmarking the Fact-Checking Capabilities of Large Language Models on Chinese Medical Texts
by: He, Jiayi, et al.
Published: (2025)

PerCul: A Story-Driven Cultural Evaluation of LLMs in Persian
by: Monazzah, Erfan Moosavi, et al.
Published: (2025)

Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models
by: Li, Miaoran, et al.
Published: (2023)

Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking
by: Lin, Hongzhan, et al.
Published: (2026)

Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation
by: To, Long Truong, et al.
Published: (2024)

Multimodal Large Language Models to Support Real-World Fact-Checking
by: Geng, Jiahui, et al.
Published: (2024)

Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency
by: Wang, Haoran, et al.
Published: (2026)

ClaimCheck: Real-Time Fact-Checking with Small Language Models
by: Putta, Akshith Reddy, et al.
Published: (2025)

RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)

MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
by: Wang, Shengkang, et al.
Published: (2024)

MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
by: Marcuzzo, Matteo, et al.
Published: (2025)

Pun Unintended: LLMs and the Illusion of Humor Understanding
by: Zangari, Alessandro, et al.
Published: (2025)

FactSim: Fact-Checking for Opinion Summarization
by: Anghinoni, Leandro, et al.
Published: (2026)

LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models
by: Tran, Hieu, et al.
Published: (2024)

TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking
by: Hang, Ching Nam, et al.
Published: (2025)

REFLEX: Self-Refining Explainable Fact-Checking via Verdict-Anchored Style Control
by: Kong, Chuyi, et al.
Published: (2025)

Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate
by: Kim, Kyungha, et al.
Published: (2024)

(Fact) Check Your Bias
by: Bakke, Eivind Morris, et al.
Published: (2025)

FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
by: Lin, Hongzhan, et al.
Published: (2025)

HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
by: Vladika, Juraj, et al.
Published: (2023)