| Main Authors: | Zarharan, Majid, Wullschleger, Pascal, Kia, Babak Behkam, Pilehvar, Mohammad Taher, Foster, Jennifer |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2405.09454 |
Similar Items
FoodTaxo: Generating Food Taxonomies with Large Language Models
by: Wullschleger, Pascal, et al.
Published: (2025)
FarExStance: Explainable Stance Detection for Farsi
by: Zarharan, Majid, et al.
Published: (2024)
Reference-Free Evaluation of Taxonomies
by: Wullschleger, Pascal, et al.
Published: (2025)
Gender Encoding Patterns in Pretrained Language Model Representations
by: Zakizadeh, Mahdi, et al.
Published: (2025)
Exploring State Tracking Capabilities of Large Language Models
by: Rezaee, Kiamehr, et al.
Published: (2025)
Blind Men and the Elephant: Diverse Perspectives on Gender Stereotypes in Benchmark Datasets
by: Zakizadeh, Mahdi, et al.
Published: (2025)
RepMatch: Quantifying Cross-Instance Similarities in Representation Space
by: Modarres, Mohammad Reza, et al.
Published: (2024)
NormXLogit: The Head-on-Top Never Lies
by: Abbasi, Sina, et al.
Published: (2024)
Show Me the Work: Fact-Checkers' Requirements for Explainable Automated Fact-Checking
by: Warren, Greta, et al.
Published: (2025)
Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs
by: Akbar-Tajari, Mohammad, et al.
Published: (2025)
Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
by: Chen, Jingxuan, et al.
Published: (2026)
Synthia: Scalable Grounded Persona Generation from Social Media Data
by: Rahimzadeh, Vahid, et al.
Published: (2025)
TrendFact: A Benchmark for Explainable Hotspot Perception in Fact-Checking with Natural Language Explanation
by: Zhang, Xiaocheng, et al.
Published: (2024)
Tell Me Why: Designing an Explainable LLM-based Dialogue System for Student Problem Behavior Diagnosis
by: Fan, Zhilin, et al.
Published: (2026)
Give Me More Details: Improving Fact-Checking with Latent Retrieval
by: Hu, Xuming, et al.
Published: (2023)
ChartCheck: Explainable Fact-Checking over Real-World Chart Images
by: Akhtar, Mubashara, et al.
Published: (2023)
Generative Large Language Models in Automated Fact-Checking: A Survey
by: Vykopal, Ivan, et al.
Published: (2024)
Automated Fact-Checking of Climate Change Claims with Large Language Models
by: Leippold, Markus, et al.
Published: (2024)
Large Language Models for Multilingual Previously Fact-Checked Claim Detection
by: Vykopal, Ivan, et al.
Published: (2025)
RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict
by: Zeng, Yirong, et al.
Published: (2024)
MedFact: Benchmarking the Fact-Checking Capabilities of Large Language Models on Chinese Medical Texts
by: He, Jiayi, et al.
Published: (2025)
PerCul: A Story-Driven Cultural Evaluation of LLMs in Persian
by: Monazzah, Erfan Moosavi, et al.
Published: (2025)
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models
by: Li, Miaoran, et al.
Published: (2023)
Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking
by: Lin, Hongzhan, et al.
Published: (2026)
Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation
by: To, Long Truong, et al.
Published: (2024)
Multimodal Large Language Models to Support Real-World Fact-Checking
by: Geng, Jiahui, et al.
Published: (2024)
Fact-Checking with Large Language Models via Probabilistic Certainty and Consistency
by: Wang, Haoran, et al.
Published: (2026)
ClaimCheck: Real-Time Fact-Checking with Small Language Models
by: Putta, Akshith Reddy, et al.
Published: (2025)
RealFactBench: A Benchmark for Evaluating Large Language Models in Real-World Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
by: Wang, Shengkang, et al.
Published: (2024)
MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
by: Marcuzzo, Matteo, et al.
Published: (2025)
Pun Unintended: LLMs and the Illusion of Humor Understanding
by: Zangari, Alessandro, et al.
Published: (2025)
FactSim: Fact-Checking for Opinion Summarization
by: Anghinoni, Leandro, et al.
Published: (2026)
LEAF: Learning and Evaluation Augmented by Fact-Checking to Improve Factualness in Large Language Models
by: Tran, Hieu, et al.
Published: (2024)
TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking
by: Hang, Ching Nam, et al.
Published: (2025)
REFLEX: Self-Refining Explainable Fact-Checking via Verdict-Anchored Style Control
by: Kong, Chuyi, et al.
Published: (2025)
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate
by: Kim, Kyungha, et al.
Published: (2024)
(Fact) Check Your Bias
by: Bakke, Eivind Morris, et al.
Published: (2025)
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models
by: Lin, Hongzhan, et al.
Published: (2025)
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
by: Vladika, Juraj, et al.
Published: (2023)