:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Yuxia, Reddy, Revanth Gangi, Mujahid, Zain Muhammad, Arora, Arnav, Rubashevskii, Aleksandr, Geng, Jiahui, Afzal, Osama Mohammed, Pan, Liangming, Borenstein, Nadav, Pillai, Aditya, Augenstein, Isabelle, Gurevych, Iryna, Nakov, Preslav
Format:	Preprint
Published:	2023
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2311.09000
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Investigating Human Values in Online Communities
by: Borenstein, Nadav, et al.
Published: (2024)

Beyond "Not Novel Enough": Enriching Scholarly Critique with LLM-Assisted Feedback
by: Afzal, Osama Mohammed, et al.
Published: (2025)

Multimodal Large Language Models to Support Real-World Fact-Checking
by: Geng, Jiahui, et al.
Published: (2024)

Can Community Notes Replace Professional Fact-Checkers?
by: Borenstein, Nadav, et al.
Published: (2025)

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
by: Iqbal, Hasan, et al.
Published: (2024)

Revealing Fine-Grained Values and Opinions in Large Language Models
by: Wright, Dustin, et al.
Published: (2024)

FIRE: Fact-checking with Iterative Retrieval and Verification
by: Xie, Zhuohan, et al.
Published: (2024)

A Survey of Confidence Estimation and Calibration in Large Language Models
by: Geng, Jiahui, et al.
Published: (2023)

BiasGym: A Simple and Generalizable Framework for Analyzing and Removing Biases through Elicitation
by: Islam, Sekh Mainul, et al.
Published: (2025)

Adaptive Conformal Prediction for Improving Factuality of Generations by Large Language Models
by: Rubashevskii, Aleksandr, et al.
Published: (2026)

Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts
by: Mujahid, Zain Muhammad, et al.
Published: (2025)

Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities
by: Geng, Jiahui, et al.
Published: (2025)

Co-FactChecker: A Framework for Human-AI Collaborative Claim Verification Using Large Reasoning Models
by: Sahnan, Dhruv, et al.
Published: (2026)

AICD Bench: A Challenging Benchmark for AI-Generated Code Detection
by: Orel, Daniil, et al.
Published: (2026)

Stress Testing Factual Consistency Metrics for Long-Document Summarization
by: Mujahid, Zain Muhammad, et al.
Published: (2025)

OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
by: Wang, Yuxia, et al.
Published: (2024)

ConspirED: A Dataset for Cognitive Traits of Conspiracy Theories and Large Language Model Safety
by: Bates, Luke, et al.
Published: (2025)

Missci: Reconstructing Fallacies in Misrepresented Science
by: Glockner, Max, et al.
Published: (2024)

$\texttt{Droid}$: A Resource Suite for AI-Generated Code Detection
by: Orel, Daniil, et al.
Published: (2025)

Grounding Fallacies Misrepresenting Scientific Publications in Evidence
by: Glockner, Max, et al.
Published: (2024)

Can Transformers Learn $n$-gram Language Models?
by: Svete, Anej, et al.
Published: (2024)

A Template Is All You Meme
by: Bates, Luke, et al.
Published: (2023)

Probing Pre-Trained Language Models for Cross-Cultural Differences in Values
by: Arora, Arnav, et al.
Published: (2022)

Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
by: Kaffee, Lucie-Aimée, et al.
Published: (2023)

Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation
by: Fadeeva, Ekaterina, et al.
Published: (2025)

M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset
by: Geng, Jiahui, et al.
Published: (2025)

Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency
by: Vashurin, Roman, et al.
Published: (2025)

Revisiting Noise in Natural Language Processing for Computational Social Science
by: Borenstein, Nadav
Published: (2025)

Community Moderation and the New Epistemology of Fact Checking on Social Media
by: Augenstein, Isabelle, et al.
Published: (2025)

Unstructured Evidence Attribution for Long Context Query Focused Summarization
by: Wright, Dustin, et al.
Published: (2025)

Can LLMs Automate Fact-Checking Article Writing?
by: Sahnan, Dhruv, et al.
Published: (2025)

Rethinking STS and NLI in Large Language Models
by: Wang, Yuxia, et al.
Published: (2023)

Presumed Cultural Identity: How Names Shape LLM Responses
by: Pawar, Siddhesh, et al.
Published: (2025)

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
by: Fadeeva, Ekaterina, et al.
Published: (2024)

Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search
by: Fadeeva, Ekaterina, et al.
Published: (2025)

UnsafeChain: Enhancing Reasoning Model Safety via Hard Cases
by: Tomar, Raj Vardhan, et al.
Published: (2025)

How Does Prefix Matter in Reasoning Model Tuning?
by: Tomar, Raj Vardhan, et al.
Published: (2026)

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
by: Wang, Yuxia, et al.
Published: (2024)

From Chaos to Clarity: Claim Normalization to Empower Fact-Checking
by: Sundriyal, Megha, et al.
Published: (2023)

Multi-Modal Framing Analysis of News
by: Arora, Arnav, et al.
Published: (2025)