Saved in:
| Main Authors: | Wang, Yuxia, Reddy, Revanth Gangi, Mujahid, Zain Muhammad, Arora, Arnav, Rubashevskii, Aleksandr, Geng, Jiahui, Afzal, Osama Mohammed, Pan, Liangming, Borenstein, Nadav, Pillai, Aditya, Augenstein, Isabelle, Gurevych, Iryna, Nakov, Preslav |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.09000 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating Human Values in Online Communities
by: Borenstein, Nadav, et al.
Published: (2024)
by: Borenstein, Nadav, et al.
Published: (2024)
Beyond "Not Novel Enough": Enriching Scholarly Critique with LLM-Assisted Feedback
by: Afzal, Osama Mohammed, et al.
Published: (2025)
by: Afzal, Osama Mohammed, et al.
Published: (2025)
Multimodal Large Language Models to Support Real-World Fact-Checking
by: Geng, Jiahui, et al.
Published: (2024)
by: Geng, Jiahui, et al.
Published: (2024)
Can Community Notes Replace Professional Fact-Checkers?
by: Borenstein, Nadav, et al.
Published: (2025)
by: Borenstein, Nadav, et al.
Published: (2025)
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
by: Iqbal, Hasan, et al.
Published: (2024)
by: Iqbal, Hasan, et al.
Published: (2024)
Revealing Fine-Grained Values and Opinions in Large Language Models
by: Wright, Dustin, et al.
Published: (2024)
by: Wright, Dustin, et al.
Published: (2024)
FIRE: Fact-checking with Iterative Retrieval and Verification
by: Xie, Zhuohan, et al.
Published: (2024)
by: Xie, Zhuohan, et al.
Published: (2024)
A Survey of Confidence Estimation and Calibration in Large Language Models
by: Geng, Jiahui, et al.
Published: (2023)
by: Geng, Jiahui, et al.
Published: (2023)
BiasGym: A Simple and Generalizable Framework for Analyzing and Removing Biases through Elicitation
by: Islam, Sekh Mainul, et al.
Published: (2025)
by: Islam, Sekh Mainul, et al.
Published: (2025)
Adaptive Conformal Prediction for Improving Factuality of Generations by Large Language Models
by: Rubashevskii, Aleksandr, et al.
Published: (2026)
by: Rubashevskii, Aleksandr, et al.
Published: (2026)
Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts
by: Mujahid, Zain Muhammad, et al.
Published: (2025)
by: Mujahid, Zain Muhammad, et al.
Published: (2025)
Con Instruction: Universal Jailbreaking of Multimodal Large Language Models via Non-Textual Modalities
by: Geng, Jiahui, et al.
Published: (2025)
by: Geng, Jiahui, et al.
Published: (2025)
Co-FactChecker: A Framework for Human-AI Collaborative Claim Verification Using Large Reasoning Models
by: Sahnan, Dhruv, et al.
Published: (2026)
by: Sahnan, Dhruv, et al.
Published: (2026)
AICD Bench: A Challenging Benchmark for AI-Generated Code Detection
by: Orel, Daniil, et al.
Published: (2026)
by: Orel, Daniil, et al.
Published: (2026)
Stress Testing Factual Consistency Metrics for Long-Document Summarization
by: Mujahid, Zain Muhammad, et al.
Published: (2025)
by: Mujahid, Zain Muhammad, et al.
Published: (2025)
OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs
by: Wang, Yuxia, et al.
Published: (2024)
by: Wang, Yuxia, et al.
Published: (2024)
ConspirED: A Dataset for Cognitive Traits of Conspiracy Theories and Large Language Model Safety
by: Bates, Luke, et al.
Published: (2025)
by: Bates, Luke, et al.
Published: (2025)
Missci: Reconstructing Fallacies in Misrepresented Science
by: Glockner, Max, et al.
Published: (2024)
by: Glockner, Max, et al.
Published: (2024)
$\texttt{Droid}$: A Resource Suite for AI-Generated Code Detection
by: Orel, Daniil, et al.
Published: (2025)
by: Orel, Daniil, et al.
Published: (2025)
Grounding Fallacies Misrepresenting Scientific Publications in Evidence
by: Glockner, Max, et al.
Published: (2024)
by: Glockner, Max, et al.
Published: (2024)
Can Transformers Learn $n$-gram Language Models?
by: Svete, Anej, et al.
Published: (2024)
by: Svete, Anej, et al.
Published: (2024)
A Template Is All You Meme
by: Bates, Luke, et al.
Published: (2023)
by: Bates, Luke, et al.
Published: (2023)
Probing Pre-Trained Language Models for Cross-Cultural Differences in Values
by: Arora, Arnav, et al.
Published: (2022)
by: Arora, Arnav, et al.
Published: (2022)
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
by: Kaffee, Lucie-Aimée, et al.
Published: (2023)
by: Kaffee, Lucie-Aimée, et al.
Published: (2023)
Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation
by: Fadeeva, Ekaterina, et al.
Published: (2025)
by: Fadeeva, Ekaterina, et al.
Published: (2025)
M4FC: a Multimodal, Multilingual, Multicultural, Multitask Real-World Fact-Checking Dataset
by: Geng, Jiahui, et al.
Published: (2025)
by: Geng, Jiahui, et al.
Published: (2025)
Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency
by: Vashurin, Roman, et al.
Published: (2025)
by: Vashurin, Roman, et al.
Published: (2025)
Revisiting Noise in Natural Language Processing for Computational Social Science
by: Borenstein, Nadav
Published: (2025)
by: Borenstein, Nadav
Published: (2025)
Community Moderation and the New Epistemology of Fact Checking on Social Media
by: Augenstein, Isabelle, et al.
Published: (2025)
by: Augenstein, Isabelle, et al.
Published: (2025)
Unstructured Evidence Attribution for Long Context Query Focused Summarization
by: Wright, Dustin, et al.
Published: (2025)
by: Wright, Dustin, et al.
Published: (2025)
Can LLMs Automate Fact-Checking Article Writing?
by: Sahnan, Dhruv, et al.
Published: (2025)
by: Sahnan, Dhruv, et al.
Published: (2025)
Rethinking STS and NLI in Large Language Models
by: Wang, Yuxia, et al.
Published: (2023)
by: Wang, Yuxia, et al.
Published: (2023)
Presumed Cultural Identity: How Names Shape LLM Responses
by: Pawar, Siddhesh, et al.
Published: (2025)
by: Pawar, Siddhesh, et al.
Published: (2025)
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
by: Fadeeva, Ekaterina, et al.
Published: (2024)
by: Fadeeva, Ekaterina, et al.
Published: (2024)
Don't Throw Away Your Beams: Improving Consistency-based Uncertainties in LLMs via Beam Search
by: Fadeeva, Ekaterina, et al.
Published: (2025)
by: Fadeeva, Ekaterina, et al.
Published: (2025)
UnsafeChain: Enhancing Reasoning Model Safety via Hard Cases
by: Tomar, Raj Vardhan, et al.
Published: (2025)
by: Tomar, Raj Vardhan, et al.
Published: (2025)
How Does Prefix Matter in Reasoning Model Tuning?
by: Tomar, Raj Vardhan, et al.
Published: (2026)
by: Tomar, Raj Vardhan, et al.
Published: (2026)
M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
by: Wang, Yuxia, et al.
Published: (2024)
by: Wang, Yuxia, et al.
Published: (2024)
From Chaos to Clarity: Claim Normalization to Empower Fact-Checking
by: Sundriyal, Megha, et al.
Published: (2023)
by: Sundriyal, Megha, et al.
Published: (2023)
Multi-Modal Framing Analysis of News
by: Arora, Arnav, et al.
Published: (2025)
by: Arora, Arnav, et al.
Published: (2025)
Similar Items
-
Investigating Human Values in Online Communities
by: Borenstein, Nadav, et al.
Published: (2024) -
Beyond "Not Novel Enough": Enriching Scholarly Critique with LLM-Assisted Feedback
by: Afzal, Osama Mohammed, et al.
Published: (2025) -
Multimodal Large Language Models to Support Real-World Fact-Checking
by: Geng, Jiahui, et al.
Published: (2024) -
Can Community Notes Replace Professional Fact-Checkers?
by: Borenstein, Nadav, et al.
Published: (2025) -
OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs
by: Iqbal, Hasan, et al.
Published: (2024)