Library Catalog

Bibliographic Details
Main Authors:	Liu, Xin; Wang, Lu
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2604.12046

Similar Items

Only Say What You Know: Calibration-Aware Generation for Long-Form Factuality
by: Luo, Wen, et al.
Published: (2026)

Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding
by: Liu, Xin, et al.
Published: (2024)

VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts
by: Liu, Xin, et al.
Published: (2025)

Think-While-Generating: On-the-Fly Reasoning for Personalized Long-Form Generation
by: Wang, Chengbing, et al.
Published: (2025)

DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation
by: Wanner, Miriam, et al.
Published: (2024)

FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs
by: Wan, Yingjia, et al.
Published: (2025)

Beyond Precision: Importance-Aware Recall for Factuality Evaluation in Long-Form LLM Generation
by: Jafari, Nazanin, et al.
Published: (2026)

Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown
by: Tu, Lifu, et al.
Published: (2024)

FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models
by: Marinescu, Radu, et al.
Published: (2025)

Linguistic Calibration of Long-Form Generations
by: Band, Neil, et al.
Published: (2024)

Atomic Calibration of LLMs in Long-Form Generations
by: Zhang, Caiqi, et al.
Published: (2024)

How Does Response Length Affect Long-Form Factuality
by: Zhao, James Xu, et al.
Published: (2025)

AdaThink-Med: Medical Adaptive Thinking with Uncertainty-Guided Length Calibration
by: Rui, Shaohao, et al.
Published: (2025)

Benchmarking Uncertainty Calibration in Large Language Model Long-Form Question Answering
by: Müller, Philip, et al.
Published: (2026)

UNCLE: Benchmarking Uncertainty Expressions in Long-Form Generation
by: Yang, Ruihan, et al.
Published: (2025)

Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations
by: Chiang, Cheng-Han, et al.
Published: (2024)

Fine-Grained Self-Endorsement Improves Factuality and Reasoning
by: Wang, Ante, et al.
Published: (2024)

ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty
by: Zong, Qing, et al.
Published: (2024)

Knowledge-Level Consistency Reinforcement Learning: Dual-Fact Alignment for Long-Form Factuality
by: Li, Junliang, et al.
Published: (2025)

Learning to Reason for Long-Form Story Generation
by: Gurung, Alexander, et al.
Published: (2025)

LitCab: Lightweight Language Model Calibration over Short- and Long-form Responses
by: Liu, Xin, et al.
Published: (2023)

The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)

Learning to Reason for Factuality
by: Chen, Xilun, et al.
Published: (2025)

FACTORY: A Challenging Human-Verified Prompt Set for Long-Form Factuality
by: Chen, Mingda, et al.
Published: (2025)

Self-Improving Multilingual Long Reasoning via Translation-Reasoning Integrated Training
by: Liu, Junxiao, et al.
Published: (2026)

Reinforced Informativeness Optimization for Long-Form Retrieval-Augmented Generation
by: Wang, Yuhao, et al.
Published: (2025)

How Long Reasoning Chains Influence LLMs' Judgment of Answer Factuality
by: Tu, Minzhu, et al.
Published: (2026)

Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form Text Generation
by: Samarinas, Chris, et al.
Published: (2025)

Semantic Consistency-Based Uncertainty Quantification for Factuality in Radiology Report Generation
by: Wang, Chenyu, et al.
Published: (2024)

Think Deep, Not Just Long: Measuring LLM Reasoning Effort via Deep-Thinking Tokens
by: Chen, Wei-Lin, et al.
Published: (2026)

OLAPH: Improving Factuality in Biomedical Long-form Question Answering
by: Jeong, Minbyul, et al.
Published: (2024)

The Curious Case of Factual (Mis)Alignment between LLMs' Short- and Long-Form Answers
by: Islam, Saad Obaid ul, et al.
Published: (2025)

Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation
by: Chen, Xiangyan, et al.
Published: (2025)

Conformal Linguistic Calibration: Trading-off between Factuality and Specificity
by: Jiang, Zhengping, et al.
Published: (2025)

Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
by: Xiang, Yang, et al.
Published: (2025)

Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
by: Nair, Lakshmi, et al.
Published: (2025)

STOP: Structured On-Policy Pruning of Long-Form Reasoning in Low-Data Regimes
by: Xu, Chenjun, et al.
Published: (2026)

Think-Augmented Function Calling: Improving LLM Parameter Accuracy Through Embedded Reasoning
by: Wei, Lei, et al.
Published: (2026)

Beyond Factual QA: Mentorship-Oriented Question Answering over Long-Form Multilingual Content
by: Bhalerao, Parth, et al.
Published: (2026)

MAD-Fact: A Multi-Agent Debate Framework for Long-Form Factuality Evaluation in LLMs
by: Ning, Yucheng, et al.
Published: (2025)