Saved in:
| Main Authors: | Koupaee, Mahnaz, Vincent, Jake W., Mansour, Saab, Shalyminov, Igor, He, Han, Song, Hwanjun, Shu, Raphael, He, Jianfeng, Nian, Yi, Wong, Amy Wing-mei, Han, Kyu J., Su, Hang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.08514 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
by: He, Jianfeng, et al.
Published: (2024)
by: He, Jianfeng, et al.
Published: (2024)
FineSurE: Fine-grained Summarization Evaluation using LLMs
by: Song, Hwanjun, et al.
Published: (2024)
by: Song, Hwanjun, et al.
Published: (2024)
Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
by: Zhang, Yuwei, et al.
Published: (2024)
by: Zhang, Yuwei, et al.
Published: (2024)
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
by: Tang, Liyan, et al.
Published: (2024)
by: Tang, Liyan, et al.
Published: (2024)
Controllable Conversational Theme Detection Track at DSTC 12
by: Shalyminov, Igor, et al.
Published: (2025)
by: Shalyminov, Igor, et al.
Published: (2025)
CERET: Cost-Effective Extrinsic Refinement for Text Generation
by: Cai, Jason, et al.
Published: (2024)
by: Cai, Jason, et al.
Published: (2024)
MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets
by: Aboutalebi, Hossein, et al.
Published: (2024)
by: Aboutalebi, Hossein, et al.
Published: (2024)
MDSEval: A Meta-Evaluation Benchmark for Multimodal Dialogue Summarization
by: Liu, Yinhong, et al.
Published: (2025)
by: Liu, Yinhong, et al.
Published: (2025)
GLEAN: Active Generalized Category Discovery with Diverse LLM Feedback
by: Zou, Henry Peng, et al.
Published: (2025)
by: Zou, Henry Peng, et al.
Published: (2025)
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights
by: Mao, Shunqi, et al.
Published: (2024)
by: Mao, Shunqi, et al.
Published: (2024)
Dissociation of Faithful and Unfaithful Reasoning in LLMs
by: Yee, Evelyn, et al.
Published: (2024)
by: Yee, Evelyn, et al.
Published: (2024)
The Gray Zone of Faithfulness: Taming Ambiguity in Unfaithfulness Detection
by: Ding, Qiang, et al.
Published: (2025)
by: Ding, Qiang, et al.
Published: (2025)
Multilingual Self-Taught Faithfulness Evaluators
by: Alfano, Carlo, et al.
Published: (2025)
by: Alfano, Carlo, et al.
Published: (2025)
Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries
by: Lee, Yi-Hui, et al.
Published: (2025)
by: Lee, Yi-Hui, et al.
Published: (2025)
Causal Graph based Event Reasoning using Semantic Relation Experts
by: Koupaee, Mahnaz, et al.
Published: (2025)
by: Koupaee, Mahnaz, et al.
Published: (2025)
MuSciClaims: Multimodal Scientific Claim Verification
by: Lal, Yash Kumar, et al.
Published: (2025)
by: Lal, Yash Kumar, et al.
Published: (2025)
Rethinking LLM-Based Recommendations: A Personalized Query-Driven Parallel Integration
by: Han, Donghee, et al.
Published: (2025)
by: Han, Donghee, et al.
Published: (2025)
Alignment Tuning for Large Language Models: A Data-Centric Lens on Alignment Data Pipelines
by: Song, Hwanjun
Published: (2026)
by: Song, Hwanjun
Published: (2026)
Structured List-Grounded Question Answering
by: Sung, Mujeen, et al.
Published: (2024)
by: Sung, Mujeen, et al.
Published: (2024)
Auditing Stance Asymmetry in Generative Explanations
by: Han, Jiarui
Published: (2026)
by: Han, Jiarui
Published: (2026)
$\texttt{DIAMONDs}$: A Dataset for $\mathbb{D}$ynamic $\mathbb{I}$nformation $\mathbb{A}$nd $\mathbb{M}$ental modeling $\mathbb{O}$f $\mathbb{N}$umeric $\mathbb{D}$iscussions
by: Ghosh, Sayontan, et al.
Published: (2025)
by: Ghosh, Sayontan, et al.
Published: (2025)
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
by: Fang, Yi, et al.
Published: (2024)
by: Fang, Yi, et al.
Published: (2024)
The Dialectics of Faith in the Poetry of José Bergamín
by: Wing, Helen
Published: (2025)
by: Wing, Helen
Published: (2025)
Chain-of-Thought Unfaithfulness as Disguised Accuracy
by: Bentham, Oliver, et al.
Published: (2024)
by: Bentham, Oliver, et al.
Published: (2024)
FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering
by: Sui, Yuan, et al.
Published: (2024)
by: Sui, Yuan, et al.
Published: (2024)
Learning to Verify Summary Facts with Fine-Grained LLM Feedback
by: Oh, Jihwan, et al.
Published: (2024)
by: Oh, Jihwan, et al.
Published: (2024)
DSCD-Nav: Dual-Stance Cooperative Debate for Object Navigation
by: An, Weitao, et al.
Published: (2026)
by: An, Weitao, et al.
Published: (2026)
Peacemaker or Troublemaker: How Sycophancy Shapes Multi-Agent Debate
by: Yao, Binwei, et al.
Published: (2025)
by: Yao, Binwei, et al.
Published: (2025)
Exploring Vision Language Models for Multimodal and Multilingual Stance Detection
by: Vasilakes, Jake, et al.
Published: (2025)
by: Vasilakes, Jake, et al.
Published: (2025)
RADIUS: Ranking, Distribution, and Significance - A Comprehensive Alignment Suite for Survey Simulation
by: Łajewska, Weronika, et al.
Published: (2026)
by: Łajewska, Weronika, et al.
Published: (2026)
Hierarchical Multi-field Representations for Two-Stage E-commerce Retrieval
by: Freymuth, Niklas, et al.
Published: (2025)
by: Freymuth, Niklas, et al.
Published: (2025)
Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition
by: Sheth, Ivaxi, et al.
Published: (2026)
by: Sheth, Ivaxi, et al.
Published: (2026)
MM-StanceDet: Retrieval-Augmented Multi-modal Multi-agent Stance Detection
by: Lu, Weihai, et al.
Published: (2026)
by: Lu, Weihai, et al.
Published: (2026)
IFDID: Information Filter upon Diversity-Improved Decoding for Diversity-Faithfulness Tradeoff in NLG
by: Meng, Han, et al.
Published: (2022)
by: Meng, Han, et al.
Published: (2022)
Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence
by: Bang, Seunghwan, et al.
Published: (2026)
by: Bang, Seunghwan, et al.
Published: (2026)
LLM-based User Profile Management for Recommender System
by: Bang, Seunghwan, et al.
Published: (2025)
by: Bang, Seunghwan, et al.
Published: (2025)
SCRum-9: Multilingual Stance Classification over Rumours on Social Media
by: Li, Yue, et al.
Published: (2025)
by: Li, Yue, et al.
Published: (2025)
Breaking Event Rumor Detection via Stance-Separated Multi-Agent Debate
by: Zhang, Mingqing, et al.
Published: (2024)
by: Zhang, Mingqing, et al.
Published: (2024)
UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs
by: Lee, Yuho, et al.
Published: (2024)
by: Lee, Yuho, et al.
Published: (2024)
Zero-Shot Conversational Stance Detection: Dataset and Approaches
by: Ding, Yuzhe, et al.
Published: (2025)
by: Ding, Yuzhe, et al.
Published: (2025)
Similar Items
-
Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection
by: He, Jianfeng, et al.
Published: (2024) -
FineSurE: Fine-grained Summarization Evaluation using LLMs
by: Song, Hwanjun, et al.
Published: (2024) -
Can Your Model Tell a Negation from an Implicature? Unravelling Challenges With Intent Encoders
by: Zhang, Yuwei, et al.
Published: (2024) -
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
by: Tang, Liyan, et al.
Published: (2024) -
Controllable Conversational Theme Detection Track at DSTC 12
by: Shalyminov, Igor, et al.
Published: (2025)