| Main Authors: | Özer, Atahan, Yıldız, Çağatay |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.07270 |
Similar Items
Consensus or Conflict? Fine-Grained Evaluation of Conflicting Answers in Question-Answering
by: Nachshoni, Eviatar, et al.
Published: (2025)
From Raw Corpora to Domain Benchmarks: Automated Evaluation of LLM Domain Expertise
by: Sharma, Nitin, et al.
Published: (2025)
Multi-hop Question Answering under Temporal Knowledge Editing
by: Cheng, Keyuan, et al.
Published: (2024)
Mitigating Knowledge Conflicts in Language Model-Driven Question Answering
by: Cao, Han, et al.
Published: (2024)
Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge
by: Rezaei, Mohammad Reza, et al.
Published: (2025)
Self-Improvement Programming for Temporal Knowledge Graph Question Answering
by: Chen, Zhuo, et al.
Published: (2024)
LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments
by: Chen, Ruirui, et al.
Published: (2024)
A Persona-Based Evaluation Framework for Pluralistic Alignment in Generative AI
by: Karagoz, Atahan
Published: (2026)
A Reproduction Study: The Kernel PCA Interpretation of Self-Attention Fails Under Scrutiny
by: Sarıtaş, Karahan, et al.
Published: (2025)
Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering
by: Pusch, Larissa, et al.
Published: (2024)
DebateQA: Evaluating Question Answering on Debatable Knowledge
by: Xu, Rongwu, et al.
Published: (2024)
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
by: He, Jie, et al.
Published: (2024)
Temporal Knowledge Graph Question Answering: A Survey
by: Su, Miao, et al.
Published: (2024)
Open Domain Question Answering with Conflicting Contexts
by: Liu, Siyi, et al.
Published: (2024)
Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations
by: Shaier, Sagi, et al.
Published: (2024)
EvolveRouter: Co-Evolving Routing and Prompt for Multi-Agent Question Answering
by: Huang, Jiatan, et al.
Published: (2026)
Improving TCM Question Answering through Tree-Organized Self-Reflective Retrieval with LLMs
by: Liu, Chang, et al.
Published: (2025)
EvoWiki: Evaluating LLMs on Evolving Knowledge
by: Tang, Wei, et al.
Published: (2024)
SEAL: Self-Evolving Agentic Learning for Conversational Question Answering over Knowledge Graphs
by: Wang, Hao, et al.
Published: (2025)
Knowledgeable Preference Alignment for LLMs in Domain-specific Question Answering
by: Zhang, Yichi, et al.
Published: (2023)
Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering
by: Srivastava, Pragya, et al.
Published: (2024)
Automatic Evaluation of Healthcare LLMs Beyond Question-Answering
by: Arias-Duart, Anna, et al.
Published: (2025)
Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
by: Qian, Xinying, et al.
Published: (2025)
COMPKE: Complex Question Answering under Knowledge Editing
by: Cheng, Keyuan, et al.
Published: (2025)
Temporal Knowledge Question Answering via Abstract Reasoning Induction
by: Chen, Ziyang, et al.
Published: (2023)
Coal Mining Question Answering with LLMs
by: Rivera, Antonio Carlos, et al.
Published: (2024)
Question Answering Over Spatio-Temporal Knowledge Graph
by: Dai, Xinbang, et al.
Published: (2024)
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
by: Xue, Chao, et al.
Published: (2024)
Evaluating Robustness of LLMs in Question Answering on Multilingual Noisy OCR Data
by: Piryani, Bhawna, et al.
Published: (2025)
Continual Learning for Temporal-Sensitive Question Answering
by: Yang, Wanqi, et al.
Published: (2024)
HOLMES: Hyper-Relational Knowledge Graphs for Multi-hop Question Answering using LLMs
by: Panda, Pranoy, et al.
Published: (2024)
Two-stage Generative Question Answering on Temporal Knowledge Graph Using Large Language Models
by: Gao, Yifu, et al.
Published: (2024)
On the Calibration of Multilingual Question Answering LLMs
by: Yang, Yahan, et al.
Published: (2023)
Knowledge Dependency Estimation for Reliable Question Answering
by: Tong, Chaodong, et al.
Published: (2026)
Question-guided Knowledge Graph Re-scoring and Injection for Knowledge Graph Question Answering
by: Zhang, Yu, et al.
Published: (2024)
MQA-KEAL: Multi-hop Question Answering under Knowledge Editing for Arabic Language
by: Ali, Muhammad Asif, et al.
Published: (2024)
Knowledge Extraction on Semi-Structured Content: Does It Remain Relevant for Question Answering in the Era of LLMs?
by: Sun, Kai, et al.
Published: (2025)
Accurate Table Question Answering with Accessible LLMs
by: Jiang, Yangfan, et al.
Published: (2026)
Comprehensive Evaluation for a Large Scale Knowledge Graph Question Answering Service
by: Potdar, Saloni, et al.
Published: (2025)
L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating Knowledge of LLMs in Indic Context
by: Rohera, Pritika, et al.
Published: (2024)