| Main Author: | Baidya, Madhav S |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.04565 |
Similar Items
Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions
by: Baidya, Madhav S., et al.
Published: (2026)
SensorQA: A Question Answering Benchmark for Daily-Life Monitoring
by: Reichman, Benjamin, et al.
Published: (2025)
RAG-BioQA: A Retrieval-Augmented Generation Framework for Long-Form Biomedical Question Answering
by: Panchumarthi, Lovely Yeswanth, et al.
Published: (2025)
Long-Span Question-Answering: Automatic Question Generation and QA-System Ranking via Side-by-Side Evaluation
by: Bohnet, Bernd, et al.
Published: (2024)
pdfQA: Diverse, Challenging, and Realistic Question Answering over PDFs
by: Schimanski, Tobias, et al.
Published: (2026)
QA-TOOLBOX: Conversational Question-Answering for process task guidance in manufacturing
by: Manuvinakurike, Ramesh, et al.
Published: (2024)
MedExQA: Medical Question Answering Benchmark with Multiple Explanations
by: Kim, Yunsoo, et al.
Published: (2024)
FinTextQA: A Dataset for Long-form Financial Question Answering
by: Chen, Jian, et al.
Published: (2024)
Memory-QA: Answering Recall Questions Based on Multimodal Memories
by: Jiang, Hongda, et al.
Published: (2025)
DataFrame QA: A Universal LLM Framework on DataFrame Question Answering Without Data Exposure
by: Ye, Junyi, et al.
Published: (2024)
ExpliCIT-QA: Explainable Code-Based Image Table Question Answering
by: Lagos, Maximiliano Hormazábal, et al.
Published: (2025)
A Semantic-Sampling Framework for Evaluating Calibration in Open-Ended Question Answering
by: Wang, Zhanliang, et al.
Published: (2026)
ResearchQA: Evaluating Scholarly Question Answering at Scale Across 75 Fields with Survey-Mined Questions and Rubrics
by: Yifei, Li S., et al.
Published: (2025)
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
by: Monteiro, Joao, et al.
Published: (2024)
MedEthicsQA: A Comprehensive Question Answering Benchmark for Medical Ethics Evaluation of LLMs
by: Wei, Jianhui, et al.
Published: (2025)
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking
by: Tran, Dien X., et al.
Published: (2025)
MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting
by: Anand, Avinash, et al.
Published: (2024)
MFORT-QA: Multi-hop Few-shot Open Rich Table Question Answering
by: Guan, Che, et al.
Published: (2024)
MapQA: Open-domain Geospatial Question Answering on Map Data
by: Li, Zekun, et al.
Published: (2025)
PeerQA: A Scientific Question Answering Dataset from Peer Reviews
by: Baumgärtner, Tim, et al.
Published: (2025)
Evaluating Monolingual and Multilingual Large Language Models for Greek Question Answering: The DemosQA Benchmark
by: Mastrokostas, Charalampos, et al.
Published: (2026)
Beyond Factual QA: Mentorship-Oriented Question Answering over Long-Form Multilingual Content
by: Bhalerao, Parth, et al.
Published: (2026)
CT2C-QA: Multimodal Question Answering over Chinese Text, Table and Chart
by: Zhao, Bowen, et al.
Published: (2024)
RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering
by: Han, Rujun, et al.
Published: (2024)
WikiMixQA: A Multimodal Benchmark for Question Answering over Tables and Charts
by: Foroutan, Negar, et al.
Published: (2025)
MizanQA: Benchmarking Large Language Models on Moroccan Legal Question Answering
by: Bahaj, Adil, et al.
Published: (2025)
MMToM-QA: Multimodal Theory of Mind Question Answering
by: Jin, Chuanyang, et al.
Published: (2024)
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models
by: Zhu, Andrew, et al.
Published: (2024)
PASemiQA: Plan-Assisted Agent for Question Answering on Semi-Structured Data with Text and Relational Information
by: Yang, Hansi, et al.
Published: (2025)
LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering
by: Zhang, Ran, et al.
Published: (2025)
MedHopQA: A Disease-Centered Multi-Hop Reasoning Benchmark and Evaluation Framework for LLM-Based Biomedical Question Answering
by: Islamaj, Rezarta, et al.
Published: (2026)
A Knowledge-Injected Curriculum Pretraining Framework for Question Answering
by: Lin, Xin, et al.
Published: (2024)
iQUEST: An Iterative Question-Guided Framework for Knowledge Base Question Answering
by: Wang, Shuai, et al.
Published: (2025)
Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action
by: Pan, Zhenyu, et al.
Published: (2024)
QA-Dragon: Query-Aware Dynamic RAG System for Knowledge-Intensive Visual Question Answering
by: Jiang, Zhuohang, et al.
Published: (2025)
BR-TaxQA-R: A Dataset for Question Answering with References for Brazilian Personal Income Tax Law, including case law
by: Júnior, Juvenal Domingos, et al.
Published: (2025)
LLM-MedQA: Enhancing Medical Question Answering through Case Studies in Large Language Models
by: Yang, Hang, et al.
Published: (2024)
Compositional Consistency-Guided Decoding for Three-Way Logical Question Answering
by: Huang, Tianyi, et al.
Published: (2026)
GNN2R: Weakly-Supervised Rationale-Providing Question Answering over Knowledge Graphs
by: Wang, Ruijie, et al.
Published: (2023)
TrustUQA: A Trustful Framework for Unified Structured Data Question Answering
by: Zhang, Wen, et al.
Published: (2024)