Saved in:
| Main Authors: | Ke, Yusong, Lin, Hongru, Ruan, Yuting, Tang, Junya, Li, Li |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.05505 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Conformal Sets in Multiple-Choice Question Answering under Black-Box Settings with Provable Coverage Guarantees
by: Yang, Guang, et al.
Published: (2025)
by: Yang, Guang, et al.
Published: (2025)
Conformal P-Value in Multiple-Choice Question Answering Tasks with Provable Risk Control
by: Ye, Yuanchang
Published: (2025)
by: Ye, Yuanchang
Published: (2025)
Biomedical Entity Linking as Multiple Choice Question Answering
by: Lin, Zhenxi, et al.
Published: (2024)
by: Lin, Zhenxi, et al.
Published: (2024)
Differentiating Choices via Commonality for Multiple-Choice Question Answering
by: Deng, Wenqing, et al.
Published: (2024)
by: Deng, Wenqing, et al.
Published: (2024)
More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering
by: Vu, Duc Anh, et al.
Published: (2025)
by: Vu, Duc Anh, et al.
Published: (2025)
Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Output Prefilling
by: Cappelletti, Silvia, et al.
Published: (2025)
by: Cappelletti, Silvia, et al.
Published: (2025)
Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering
by: Molfese, Francesco Maria, et al.
Published: (2025)
by: Molfese, Francesco Maria, et al.
Published: (2025)
LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering
by: Sutanto, Patrick, et al.
Published: (2024)
by: Sutanto, Patrick, et al.
Published: (2024)
TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction
by: Li, Shuo, et al.
Published: (2023)
by: Li, Shuo, et al.
Published: (2023)
Applying Relation Extraction and Graph Matching to Answering Multiple Choice Questions
by: Shimoda, Naoki, et al.
Published: (2025)
by: Shimoda, Naoki, et al.
Published: (2025)
Hierarchical Vision-Language Reasoning for Multimodal Multiple-Choice Question Answering
by: Zhou, Ao, et al.
Published: (2025)
by: Zhou, Ao, et al.
Published: (2025)
Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs
by: Sanz-Guerrero, Mario, et al.
Published: (2025)
by: Sanz-Guerrero, Mario, et al.
Published: (2025)
Multiple-Choice Questions are Efficient and Robust LLM Evaluators
by: Zhang, Ziyin, et al.
Published: (2024)
by: Zhang, Ziyin, et al.
Published: (2024)
Trustworthy Medical Question Answering: An Evaluation-Centric Survey
by: Wang, Yinuo, et al.
Published: (2025)
by: Wang, Yinuo, et al.
Published: (2025)
To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering
by: Zhan, Zaifu, et al.
Published: (2026)
by: Zhan, Zaifu, et al.
Published: (2026)
Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
by: Park, Yoonah, et al.
Published: (2025)
by: Park, Yoonah, et al.
Published: (2025)
Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
by: Hou, Yutao, et al.
Published: (2024)
by: Hou, Yutao, et al.
Published: (2024)
A Study on Large Language Models' Limitations in Multiple-Choice Question Answering
by: Khatun, Aisha, et al.
Published: (2024)
by: Khatun, Aisha, et al.
Published: (2024)
Evaluating Small Open LLMs for Medical Question Answering: A Practical Framework
by: Buskila, Avi-ad Avraam
Published: (2026)
by: Buskila, Avi-ad Avraam
Published: (2026)
Enhancing Clinical Multiple-Choice Questions Benchmarks with Knowledge Graph Guided Distractor Generation
by: Yang, Running, et al.
Published: (2025)
by: Yang, Running, et al.
Published: (2025)
Addressing Blind Guessing: Calibration of Selection Bias in Multiple-Choice Question Answering by Video Language Models
by: Loginova, Olga, et al.
Published: (2024)
by: Loginova, Olga, et al.
Published: (2024)
When Models Decide and When They Bind: A Two-Stage Computation for Multiple-Choice Question-Answering
by: Wong, Hugh Mee, et al.
Published: (2026)
by: Wong, Hugh Mee, et al.
Published: (2026)
Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers
by: Wang, Yuhan, et al.
Published: (2026)
by: Wang, Yuhan, et al.
Published: (2026)
Collaboration among Multiple Large Language Models for Medical Question Answering
by: Shang, Kexin, et al.
Published: (2025)
by: Shang, Kexin, et al.
Published: (2025)
MedExQA: Medical Question Answering Benchmark with Multiple Explanations
by: Kim, Yunsoo, et al.
Published: (2024)
by: Kim, Yunsoo, et al.
Published: (2024)
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback
by: Yao, Zonghai, et al.
Published: (2024)
by: Yao, Zonghai, et al.
Published: (2024)
Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction
by: Lee, Yooseop, et al.
Published: (2025)
by: Lee, Yooseop, et al.
Published: (2025)
MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering
by: Li, Feiyang, et al.
Published: (2025)
by: Li, Feiyang, et al.
Published: (2025)
MKRAG: Medical Knowledge Retrieval Augmented Generation for Medical Question Answering
by: Shi, Yucheng, et al.
Published: (2023)
by: Shi, Yucheng, et al.
Published: (2023)
Pattern Recognition or Medical Knowledge? The Problem with Multiple-Choice Questions in Medicine
by: Griot, Maxime, et al.
Published: (2024)
by: Griot, Maxime, et al.
Published: (2024)
Leveraging Inter-Chunk Interactions for Enhanced Retrieval in Large Language Model-Based Question Answering
by: Guo, Tiezheng, et al.
Published: (2024)
by: Guo, Tiezheng, et al.
Published: (2024)
Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems
by: Ji, Yuelyu, et al.
Published: (2025)
by: Ji, Yuelyu, et al.
Published: (2025)
The Role of the Availability Heuristic in Multiple-Choice Answering Behaviour
by: Zotos, Leonidas, et al.
Published: (2026)
by: Zotos, Leonidas, et al.
Published: (2026)
Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study
by: Zhan, Zaifu, et al.
Published: (2026)
by: Zhan, Zaifu, et al.
Published: (2026)
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges
by: Abdellatif, Mohamed Hisham
Published: (2025)
by: Abdellatif, Mohamed Hisham
Published: (2025)
Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering
by: Adlakha, Vaibhav, et al.
Published: (2023)
by: Adlakha, Vaibhav, et al.
Published: (2023)
It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education
by: Singh, Shrutika, et al.
Published: (2025)
by: Singh, Shrutika, et al.
Published: (2025)
Option-ID Based Elimination For Multiple Choice Questions
by: Zhu, Zhenhao, et al.
Published: (2025)
by: Zhu, Zhenhao, et al.
Published: (2025)
Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale
by: Buskila, Avi-ad Avraam
Published: (2026)
by: Buskila, Avi-ad Avraam
Published: (2026)
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration
by: Yu, Han-Cheng, et al.
Published: (2024)
by: Yu, Han-Cheng, et al.
Published: (2024)
Similar Items
-
Conformal Sets in Multiple-Choice Question Answering under Black-Box Settings with Provable Coverage Guarantees
by: Yang, Guang, et al.
Published: (2025) -
Conformal P-Value in Multiple-Choice Question Answering Tasks with Provable Risk Control
by: Ye, Yuanchang
Published: (2025) -
Biomedical Entity Linking as Multiple Choice Question Answering
by: Lin, Zhenxi, et al.
Published: (2024) -
Differentiating Choices via Commonality for Multiple-Choice Question Answering
by: Deng, Wenqing, et al.
Published: (2024) -
More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering
by: Vu, Duc Anh, et al.
Published: (2025)