:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ke, Yusong, Lin, Hongru, Ruan, Yuting, Tang, Junya, Li, Li
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2503.05505
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Conformal Sets in Multiple-Choice Question Answering under Black-Box Settings with Provable Coverage Guarantees
by: Yang, Guang, et al.
Published: (2025)

Conformal P-Value in Multiple-Choice Question Answering Tasks with Provable Risk Control
by: Ye, Yuanchang
Published: (2025)

Biomedical Entity Linking as Multiple Choice Question Answering
by: Lin, Zhenxi, et al.
Published: (2024)

Differentiating Choices via Commonality for Multiple-Choice Question Answering
by: Deng, Wenqing, et al.
Published: (2024)

More Bias, Less Bias: BiasPrompting for Enhanced Multiple-Choice Question Answering
by: Vu, Duc Anh, et al.
Published: (2025)

Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Output Prefilling
by: Cappelletti, Silvia, et al.
Published: (2025)

Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering
by: Molfese, Francesco Maria, et al.
Published: (2025)

LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering
by: Sutanto, Patrick, et al.
Published: (2024)

TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction
by: Li, Shuo, et al.
Published: (2023)

Applying Relation Extraction and Graph Matching to Answering Multiple Choice Questions
by: Shimoda, Naoki, et al.
Published: (2025)

Hierarchical Vision-Language Reasoning for Multimodal Multiple-Choice Question Answering
by: Zhou, Ao, et al.
Published: (2025)

Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs
by: Sanz-Guerrero, Mario, et al.
Published: (2025)

Multiple-Choice Questions are Efficient and Robust LLM Evaluators
by: Zhang, Ziyin, et al.
Published: (2024)

Trustworthy Medical Question Answering: An Evaluation-Centric Survey
by: Wang, Yinuo, et al.
Published: (2025)

To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering
by: Zhan, Zaifu, et al.
Published: (2026)

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions
by: Park, Yoonah, et al.
Published: (2025)

Compound-QA: A Benchmark for Evaluating LLMs on Compound Questions
by: Hou, Yutao, et al.
Published: (2024)

A Study on Large Language Models' Limitations in Multiple-Choice Question Answering
by: Khatun, Aisha, et al.
Published: (2024)

Evaluating Small Open LLMs for Medical Question Answering: A Practical Framework
by: Buskila, Avi-ad Avraam
Published: (2026)

Enhancing Clinical Multiple-Choice Questions Benchmarks with Knowledge Graph Guided Distractor Generation
by: Yang, Running, et al.
Published: (2025)

Addressing Blind Guessing: Calibration of Selection Bias in Multiple-Choice Question Answering by Video Language Models
by: Loginova, Olga, et al.
Published: (2024)

When Models Decide and When They Bind: A Two-Stage Computation for Multiple-Choice Question-Answering
by: Wong, Hugh Mee, et al.
Published: (2026)

Evaluating and Calibrating LLM Confidence on Questions with Multiple Correct Answers
by: Wang, Yuhan, et al.
Published: (2026)

Collaboration among Multiple Large Language Models for Medical Question Answering
by: Shang, Kexin, et al.
Published: (2025)

MedExQA: Medical Question Answering Benchmark with Multiple Explanations
by: Kim, Yunsoo, et al.
Published: (2024)

MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback
by: Yao, Zonghai, et al.
Published: (2024)

Generating Plausible Distractors for Multiple-Choice Questions via Student Choice Prediction
by: Lee, Yooseop, et al.
Published: (2025)

MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering
by: Li, Feiyang, et al.
Published: (2025)

MKRAG: Medical Knowledge Retrieval Augmented Generation for Medical Question Answering
by: Shi, Yucheng, et al.
Published: (2023)

Pattern Recognition or Medical Knowledge? The Problem with Multiple-Choice Questions in Medicine
by: Griot, Maxime, et al.
Published: (2024)

Leveraging Inter-Chunk Interactions for Enhanced Retrieval in Large Language Model-Based Question Answering
by: Guo, Tiezheng, et al.
Published: (2024)

Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems
by: Ji, Yuelyu, et al.
Published: (2025)

The Role of the Availability Heuristic in Multiple-Choice Answering Behaviour
by: Zotos, Leonidas, et al.
Published: (2026)

Can Large Language Models Self-Correct in Medical Question Answering? An Exploratory Study
by: Zhan, Zaifu, et al.
Published: (2026)

(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges
by: Abdellatif, Mohamed Hisham
Published: (2025)

Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering
by: Adlakha, Vaibhav, et al.
Published: (2023)

It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education
by: Singh, Shrutika, et al.
Published: (2025)

Option-ID Based Elimination For Multiple Choice Questions
by: Zhu, Zhenhao, et al.
Published: (2025)

Domain Fine-Tuning vs. Retrieval-Augmented Generation for Medical Multiple-Choice Question Answering: A Controlled Comparison at the 4B-Parameter Scale
by: Buskila, Avi-ad Avraam
Published: (2026)

Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration
by: Yu, Han-Cheng, et al.
Published: (2024)