Saved in:
| Main Authors: | Abacha, Asma Ben, Yim, Wen-wai, Fu, Yujuan, Sun, Zhaoyi, Yetisgen, Meliha, Xia, Fei, Lin, Thomas |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.19260 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering
by: Yim, Wen-wai, et al.
Published: (2025)
by: Yim, Wen-wai, et al.
Published: (2025)
RADAR: A Multimodal Benchmark for 3D Image-Based Radiology Report Review
by: Sun, Zhaoyi, et al.
Published: (2026)
by: Sun, Zhaoyi, et al.
Published: (2026)
DermaVQA-DAS: Dermatology Assessment Schema (DAS) & Datasets for Closed-Ended Question Answering & Segmentation in Patient-Generated Dermatology Images
by: Yim, Wen-wai, et al.
Published: (2025)
by: Yim, Wen-wai, et al.
Published: (2025)
A Scoping Review of Natural Language Processing in Addressing Medically Inaccurate Information: Errors, Misinformation, and Hallucination
by: Sun, Zhaoyi, et al.
Published: (2025)
by: Sun, Zhaoyi, et al.
Published: (2025)
Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
by: Fu, Yujuan, et al.
Published: (2024)
by: Fu, Yujuan, et al.
Published: (2024)
A Systematic Analysis of Large Language Models with RAG-enabled Dynamic Prompting for Medical Error Detection and Correction
by: Ahmed, Farzad, et al.
Published: (2025)
by: Ahmed, Farzad, et al.
Published: (2025)
BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning
by: Fu, Yujuan Velvin, et al.
Published: (2024)
by: Fu, Yujuan Velvin, et al.
Published: (2024)
CACER: Clinical Concept Annotations for Cancer Events and Relations
by: Fu, Yujuan, et al.
Published: (2024)
by: Fu, Yujuan, et al.
Published: (2024)
Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods
by: Fu, Yujuan, et al.
Published: (2024)
by: Fu, Yujuan, et al.
Published: (2024)
RadTimeline: Timeline Summarization for Longitudinal Radiological Lung Findings
by: Zhou, Sitong, et al.
Published: (2026)
by: Zhou, Sitong, et al.
Published: (2026)
UW-BioNLP at ChemoTimelines 2025: Thinking, Fine-Tuning, and Dictionary-Enhanced LLM Systems for Chemotherapy Timeline Extraction
by: Zhang, Tianmai M., et al.
Published: (2025)
by: Zhang, Tianmai M., et al.
Published: (2025)
Automated Identification of Incidentalomas Requiring Follow-Up: A Multi-Anatomy Evaluation of LLM-Based and Supervised Approaches
by: Park, Namu, et al.
Published: (2025)
by: Park, Namu, et al.
Published: (2025)
Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease
by: Dobbins, Nic, et al.
Published: (2025)
by: Dobbins, Nic, et al.
Published: (2025)
Identifying Imaging Follow-Up in Radiology Reports: A Comparative Analysis of Traditional ML and LLM Approaches
by: Park, Namu, et al.
Published: (2025)
by: Park, Namu, et al.
Published: (2025)
VERT: Reliable LLM Judges for Radiology Report Evaluation
by: Bologna, Federica, et al.
Published: (2026)
by: Bologna, Federica, et al.
Published: (2026)
Adapting Biomedical Abstracts into Plain language using Large Language Models
by: Gangavarapu, Haritha, et al.
Published: (2025)
by: Gangavarapu, Haritha, et al.
Published: (2025)
CoRe-BT: A Multimodal Radiology-Pathology-Text Benchmark for Robust Brain Tumor Typing
by: Rivera, Juampablo E. Heras, et al.
Published: (2026)
by: Rivera, Juampablo E. Heras, et al.
Published: (2026)
A Novel Corpus of Annotated Medical Imaging Reports and Information Extraction Results Using BERT-based Language Models
by: Park, Namu, et al.
Published: (2024)
by: Park, Namu, et al.
Published: (2024)
Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models
by: Sakib, Fardin Ahsan, et al.
Published: (2025)
by: Sakib, Fardin Ahsan, et al.
Published: (2025)
A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
by: Corbeil, Jean-Philippe, et al.
Published: (2025)
by: Corbeil, Jean-Philippe, et al.
Published: (2025)
Overview of the MEDIQA-OE 2025 Shared Task on Medical Order Extraction from Doctor-Patient Consultations
by: Corbeil, Jean-Philippe, et al.
Published: (2025)
by: Corbeil, Jean-Philippe, et al.
Published: (2025)
MedErrBench: A Fine-Grained Multilingual Benchmark for Medical Error Detection and Correction with Clinical Expert Annotations
by: Ma, Congbo, et al.
Published: (2026)
by: Ma, Congbo, et al.
Published: (2026)
MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
by: Iwase, Naoto, et al.
Published: (2025)
by: Iwase, Naoto, et al.
Published: (2025)
Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication
by: Chung, Philip, et al.
Published: (2024)
by: Chung, Philip, et al.
Published: (2024)
Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction
by: Zeng, Sihang, et al.
Published: (2025)
by: Zeng, Sihang, et al.
Published: (2025)
Importance of Prompt Optimisation for Error Detection in Medical Notes Using Language Models
by: Myles, Craig, et al.
Published: (2026)
by: Myles, Craig, et al.
Published: (2026)
Overconfidence and Calibration in Medical VQA: Empirical Findings and Hallucination-Aware Mitigation
by: Byun, Ji Young, et al.
Published: (2026)
by: Byun, Ji Young, et al.
Published: (2026)
Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications
by: Corbeil, Jean-Philippe, et al.
Published: (2025)
by: Corbeil, Jean-Philippe, et al.
Published: (2025)
Term2Note: Synthesising Differentially Private Clinical Notes from Medical Terms
by: Wu, Yuping, et al.
Published: (2025)
by: Wu, Yuping, et al.
Published: (2025)
AR-BENCH: Benchmarking Legal Reasoning with Judgment Error Detection, Classification and Correction
by: Li, Yifei, et al.
Published: (2026)
by: Li, Yifei, et al.
Published: (2026)
WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction
by: Toma, Augustin, et al.
Published: (2024)
by: Toma, Augustin, et al.
Published: (2024)
IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents
by: Corbeil, Jean-Philippe
Published: (2024)
by: Corbeil, Jean-Philippe
Published: (2024)
The Task-oriented Queries Benchmark (ToQB)
by: Yim, Keun Soo
Published: (2024)
by: Yim, Keun Soo
Published: (2024)
Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes
by: Zhou, Yang, et al.
Published: (2026)
by: Zhou, Yang, et al.
Published: (2026)
TrajSurv: Learning Continuous Latent Trajectories from Electronic Health Records for Trustworthy Survival Prediction
by: Zeng, Sihang, et al.
Published: (2025)
by: Zeng, Sihang, et al.
Published: (2025)
Chapter GenRecipe for Generating Recipes from Videos through Deep Learning
by: Sin-wai, Chan
Published: (2026)
by: Sin-wai, Chan
Published: (2026)
Chapter TransRecipe for Translating Recipes through Machine Processing
by: Sin-wai, Chan
Published: (2026)
by: Sin-wai, Chan
Published: (2026)
Chapter TransRecipe for Translating Recipes through Machine Processing
by: Sin-wai, Chan
Published: (2026)
by: Sin-wai, Chan
Published: (2026)
Chapter VisualRecipe for Visualizing Recipes through Image Technology
by: Sin-wai, Chan
Published: (2026)
by: Sin-wai, Chan
Published: (2026)
Chapter GenRecipe for Generating Recipes from Videos through Deep Learning
by: Sin-wai, Chan
Published: (2026)
by: Sin-wai, Chan
Published: (2026)
Similar Items
-
MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering
by: Yim, Wen-wai, et al.
Published: (2025) -
RADAR: A Multimodal Benchmark for 3D Image-Based Radiology Report Review
by: Sun, Zhaoyi, et al.
Published: (2026) -
DermaVQA-DAS: Dermatology Assessment Schema (DAS) & Datasets for Closed-Ended Question Answering & Segmentation in Patient-Generated Dermatology Images
by: Yim, Wen-wai, et al.
Published: (2025) -
A Scoping Review of Natural Language Processing in Addressing Medically Inaccurate Information: Errors, Misinformation, and Hallucination
by: Sun, Zhaoyi, et al.
Published: (2025) -
Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
by: Fu, Yujuan, et al.
Published: (2024)