Bibliographic Details
Main Authors:	Abacha, Asma Ben; Yim, Wen-wai; Fu, Yujuan; Sun, Zhaoyi; Yetisgen, Meliha; Xia, Fei; Lin, Thomas
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2412.19260

Similar Items

MORQA: Benchmarking Evaluation Metrics for Medical Open-Ended Question Answering
by: Yim, Wen-wai, et al.
Published: (2025)

RADAR: A Multimodal Benchmark for 3D Image-Based Radiology Report Review
by: Sun, Zhaoyi, et al.
Published: (2026)

DermaVQA-DAS: Dermatology Assessment Schema (DAS) & Datasets for Closed-Ended Question Answering & Segmentation in Patient-Generated Dermatology Images
by: Yim, Wen-wai, et al.
Published: (2025)

A Scoping Review of Natural Language Processing in Addressing Medically Inaccurate Information: Errors, Misinformation, and Hallucination
by: Sun, Zhaoyi, et al.
Published: (2025)

Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions
by: Fu, Yujuan, et al.
Published: (2024)

A Systematic Analysis of Large Language Models with RAG-enabled Dynamic Prompting for Medical Error Detection and Correction
by: Ahmed, Farzad, et al.
Published: (2025)

BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning
by: Fu, Yujuan Velvin, et al.
Published: (2024)

CACER: Clinical Concept Annotations for Cancer Events and Relations
by: Fu, Yujuan, et al.
Published: (2024)

Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods
by: Fu, Yujuan, et al.
Published: (2024)

RadTimeline: Timeline Summarization for Longitudinal Radiological Lung Findings
by: Zhou, Sitong, et al.
Published: (2026)

UW-BioNLP at ChemoTimelines 2025: Thinking, Fine-Tuning, and Dictionary-Enhanced LLM Systems for Chemotherapy Timeline Extraction
by: Zhang, Tianmai M., et al.
Published: (2025)

Automated Identification of Incidentalomas Requiring Follow-Up: A Multi-Anatomy Evaluation of LLM-Based and Supervised Approaches
by: Park, Namu, et al.
Published: (2025)

Large Language Model-Based Agents for Automated Research Reproducibility: An Exploratory Study in Alzheimer's Disease
by: Dobbins, Nic, et al.
Published: (2025)

Identifying Imaging Follow-Up in Radiology Reports: A Comparative Analysis of Traditional ML and LLM Approaches
by: Park, Namu, et al.
Published: (2025)

VERT: Reliable LLM Judges for Radiology Report Evaluation
by: Bologna, Federica, et al.
Published: (2026)

Adapting Biomedical Abstracts into Plain language using Large Language Models
by: Gangavarapu, Haritha, et al.
Published: (2025)

CoRe-BT: A Multimodal Radiology-Pathology-Text Benchmark for Robust Brain Tumor Typing
by: Rivera, Juampablo E. Heras, et al.
Published: (2026)

A Novel Corpus of Annotated Medical Imaging Reports and Information Extraction Results Using BERT-based Language Models
by: Park, Namu, et al.
Published: (2024)

Spurious Correlations and Beyond: Understanding and Mitigating Shortcut Learning in SDOH Extraction with Large Language Models
by: Sakib, Fardin Ahsan, et al.
Published: (2025)

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
by: Corbeil, Jean-Philippe, et al.
Published: (2025)

Overview of the MEDIQA-OE 2025 Shared Task on Medical Order Extraction from Doctor-Patient Consultations
by: Corbeil, Jean-Philippe, et al.
Published: (2025)

MedErrBench: A Fine-Grained Multilingual Benchmark for Medical Error Detection and Correction with Clinical Expert Annotations
by: Ma, Congbo, et al.
Published: (2026)

MedRECT: A Medical Reasoning Benchmark for Error Correction in Clinical Texts
by: Iwase, Naoto, et al.
Published: (2025)

Large Language Model Capabilities in Perioperative Risk Prediction and Prognostication
by: Chung, Philip, et al.
Published: (2024)

Traj-CoA: Patient Trajectory Modeling via Chain-of-Agents for Lung Cancer Risk Prediction
by: Zeng, Sihang, et al.
Published: (2025)

Importance of Prompt Optimisation for Error Detection in Medical Notes Using Language Models
by: Myles, Craig, et al.
Published: (2026)

Overconfidence and Calibration in Medical VQA: Empirical Findings and Hallucination-Aware Mitigation
by: Byun, Ji Young, et al.
Published: (2026)

Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications
by: Corbeil, Jean-Philippe, et al.
Published: (2025)

Term2Note: Synthesising Differentially Private Clinical Notes from Medical Terms
by: Wu, Yuping, et al.
Published: (2025)

AR-BENCH: Benchmarking Legal Reasoning with Judgment Error Detection, Classification and Correction
by: Li, Yifei, et al.
Published: (2026)

WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction
by: Toma, Augustin, et al.
Published: (2024)

IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents
by: Corbeil, Jean-Philippe
Published: (2024)

The Task-oriented Queries Benchmark (ToQB)
by: Yim, Keun Soo
Published: (2024)

Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes
by: Zhou, Yang, et al.
Published: (2026)

TrajSurv: Learning Continuous Latent Trajectories from Electronic Health Records for Trustworthy Survival Prediction
by: Zeng, Sihang, et al.
Published: (2025)

Chapter GenRecipe for Generating Recipes from Videos through Deep Learning
by: Sin-wai, Chan
Published: (2026)

Chapter TransRecipe for Translating Recipes through Machine Processing
by: Sin-wai, Chan
Published: (2026)

Chapter VisualRecipe for Visualizing Recipes through Image Technology
by: Sin-wai, Chan
Published: (2026)