Guardado en:
| Autores principales: | Shen, Yuan, Wu, Xiaojun, Yu, Linghua |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.11544 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
ELMTEX: Fine-Tuning Large Language Models for Structured Clinical Information Extraction. A Case Study on Clinical Reports
por: Guluzade, Aynur, et al.
Publicado: (2025)
por: Guluzade, Aynur, et al.
Publicado: (2025)
Evaluating Large Language Models for IUCN Red List Species Information
por: Uryu, Shinya
Publicado: (2025)
por: Uryu, Shinya
Publicado: (2025)
End-to-End Evaluation and Governance of an EHR-Embedded AI Agent for Clinicians
por: Shah, Aaryan, et al.
Publicado: (2026)
por: Shah, Aaryan, et al.
Publicado: (2026)
A Method for the Architecture of a Medical Vertical Large Language Model Based on Deepseek R1
por: Zhang, Mingda, et al.
Publicado: (2025)
por: Zhang, Mingda, et al.
Publicado: (2025)
Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters
por: Shah, Aaryan, et al.
Publicado: (2026)
por: Shah, Aaryan, et al.
Publicado: (2026)
GPTON: Generative Pre-trained Transformers enhanced with Ontology Narration for accurate annotation of biological data
por: Li, Rongbin, et al.
Publicado: (2024)
por: Li, Rongbin, et al.
Publicado: (2024)
Ask WhAI:Probing Belief Formation in Role-Primed LLM Agents
por: Moore, Keith, et al.
Publicado: (2025)
por: Moore, Keith, et al.
Publicado: (2025)
Interpretability without actionability: mechanistic methods cannot correct language model errors despite near-perfect internal representations
por: Basu, Sanjay, et al.
Publicado: (2026)
por: Basu, Sanjay, et al.
Publicado: (2026)
AcuityBench: Evaluating Clinical Acuity Identification and Uncertainty Alignment
por: Linzmayer, Robin, et al.
Publicado: (2026)
por: Linzmayer, Robin, et al.
Publicado: (2026)
Igea: a Decoder-Only Language Model for Biomedical Text Generation in Italian
por: Buonocore, Tommaso Mario, et al.
Publicado: (2024)
por: Buonocore, Tommaso Mario, et al.
Publicado: (2024)
CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening
por: Lorenzoni, Giuliano, et al.
Publicado: (2026)
por: Lorenzoni, Giuliano, et al.
Publicado: (2026)
MedPI: Evaluating AI Systems in Medical Patient-facing Interactions
por: V., Diego Fajardo, et al.
Publicado: (2025)
por: V., Diego Fajardo, et al.
Publicado: (2025)
Model selection meets clinical semantics: Optimizing ICD-10-CM prediction via LLM-as-Judge evaluation, redundancy-aware sampling, and section-aware fine-tuning
por: Dai, Hong-Jie, et al.
Publicado: (2025)
por: Dai, Hong-Jie, et al.
Publicado: (2025)
BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment
por: Sakhovskiy, Andrey, et al.
Publicado: (2025)
por: Sakhovskiy, Andrey, et al.
Publicado: (2025)
BLT: Can Large Language Models Handle Basic Legal Text?
por: Blair-Stanek, Andrew, et al.
Publicado: (2023)
por: Blair-Stanek, Andrew, et al.
Publicado: (2023)
BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data
por: Hsu, Brian, et al.
Publicado: (2026)
por: Hsu, Brian, et al.
Publicado: (2026)
OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition
por: Tao, Xinli, et al.
Publicado: (2025)
por: Tao, Xinli, et al.
Publicado: (2025)
Fine-Tuning Open-Weight Language Models to Deliver Cognitive Behavioral Therapy for Depression: A Feasibility Study
por: Tahir, Talha
Publicado: (2024)
por: Tahir, Talha
Publicado: (2024)
The use of GPT-4o and Other Large Language Models for the Improvement and Design of Self-Assessment Scales for Measurement of Interpersonal Communication Skills
por: Bubaš, Goran
Publicado: (2024)
por: Bubaš, Goran
Publicado: (2024)
Curated AI beats frontier LLMs at pharma asset discovery
por: Kidziński, Łukasz, et al.
Publicado: (2026)
por: Kidziński, Łukasz, et al.
Publicado: (2026)
Classifiers of Data Sharing Statements in Clinical Trial Records
por: Mamaghani, Saber Jelodari, et al.
Publicado: (2025)
por: Mamaghani, Saber Jelodari, et al.
Publicado: (2025)
CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification
por: Ye, Severin, et al.
Publicado: (2026)
por: Ye, Severin, et al.
Publicado: (2026)
On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora
por: Chen, Tzu-Chieh, et al.
Publicado: (2024)
por: Chen, Tzu-Chieh, et al.
Publicado: (2024)
Evaluating the Challenges of LLMs in Real-world Medical Follow-up: A Comparative Study and An Optimized Framework
por: Liu, Jinyan, et al.
Publicado: (2025)
por: Liu, Jinyan, et al.
Publicado: (2025)
Expertise Is What We Want
por: Ashworth, Alan, et al.
Publicado: (2025)
por: Ashworth, Alan, et al.
Publicado: (2025)
Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformers
por: Bergomi, Laura, et al.
Publicado: (2024)
por: Bergomi, Laura, et al.
Publicado: (2024)
Advancing Italian Biomedical Information Extraction with Transformers-based Models: Methodological Insights and Multicenter Practical Application
por: Crema, Claudio, et al.
Publicado: (2023)
por: Crema, Claudio, et al.
Publicado: (2023)
The Foundational Capabilities of Large Language Models in Predicting Postoperative Risks Using Clinical Notes
por: Alba, Charles, et al.
Publicado: (2024)
por: Alba, Charles, et al.
Publicado: (2024)
ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts
por: Nabhani, Sara, et al.
Publicado: (2026)
por: Nabhani, Sara, et al.
Publicado: (2026)
DrugReasoner: Interpretable Drug Approval Prediction with a Reasoning-augmented Language Model
por: Ghaffarzadeh-Esfahani, Mohammadreza, et al.
Publicado: (2025)
por: Ghaffarzadeh-Esfahani, Mohammadreza, et al.
Publicado: (2025)
Temporal Relation Extraction in Clinical Texts: A Span-based Graph Transformer Approach
por: Chaturvedi, Rochana, et al.
Publicado: (2025)
por: Chaturvedi, Rochana, et al.
Publicado: (2025)
DreamNet: A Multimodal Framework for Semantic and Emotional Analysis of Sleep Narratives
por: Panchagnula, Tapasvi
Publicado: (2025)
por: Panchagnula, Tapasvi
Publicado: (2025)
Performance of Large Language Models in Supporting Medical Diagnosis and Treatment
por: Sousa, Diogo, et al.
Publicado: (2025)
por: Sousa, Diogo, et al.
Publicado: (2025)
MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports
por: Li, Yingyun, et al.
Publicado: (2026)
por: Li, Yingyun, et al.
Publicado: (2026)
Shallow Robustness, Deep Vulnerabilities: Multi-Turn Evaluation of Medical LLMs
por: Manczak, Blazej, et al.
Publicado: (2025)
por: Manczak, Blazej, et al.
Publicado: (2025)
DALL-M: Context-Aware Clinical Data Augmentation with LLMs
por: Hsieh, Chihcheng, et al.
Publicado: (2024)
por: Hsieh, Chihcheng, et al.
Publicado: (2024)
The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games
por: Mozikov, Mikhail, et al.
Publicado: (2024)
por: Mozikov, Mikhail, et al.
Publicado: (2024)
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process
por: Kim, Jaewoong, et al.
Publicado: (2024)
por: Kim, Jaewoong, et al.
Publicado: (2024)
Contrastive learning of T cell receptor representations
por: Nagano, Yuta, et al.
Publicado: (2024)
por: Nagano, Yuta, et al.
Publicado: (2024)
PerkwE_COQA: Enhanced Persian Conversational Question Answering by combining contextual keyword extraction with Large Language Models
por: Moradbeiki, Pardis, et al.
Publicado: (2024)
por: Moradbeiki, Pardis, et al.
Publicado: (2024)
Ejemplares similares
-
ELMTEX: Fine-Tuning Large Language Models for Structured Clinical Information Extraction. A Case Study on Clinical Reports
por: Guluzade, Aynur, et al.
Publicado: (2025) -
Evaluating Large Language Models for IUCN Red List Species Information
por: Uryu, Shinya
Publicado: (2025) -
End-to-End Evaluation and Governance of an EHR-Embedded AI Agent for Clinicians
por: Shah, Aaryan, et al.
Publicado: (2026) -
A Method for the Architecture of a Medical Vertical Large Language Model Based on Deepseek R1
por: Zhang, Mingda, et al.
Publicado: (2025) -
Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters
por: Shah, Aaryan, et al.
Publicado: (2026)