:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Shen, Yuan, Wu, Xiaojun, Yu, Linghua
Formato:	Preprint
Publicado:	2025
Materias:	Artificial Intelligence I.2.7; J.3
Acceso en línea:	https://arxiv.org/abs/2512.11544
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

ELMTEX: Fine-Tuning Large Language Models for Structured Clinical Information Extraction. A Case Study on Clinical Reports
por: Guluzade, Aynur, et al.
Publicado: (2025)

Evaluating Large Language Models for IUCN Red List Species Information
por: Uryu, Shinya
Publicado: (2025)

End-to-End Evaluation and Governance of an EHR-Embedded AI Agent for Clinicians
por: Shah, Aaryan, et al.
Publicado: (2026)

A Method for the Architecture of a Medical Vertical Large Language Model Based on Deepseek R1
por: Zhang, Mingda, et al.
Publicado: (2025)

Case-Specific Rubrics for Clinical AI Evaluation: Methodology, Validation, and LLM-Clinician Agreement Across 823 Encounters
por: Shah, Aaryan, et al.
Publicado: (2026)

GPTON: Generative Pre-trained Transformers enhanced with Ontology Narration for accurate annotation of biological data
por: Li, Rongbin, et al.
Publicado: (2024)

Ask WhAI:Probing Belief Formation in Role-Primed LLM Agents
por: Moore, Keith, et al.
Publicado: (2025)

Interpretability without actionability: mechanistic methods cannot correct language model errors despite near-perfect internal representations
por: Basu, Sanjay, et al.
Publicado: (2026)

AcuityBench: Evaluating Clinical Acuity Identification and Uncertainty Alignment
por: Linzmayer, Robin, et al.
Publicado: (2026)

Igea: a Decoder-Only Language Model for Biomedical Text Generation in Italian
por: Buonocore, Tommaso Mario, et al.
Publicado: (2024)

CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening
por: Lorenzoni, Giuliano, et al.
Publicado: (2026)

MedPI: Evaluating AI Systems in Medical Patient-facing Interactions
por: V., Diego Fajardo, et al.
Publicado: (2025)

Model selection meets clinical semantics: Optimizing ICD-10-CM prediction via LLM-as-Judge evaluation, redundancy-aware sampling, and section-aware fine-tuning
por: Dai, Hong-Jie, et al.
Publicado: (2025)

BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment
por: Sakhovskiy, Andrey, et al.
Publicado: (2025)

BLT: Can Large Language Models Handle Basic Legal Text?
por: Blair-Stanek, Andrew, et al.
Publicado: (2023)

BioAlchemy: Distilling Biological Literature into Reasoning-Ready Reinforcement Learning Training Data
por: Hsu, Brian, et al.
Publicado: (2026)

OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition
por: Tao, Xinli, et al.
Publicado: (2025)

Fine-Tuning Open-Weight Language Models to Deliver Cognitive Behavioral Therapy for Depression: A Feasibility Study
por: Tahir, Talha
Publicado: (2024)

The use of GPT-4o and Other Large Language Models for the Improvement and Design of Self-Assessment Scales for Measurement of Interpersonal Communication Skills
por: Bubaš, Goran
Publicado: (2024)

Curated AI beats frontier LLMs at pharma asset discovery
por: Kidziński, Łukasz, et al.
Publicado: (2026)

Classifiers of Data Sharing Statements in Clinical Trial Records
por: Mamaghani, Saber Jelodari, et al.
Publicado: (2025)

CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification
por: Ye, Severin, et al.
Publicado: (2026)

On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora
por: Chen, Tzu-Chieh, et al.
Publicado: (2024)

Evaluating the Challenges of LLMs in Real-world Medical Follow-up: A Comparative Study and An Optimized Framework
por: Liu, Jinyan, et al.
Publicado: (2025)

Expertise Is What We Want
por: Ashworth, Alan, et al.
Publicado: (2025)

Reshaping Free-Text Radiology Notes Into Structured Reports With Generative Transformers
por: Bergomi, Laura, et al.
Publicado: (2024)

Advancing Italian Biomedical Information Extraction with Transformers-based Models: Methodological Insights and Multicenter Practical Application
por: Crema, Claudio, et al.
Publicado: (2023)

The Foundational Capabilities of Large Language Models in Predicting Postoperative Risks Using Clinical Notes
por: Alba, Charles, et al.
Publicado: (2024)

ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts
por: Nabhani, Sara, et al.
Publicado: (2026)

DrugReasoner: Interpretable Drug Approval Prediction with a Reasoning-augmented Language Model
por: Ghaffarzadeh-Esfahani, Mohammadreza, et al.
Publicado: (2025)

Temporal Relation Extraction in Clinical Texts: A Span-based Graph Transformer Approach
por: Chaturvedi, Rochana, et al.
Publicado: (2025)

DreamNet: A Multimodal Framework for Semantic and Emotional Analysis of Sleep Narratives
por: Panchagnula, Tapasvi
Publicado: (2025)

Performance of Large Language Models in Supporting Medical Diagnosis and Treatment
por: Sousa, Diogo, et al.
Publicado: (2025)

MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports
por: Li, Yingyun, et al.
Publicado: (2026)

Shallow Robustness, Deep Vulnerabilities: Multi-Turn Evaluation of Medical LLMs
por: Manczak, Blazej, et al.
Publicado: (2025)

DALL-M: Context-Aware Clinical Data Augmentation with LLMs
por: Hsieh, Chihcheng, et al.
Publicado: (2024)

The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games
por: Mozikov, Mikhail, et al.
Publicado: (2024)

From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process
por: Kim, Jaewoong, et al.
Publicado: (2024)

Contrastive learning of T cell receptor representations
por: Nagano, Yuta, et al.
Publicado: (2024)

PerkwE_COQA: Enhanced Persian Conversational Question Answering by combining contextual keyword extraction with Large Language Models
por: Moradbeiki, Pardis, et al.
Publicado: (2024)