Saved in:
| Main Authors: | Wang, Hanyin, Wu, Zhenbang, Kolar, Gururaj, Korsapati, Hariprasad, Bartlett, Brian, Hull, Bryan, Sun, Jimeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.21908 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Training LLMs for EHR-Based Reasoning Tasks via Reinforcement Learning
by: Lin, Jiacheng, et al.
Published: (2025)
by: Lin, Jiacheng, et al.
Published: (2025)
Process-Supervised Reward Models for Verifying Clinical Note Generation: A Scalable Approach Guided by Domain Expertise
by: Wang, Hanyin, et al.
Published: (2024)
by: Wang, Hanyin, et al.
Published: (2024)
Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation
by: Wang, Hanyin, et al.
Published: (2024)
by: Wang, Hanyin, et al.
Published: (2024)
Bridging the Reproducibility Divide: Open Source Software's Role in Standardizing Healthcare AI
by: Wu, John, et al.
Published: (2026)
by: Wu, John, et al.
Published: (2026)
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
by: Wu, John, et al.
Published: (2024)
by: Wu, John, et al.
Published: (2024)
An Empirical Study of Reasoning Steps in Thinking Code LLMs
by: Xue, Haoran, et al.
Published: (2025)
by: Xue, Haoran, et al.
Published: (2025)
Social Determinants of Health Prediction for ICD-9 Code with Reasoning Models
by: Khan, Sharim, et al.
Published: (2025)
by: Khan, Sharim, et al.
Published: (2025)
DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
by: Wu, John, et al.
Published: (2024)
by: Wu, John, et al.
Published: (2024)
MANet: A deep learning for object detection
by: Wu, Zhenbang
Published: (2026)
by: Wu, Zhenbang
Published: (2026)
Sparse Regression Codes for Secret Key Agreement: Achieving Strong Secrecy and Near-Optimal Rates for Gaussian Sources
by: Athanasakos, Emmanouil M., et al.
Published: (2025)
by: Athanasakos, Emmanouil M., et al.
Published: (2025)
Convex Holder bound and its applications
by: M, Hariprasad
Published: (2025)
by: M, Hariprasad
Published: (2025)
Recursive eigen extrusion: Expanding eigenbasis conjecture
by: Hariprasad, M
Published: (2019)
by: Hariprasad, M
Published: (2019)
Prompt Sensitivity and Answer Consistency of Small Open-Source Language Models for Clinical Question Answering in Low-Resource Healthcare
by: Hariprasad, Shravani
Published: (2026)
by: Hariprasad, Shravani
Published: (2026)
On the Quantization Robustness of Diffusion Language Models in Coding Benchmarks
by: Gupta, Aarav, et al.
Published: (2026)
by: Gupta, Aarav, et al.
Published: (2026)
Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation
by: Bao, Qiming, et al.
Published: (2022)
by: Bao, Qiming, et al.
Published: (2022)
Test-Time Adaptation for EEG Foundation Models: A Systematic Study under Real-World Distribution Shifts
by: Lee, Gabriel Jason, et al.
Published: (2026)
by: Lee, Gabriel Jason, et al.
Published: (2026)
Towards Physiologically Sensible Predictions via the Rule-based Reinforcement Learning Layer
by: Zhu, Lingwei, et al.
Published: (2025)
by: Zhu, Lingwei, et al.
Published: (2025)
TTM-RE: Memory-Augmented Document-Level Relation Extraction
by: Gao, Chufan, et al.
Published: (2024)
by: Gao, Chufan, et al.
Published: (2024)
Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding
by: Gao, Chufan, et al.
Published: (2025)
by: Gao, Chufan, et al.
Published: (2025)
On Formally Undecidable Propositions of Nondeterministic Complexity and Related Classes
by: Kolář, Martin
Published: (2026)
by: Kolář, Martin
Published: (2026)
Scouring Parrondo's Paradox in Discrete-Time Quantum Walks
by: Kadiri, Gururaj
Published: (2024)
by: Kadiri, Gururaj
Published: (2024)
The conjugacy problem in Out(Fm) when the polynomial restrictions are non-growing
by: Bartlett, Gabriel
Published: (2025)
by: Bartlett, Gabriel
Published: (2025)
A Perspective for Adapting Generalist AI to Specialized Medical AI Applications and Their Challenges
by: Wang, Zifeng, et al.
Published: (2024)
by: Wang, Zifeng, et al.
Published: (2024)
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning
by: Chen, Yang, et al.
Published: (2025)
by: Chen, Yang, et al.
Published: (2025)
On the Empirical Complexity of Reasoning and Planning in LLMs
by: Kang, Liwei, et al.
Published: (2024)
by: Kang, Liwei, et al.
Published: (2024)
Molecular De Novo Design through Transformer-based Reinforcement Learning
by: Xu, Pengcheng, et al.
Published: (2023)
by: Xu, Pengcheng, et al.
Published: (2023)
LLMs are Bug Replicators: An Empirical Study on LLMs' Capability in Completing Bug-prone Code
by: Guo, Liwei, et al.
Published: (2025)
by: Guo, Liwei, et al.
Published: (2025)
Open the Oyster: Empirical Evaluation and Improvement of Code Reasoning Confidence in LLMs
by: Wang, Shufan, et al.
Published: (2025)
by: Wang, Shufan, et al.
Published: (2025)
An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM Agents
by: Jin, Bowen, et al.
Published: (2025)
by: Jin, Bowen, et al.
Published: (2025)
Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning
by: Panaganti, Kishan, et al.
Published: (2026)
by: Panaganti, Kishan, et al.
Published: (2026)
Accurate, Efficient, and Explainable Deep Learning Approaches for Environmental Science Problems
by: Shi, Jimeng
Published: (2026)
by: Shi, Jimeng
Published: (2026)
CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
by: Jiang, Songtao, et al.
Published: (2025)
by: Jiang, Songtao, et al.
Published: (2025)
How Execution Features Relate to Failures: An Empirical Study and Diagnosis Approach
by: Smytzek, Marius, et al.
Published: (2025)
by: Smytzek, Marius, et al.
Published: (2025)
Effective Learning for Small Reasoning Models: An Empirical Study on 0.5B Reasoning LLMs
by: Zhuang, Xialie, et al.
Published: (2025)
by: Zhuang, Xialie, et al.
Published: (2025)
Enhancing the Code Reasoning Capabilities of LLMs via Consistency-based Reinforcement Learning
by: Qin, Zhanyue, et al.
Published: (2026)
by: Qin, Zhanyue, et al.
Published: (2026)
Agent-Centric Personalized Multiple Clustering with Multi-Modal LLMs
by: Chen, Ziye, et al.
Published: (2025)
by: Chen, Ziye, et al.
Published: (2025)
CodeReasoner: Enhancing the Code Reasoning Ability with Reinforcement Learning
by: Tang, Lingxiao, et al.
Published: (2025)
by: Tang, Lingxiao, et al.
Published: (2025)
FINE STRUCTURE IN MANHATTAN’S DAYTIME URBAN HEAT ISLAND: A NEW DATASET
by: Brian Vant-Hull
Published: (2014)
by: Brian Vant-Hull
Published: (2014)
Circular Super patterns and Zigzag constructions
by: Manjunath, Hariprasad, et al.
Published: (2026)
by: Manjunath, Hariprasad, et al.
Published: (2026)
How Good Are LLMs at Out-of-Distribution Detection?
by: Liu, Bo, et al.
Published: (2023)
by: Liu, Bo, et al.
Published: (2023)
Similar Items
-
Training LLMs for EHR-Based Reasoning Tasks via Reinforcement Learning
by: Lin, Jiacheng, et al.
Published: (2025) -
Process-Supervised Reward Models for Verifying Clinical Note Generation: A Scalable Approach Guided by Domain Expertise
by: Wang, Hanyin, et al.
Published: (2024) -
Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation
by: Wang, Hanyin, et al.
Published: (2024) -
Bridging the Reproducibility Divide: Open Source Software's Role in Standardizing Healthcare AI
by: Wu, John, et al.
Published: (2026) -
Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
by: Wu, John, et al.
Published: (2024)