| Main Authors: | Han, Changho, Kim, Songsoo, Kim, Dong Won, Celi, Leo Anthony, Kim, Jaewoong, Bae, SungA, Yoon, Dukyong |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2604.20331 |
Similar Items
A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection
by: Kim, Songsoo, et al.
Published: (2025)
Early screening of potential breakthrough technologies with enhanced interpretability: A patent-specific hierarchical attention network model
by: Choi, Jaewoong, et al.
Published: (2024)
Pruning and Distilling Mixture-of-Experts into Dense Language Models
by: Kim, Junhyuck, et al.
Published: (2026)
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
by: Lee, Keon, et al.
Published: (2024)
Learning to Correct for QA Reasoning with Black-box LLMs
by: Kim, Jaehyung, et al.
Published: (2024)
CXR-LLAVA: a multimodal large language model for interpreting chest X-ray images
by: Lee, Seowoo, et al.
Published: (2023)
Design of reliable technology valuation model with calibrated machine learning of patent indicators
by: Lee, Seunghyun, et al.
Published: (2024)
CCQA: Generating Question from Solution Can Improve Inference-Time Reasoning in SLMs
by: Kim, Jin Young, et al.
Published: (2025)
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process
by: Kim, Jaewoong, et al.
Published: (2024)
Raon-Speech Technical Report
by: Kim, Beomsoo, et al.
Published: (2026)
Generalization in medical AI: a perspective on developing scalable models
by: Zvuloni, Eran, et al.
Published: (2023)
SimMOF: AI agent for Automated MOF Simulations
by: Lee, Jaewoong, et al.
Published: (2026)
A Dataset and Resources for Identifying Patient Health Literacy Information from Clinical Notes
by: Bittner, Madeline, et al.
Published: (2026)
Polishing Every Facet of the GEM: Testing Linguistic Competence of LLMs and Humans in Korean
by: Kim, SungHo, et al.
Published: (2025)
Academic Vibe Coding: Opportunities for Accelerating Research in an Era of Resource Constraint
by: Crowson, Matthew G, et al.
Published: (2025)
AnomaLLMy -- Detecting anomalous tokens in black-box LLMs through low-confidence single-token predictions
by: Witold, Waligóra
Published: (2024)
Collective Critics for Creative Story Generation
by: Bae, Minwook, et al.
Published: (2024)
DEBATE: Devil's Advocate-Based Assessment and Text Evaluation
by: Kim, Alex, et al.
Published: (2024)
Latent Preference Modeling for Cross-Session Personalized Tool Calling
by: Yoon, Yejin, et al.
Published: (2026)
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities
by: Mao, Yujun, et al.
Published: (2024)
Natural Language Declarative Prompting (NLD-P): A Modular Governance Method for Prompt Design Under Model Drift
by: Kim, Hyunwoo, et al.
Published: (2026)
Improving Dialogue Agents by Decomposing One Global Explicit Annotation with Local Implicit Multimodal Feedback
by: Lee, Dong Won, et al.
Published: (2024)
Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation
by: Kim, Jaechang, et al.
Published: (2024)
Language-Agnostic Suicidal Risk Detection Using Large Language Models
by: Kim, June-Woo, et al.
Published: (2025)
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance
by: Park, Dongmin, et al.
Published: (2024)
Local Explanations and Self-Explanations for Assessing Faithfulness in black-box LLMs
by: Fragkathoulas, Christos, et al.
Published: (2024)
LLMs can be easily Confused by Instructional Distractions
by: Hwang, Yerin, et al.
Published: (2025)
Fine-Grained and Thematic Evaluation of LLMs in Social Deduction Game
by: Kim, Byungjun, et al.
Published: (2024)
Retrieval-augmented systems can be dangerous medical communicators
by: Wong, Lionel, et al.
Published: (2025)
Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models
by: Kang, Minki, et al.
Published: (2024)
BenchPreS: A Benchmark for Context-Aware Personalized Preference Selectivity of Persistent-Memory LLMs
by: Yoon, Sangyeon, et al.
Published: (2026)
FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games
by: Ahn, Jaewoo, et al.
Published: (2025)
Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs
by: Kim, Hyeonwoo, et al.
Published: (2024)
IPCGRL: Language-Instructed Reinforcement Learning for Procedural Level Generation
by: Baek, In-Chang, et al.
Published: (2025)
Learn from Weaknesses: Automated Domain Specialization for Small Computer-Use Agents
by: Kim, Suji, et al.
Published: (2026)
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation
by: Kim, Kiseung, et al.
Published: (2024)
Performance Gains of LLMs With Humans in a World of LLMs Versus Humans
by: McCullum, Lucas, et al.
Published: (2025)
Evaluating LLMs for Police Decision-Making: A Framework Based on Police Action Scenarios
by: Lee, Sangyub, et al.
Published: (2026)
From National Curricula to Cultural Awareness: Constructing Open-Ended Culture-Specific Question Answering Dataset
by: Yoo, Haneul, et al.
Published: (2026)
Reasoning Models Better Express Their Confidence
by: Yoon, Dongkeun, et al.
Published: (2025)