Saved in:
| Main Authors: | Sherif, Omar, Hamdi, Ali |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.09833 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Conditioning Clinical Text Generation for User Control
by: Koraş, Osman Alperen, et al.
Published: (2025)
by: Koraş, Osman Alperen, et al.
Published: (2025)
Multi-Hierarchical Feature Detection for Large Language Model Generated Text
by: Zhang, Luyan, et al.
Published: (2025)
by: Zhang, Luyan, et al.
Published: (2025)
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date overview
by: Han, Lifeng, et al.
Published: (2016)
by: Han, Lifeng, et al.
Published: (2016)
HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs
by: Cherif, Ahmed
Published: (2026)
by: Cherif, Ahmed
Published: (2026)
An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs
by: Zhu, Qian, et al.
Published: (2026)
by: Zhu, Qian, et al.
Published: (2026)
ReTreVal: Reasoning Tree with Validation -- A Hybrid Framework for Enhanced LLM Multi-Step Reasoning
by: HS, Abhishek, et al.
Published: (2026)
by: HS, Abhishek, et al.
Published: (2026)
EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning
by: Sauter, Andreas, et al.
Published: (2026)
by: Sauter, Andreas, et al.
Published: (2026)
A Super-Learner with Large Language Models for Medical Emergency Advising
by: Aityan, Sergey K., et al.
Published: (2025)
by: Aityan, Sergey K., et al.
Published: (2025)
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
by: Hashemi, Helia, et al.
Published: (2024)
by: Hashemi, Helia, et al.
Published: (2024)
Reinforcement of Explainability of ChatGPT Prompts by Embedding Breast Cancer Self-Screening Rules into AI Responses
by: Khan, Yousef, et al.
Published: (2024)
by: Khan, Yousef, et al.
Published: (2024)
LongSumEval: Question-Answering Based Evaluation and Feedback-Driven Refinement for Long Document Summarization
by: Nguyen, Huyen, et al.
Published: (2026)
by: Nguyen, Huyen, et al.
Published: (2026)
CAG: Chunked Augmented Generation for Google Chrome's Built-in Gemini Nano
by: Surulimuthu, Vivek Vellaiyappan, et al.
Published: (2024)
by: Surulimuthu, Vivek Vellaiyappan, et al.
Published: (2024)
OEMA: Ontology-Enhanced Multi-Agent Collaboration Framework for Zero-Shot Clinical Named Entity Recognition
by: Tao, Xinli, et al.
Published: (2025)
by: Tao, Xinli, et al.
Published: (2025)
Task-Adaptive Semantic Communications with Controllable Diffusion-based Data Regeneration
by: Guo, Fupei, et al.
Published: (2025)
by: Guo, Fupei, et al.
Published: (2025)
AskSport: Web Application for Sports Question-Answering
by: Onofre, Enzo B, et al.
Published: (2025)
by: Onofre, Enzo B, et al.
Published: (2025)
How to Evaluate Medical AI
by: Kopanichuk, Ilia, et al.
Published: (2025)
by: Kopanichuk, Ilia, et al.
Published: (2025)
Comparing the Performance of LLMs in RAG-based Question-Answering: A Case Study in Computer Science Literature
by: Dayarathne, Ranul, et al.
Published: (2025)
by: Dayarathne, Ranul, et al.
Published: (2025)
Towards Safer Chatbots: Automated Policy Compliance Evaluation of Custom GPTs
by: Rodriguez, David, et al.
Published: (2025)
by: Rodriguez, David, et al.
Published: (2025)
Comparative Analysis of AI Agent Architectures for Entity Relationship Classification
by: Berijanian, Maryam, et al.
Published: (2025)
by: Berijanian, Maryam, et al.
Published: (2025)
Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning
by: Gao, Yilin, et al.
Published: (2024)
by: Gao, Yilin, et al.
Published: (2024)
Challenges and Opportunities of NLP for HR Applications: A Discussion Paper
by: Leidner, Jochen L., et al.
Published: (2024)
by: Leidner, Jochen L., et al.
Published: (2024)
Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study
by: Hasan, Mohammed Rakibul
Published: (2026)
by: Hasan, Mohammed Rakibul
Published: (2026)
Practical Design and Benchmarking of Generative AI Applications for Surgical Billing and Coding
by: Rollman, John C., et al.
Published: (2025)
by: Rollman, John C., et al.
Published: (2025)
Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada
by: Nejadgholi, Isar, et al.
Published: (2024)
by: Nejadgholi, Isar, et al.
Published: (2024)
Lossless Prompt Compression via Dictionary-Encoding and In-Context Learning: Enabling Cost-Effective LLM Analysis of Repetitive Data
by: de Campos, Andresa Rodrigues, et al.
Published: (2026)
by: de Campos, Andresa Rodrigues, et al.
Published: (2026)
Leveraging Large Language Models to Extract and Translate Medical Information in Doctors' Notes for Health Records and Diagnostic Billing Codes
by: Hartnett, Peter, et al.
Published: (2026)
by: Hartnett, Peter, et al.
Published: (2026)
YourBench: Easy Custom Evaluation Sets for Everyone
by: Shashidhar, Sumuk, et al.
Published: (2025)
by: Shashidhar, Sumuk, et al.
Published: (2025)
Large Language Models for Judicial Entity Extraction: A Comparative Study
by: Hussain, Atin Sakkeer, et al.
Published: (2024)
by: Hussain, Atin Sakkeer, et al.
Published: (2024)
Large Language User Interfaces: Voice Interactive User Interfaces powered by LLMs
by: Wasti, Syed Mekael, et al.
Published: (2024)
by: Wasti, Syed Mekael, et al.
Published: (2024)
AI-Powered Detection of Inappropriate Language in Medical School Curricula
by: Salavati, Chiman, et al.
Published: (2025)
by: Salavati, Chiman, et al.
Published: (2025)
SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance
by: Ge, Wentao, et al.
Published: (2025)
by: Ge, Wentao, et al.
Published: (2025)
Automatic Item Generation for Personality Situational Judgment Tests with Large Language Models
by: Li, Chang-Jin, et al.
Published: (2024)
by: Li, Chang-Jin, et al.
Published: (2024)
From Instructor to Collaborator: What a 90-Participant Study Reveals about Human-Agent Collaboration in a Mobile Serious Game
by: Korre, Danai
Published: (2026)
by: Korre, Danai
Published: (2026)
AutoTRIZ: Automating Engineering Innovation with TRIZ and Large Language Models
by: Jiang, Shuo, et al.
Published: (2024)
by: Jiang, Shuo, et al.
Published: (2024)
The Need for Guardrails with Large Language Models in Medical Safety-Critical Settings: An Artificial Intelligence Application in the Pharmacovigilance Ecosystem
by: Hakim, Joe B, et al.
Published: (2024)
by: Hakim, Joe B, et al.
Published: (2024)
ReLeVAnT: Relevance Lexical Vectors for Accurate Legal Text Classification
by: Gakhar, Ishaan, et al.
Published: (2026)
by: Gakhar, Ishaan, et al.
Published: (2026)
Exploring the Structure of AI-Induced Language Change in Scientific English
by: Galpin, Riley, et al.
Published: (2025)
by: Galpin, Riley, et al.
Published: (2025)
A Llama walks into the 'Bar': Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam
by: Fernandes, Rean, et al.
Published: (2025)
by: Fernandes, Rean, et al.
Published: (2025)
Teaching a Language Model to Speak the Language of Tools
by: Emanuilov, Simeon
Published: (2025)
by: Emanuilov, Simeon
Published: (2025)
LLM-ReSum: A Framework for LLM Reflective Summarization through Self-Evaluation
by: Nguyen, Huyen, et al.
Published: (2026)
by: Nguyen, Huyen, et al.
Published: (2026)
Similar Items
-
Towards Conditioning Clinical Text Generation for User Control
by: Koraş, Osman Alperen, et al.
Published: (2025) -
Multi-Hierarchical Feature Detection for Large Language Model Generated Text
by: Zhang, Luyan, et al.
Published: (2025) -
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date overview
by: Han, Lifeng, et al.
Published: (2016) -
HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs
by: Cherif, Ahmed
Published: (2026) -
An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs
by: Zhu, Qian, et al.
Published: (2026)