Library Catalog

Bibliographic Details
Main authors:	Chongbang, Tangsang; Shrestha, Pranesh Pyara; Sarki, Amrit; Jaiswal, Anku
Format:	Preprint
Published:	2026
Subjects:	Computation and Language; Artificial Intelligence; Machine Learning; I.2.7; I.2.1
Online access:	https://arxiv.org/abs/2602.21647

Similar items

HalluScan: A Systematic Benchmark for Detecting and Mitigating Hallucinations in Instruction-Following LLMs
by: Cherif, Ahmed
Published: (2026)

Low-Resource Neural Machine Translation Using Recurrent Neural Networks and Transfer Learning: A Case Study on English-to-Igbo
by: Ekle, Ocheme Anthony, et al.
Published: (2025)

Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
by: Charles, Richard M., et al.
Published: (2025)

LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts
by: Hashemi, Helia, et al.
Published: (2024)

Open-TI: Open Traffic Intelligence with Augmented Language Model
by: Da, Longchao, et al.
Published: (2023)

Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games
by: Wu, Dekun, et al.
Published: (2023)

Evaluating Large Language Models on Historical Health Crisis Knowledge in Resource-Limited Settings: A Hybrid Multi-Metric Study
by: Hasan, Mohammed Rakibul
Published: (2026)

Towards Conditioning Clinical Text Generation for User Control
by: Koraş, Osman Alperen, et al.
Published: (2025)

FlexQuant: A Flexible and Efficient Dynamic Precision Switching Framework for LLM Quantization
by: Liu, Fangxin, et al.
Published: (2025)

Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date overview
by: Han, Lifeng, et al.
Published: (2016)

Multi-Hierarchical Feature Detection for Large Language Model Generated Text
by: Zhang, Luyan, et al.
Published: (2025)

An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs
by: Zhu, Qian, et al.
Published: (2026)

Introducing Brain-like Concepts to Embodied Hand-crafted Dialog Management System
by: Joublin, Frank, et al.
Published: (2024)

Exploring the Structure of AI-Induced Language Change in Scientific English
by: Galpin, Riley, et al.
Published: (2025)

GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation
by: Ghandi, Taraneh, et al.
Published: (2026)

elsciRL: Integrating Language Solutions into Reinforcement Learning Problem Settings
by: Osborne, Philip, et al.
Published: (2025)

LongSumEval: Question-Answering Based Evaluation and Feedback-Driven Refinement for Long Document Summarization
by: Nguyen, Huyen, et al.
Published: (2026)

FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation
by: Hildebrand, Samuel, et al.
Published: (2025)

EvoIdeator: Evolving Scientific Ideas through Checklist-Grounded Reinforcement Learning
by: Sauter, Andreas, et al.
Published: (2026)

Leveraging Large Language Models to Extract and Translate Medical Information in Doctors' Notes for Health Records and Diagnostic Billing Codes
by: Hartnett, Peter, et al.
Published: (2026)

The Arrival of AGI? When Expert Personas Exceed Expert Benchmarks
by: Mullens, Drake, et al.
Published: (2026)

Attention-based sequential recommendation system using multimodal data
by: Oh, Hyungtaik, et al.
Published: (2024)

From Search to Reasoning: A Five-Level RAG Capability Framework for Enterprise Data
by: Gill, Gurbinder, et al.
Published: (2025)

VeriPlan: Integrating Formal Verification and LLMs into End-User Planning
by: Lee, Christine, et al.
Published: (2025)

Large Language Models for Combinatorial Optimization of Design Structure Matrix
by: Jiang, Shuo, et al.
Published: (2025)

Automating the Deep Space Network Data Systems; A Case Study in Adaptive Anomaly Detection through Agentic AI
by: Chou, Evan J., et al.
Published: (2025)

LegalCheck: Retrieval- and Context-Augmented Generation for Drafting Municipal Legal Advice Letters
by: van der Meer, Virgill, et al.
Published: (2026)

AskSport: Web Application for Sports Question-Answering
by: Onofre, Enzo B, et al.
Published: (2025)

Social and Ethical Risks Posed by General-Purpose LLMs for Settling Newcomers in Canada
by: Nejadgholi, Isar, et al.
Published: (2024)

PestMA: LLM-based Multi-Agent System for Informed Pest Management
by: Shi, Hongrui, et al.
Published: (2025)

The Impact of Generative Artificial Intelligence on Ideation and the performance of Innovation Teams (Preprint)
by: Gindert, Michael, et al.
Published: (2024)

CSSDM Ontology to Enable Continuity of Care Data Interoperability
by: Das, Subhashis, et al.
Published: (2025)

How to Evaluate Medical AI
by: Kopanichuk, Ilia, et al.
Published: (2025)

STLLM-DF: A Spatial-Temporal Large Language Model with Diffusion for Enhanced Multi-Mode Traffic System Forecasting
by: Shao, Zhiqi, et al.
Published: (2024)

Improving the Capabilities of Large Language Model Based Marketing Analytics Copilots With Semantic Search And Fine-Tuning
by: Gao, Yilin, et al.
Published: (2024)

Comparing the Performance of LLMs in RAG-based Question-Answering: A Case Study in Computer Science Literature
by: Dayarathne, Ranul, et al.
Published: (2025)

Towards Safer Chatbots: Automated Policy Compliance Evaluation of Custom GPTs
by: Rodriguez, David, et al.
Published: (2025)

ReTreVal: Reasoning Tree with Validation -- A Hybrid Framework for Enhanced LLM Multi-Step Reasoning
by: HS, Abhishek, et al.
Published: (2026)

FREYR: A Framework for Recognizing and Executing Your Requests
by: Gallotta, Roberto, et al.
Published: (2025)

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
by: Zhu, Yuxuan, et al.
Published: (2025)