:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Takahashi, Kosuke, Omi, Takahiro, Arima, Kosuke, Ishigaki, Tatsuya
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence 68T50 I.2
Online Access:	https://arxiv.org/abs/2404.08262
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports
by: Aida, Hayato, et al.
Published: (2025)

Exploring Design of Multi-Agent LLM Dialogues for Research Ideation
by: Ueda, Keisuke, et al.
Published: (2025)

Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
by: Haque, Md. Asraful, et al.
Published: (2026)

Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
by: Auriemma, Serena, et al.
Published: (2024)

From Prompt to Graph: Comparing LLM-Based Information Extraction Strategies in Domain-Specific Ontology Development
by: Liu, Xuan, et al.
Published: (2026)

LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain
by: Niklaus, Joel, et al.
Published: (2024)

LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
by: Niklaus, Joel, et al.
Published: (2023)

Can Out-of-Distribution Evaluations Uncover Reliance on Shortcuts? A Case Study in Question Answering
by: Štefánik, Michal, et al.
Published: (2025)

The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)
by: Wang, Zihao, et al.
Published: (2025)

LLM-Assisted Crisis Management: Building Advanced LLM Platforms for Effective Emergency Response and Public Collaboration
by: Otal, Hakan T., et al.
Published: (2024)

MetaCheckGPT -- A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models
by: Mehta, Rahul, et al.
Published: (2024)

Unsolvability Ceiling in Multi-LLM Routing: An Empirical Study of Evaluation Artifacts
by: Garg, Saloni, et al.
Published: (2026)

Isolating LLM Lexical Bias: A Curation-Free Triangulated Metric for Preference-Stage Learning
by: Ming, Xiaoyang, et al.
Published: (2026)

HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent
by: Xu, Weijie, et al.
Published: (2024)

Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts
by: Xu, Beining, et al.
Published: (2025)

TSDS: Data Selection for Task-Specific Model Finetuning
by: Liu, Zifan, et al.
Published: (2024)

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
by: Orgad, Hadas, et al.
Published: (2024)

Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering
by: Zong, Chang, et al.
Published: (2024)

ART: Adaptive Response Tuning Framework -- A Multi-Agent Tournament-Based Approach to LLM Response Optimization
by: Khan, Omer Jauhar
Published: (2025)

A Study on Bias Detection and Classification in Natural Language Processing
by: Evans, Ana Sofia, et al.
Published: (2024)

CATER: Leveraging LLM to Pioneer a Multidimensional, Reference-Independent Paradigm in Translation Quality Evaluation
by: IIDA, Kurando, et al.
Published: (2024)

Multi-Model Synthetic Training for Mission-Critical Small Language Models
by: Platt, Nolan, et al.
Published: (2025)

Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis
by: Huang, Donghao, et al.
Published: (2026)

Cognitive Workspace: Active Memory Management for LLMs -- An Empirical Study of Functional Infinite Context
by: An, Tao
Published: (2025)

Evaluation of RAG Metrics for Question Answering in the Telecom Domain
by: Roychowdhury, Sujoy, et al.
Published: (2024)

Transforming and Combining Rewards for Aligning Large Language Models
by: Wang, Zihao, et al.
Published: (2024)

Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology
by: Da, Longchao, et al.
Published: (2025)

AIPsy-Affect: A Keyword-Free Clinical Stimulus Battery for Mechanistic Interpretability of Emotion in Language Models
by: Keeman, Michael
Published: (2026)

Model Misalignment and Language Change: Traces of AI-Associated Language in Unscripted Spoken English
by: Anderson, Bryce, et al.
Published: (2025)

The CLEF-2025 CheckThat! Lab: Subjectivity, Fact-Checking, Claim Normalization, and Retrieval
by: Alam, Firoj, et al.
Published: (2025)

Textual Data Bias Detection and Mitigation -- An Extensible Pipeline with Experimental Evaluation
by: Görge, Rebekka, et al.
Published: (2025)

lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation
by: Tikhonov, Alexey, et al.
Published: (2026)

Large Language Models Report Subjective Experience Under Self-Referential Processing
by: Berg, Cameron, et al.
Published: (2025)

Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference
by: Mathew, Aby Mammen
Published: (2026)

No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models
by: Sela, Omer
Published: (2026)

PennyCoder: Efficient Domain-Specific LLMs for PennyLane-Based Quantum Code Generation
by: Basit, Abdul, et al.
Published: (2025)

A Comprehensive Survey of Compression Algorithms for Language Models
by: Park, Seungcheol, et al.
Published: (2024)

MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
by: Marcuzzo, Matteo, et al.
Published: (2025)

Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods
by: Yu, Haeun, et al.
Published: (2024)

A Graph-based Approach for Multi-Modal Question Answering from Flowcharts in Telecom Documents
by: Soman, Sumit, et al.
Published: (2025)