Saved in:
| Main Authors: | Takahashi, Kosuke, Omi, Takahiro, Arima, Kosuke, Ishigaki, Tatsuya |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.08262 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports
by: Aida, Hayato, et al.
Published: (2025)
by: Aida, Hayato, et al.
Published: (2025)
Exploring Design of Multi-Agent LLM Dialogues for Research Ideation
by: Ueda, Keisuke, et al.
Published: (2025)
by: Ueda, Keisuke, et al.
Published: (2025)
Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
by: Haque, Md. Asraful, et al.
Published: (2026)
by: Haque, Md. Asraful, et al.
Published: (2026)
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
by: Auriemma, Serena, et al.
Published: (2024)
by: Auriemma, Serena, et al.
Published: (2024)
From Prompt to Graph: Comparing LLM-Based Information Extraction Strategies in Domain-Specific Ontology Development
by: Liu, Xuan, et al.
Published: (2026)
by: Liu, Xuan, et al.
Published: (2026)
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain
by: Niklaus, Joel, et al.
Published: (2024)
by: Niklaus, Joel, et al.
Published: (2024)
LEXTREME: A Multi-Lingual and Multi-Task Benchmark for the Legal Domain
by: Niklaus, Joel, et al.
Published: (2023)
by: Niklaus, Joel, et al.
Published: (2023)
Can Out-of-Distribution Evaluations Uncover Reliance on Shortcuts? A Case Study in Question Answering
by: Štefánik, Michal, et al.
Published: (2025)
by: Štefánik, Michal, et al.
Published: (2025)
The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)
by: Wang, Zihao, et al.
Published: (2025)
by: Wang, Zihao, et al.
Published: (2025)
LLM-Assisted Crisis Management: Building Advanced LLM Platforms for Effective Emergency Response and Public Collaboration
by: Otal, Hakan T., et al.
Published: (2024)
by: Otal, Hakan T., et al.
Published: (2024)
MetaCheckGPT -- A Multi-task Hallucination Detector Using LLM Uncertainty and Meta-models
by: Mehta, Rahul, et al.
Published: (2024)
by: Mehta, Rahul, et al.
Published: (2024)
Unsolvability Ceiling in Multi-LLM Routing: An Empirical Study of Evaluation Artifacts
by: Garg, Saloni, et al.
Published: (2026)
by: Garg, Saloni, et al.
Published: (2026)
Isolating LLM Lexical Bias: A Curation-Free Triangulated Metric for Preference-Stage Learning
by: Ming, Xiaoyang, et al.
Published: (2026)
by: Ming, Xiaoyang, et al.
Published: (2026)
HR-MultiWOZ: A Task Oriented Dialogue (TOD) Dataset for HR LLM Agent
by: Xu, Weijie, et al.
Published: (2024)
by: Xu, Weijie, et al.
Published: (2024)
Understanding the Effects of RLHF on the Quality and Detectability of LLM-Generated Texts
by: Xu, Beining, et al.
Published: (2025)
by: Xu, Beining, et al.
Published: (2025)
TSDS: Data Selection for Task-Specific Model Finetuning
by: Liu, Zifan, et al.
Published: (2024)
by: Liu, Zifan, et al.
Published: (2024)
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
by: Orgad, Hadas, et al.
Published: (2024)
by: Orgad, Hadas, et al.
Published: (2024)
Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering
by: Zong, Chang, et al.
Published: (2024)
by: Zong, Chang, et al.
Published: (2024)
ART: Adaptive Response Tuning Framework -- A Multi-Agent Tournament-Based Approach to LLM Response Optimization
by: Khan, Omer Jauhar
Published: (2025)
by: Khan, Omer Jauhar
Published: (2025)
A Study on Bias Detection and Classification in Natural Language Processing
by: Evans, Ana Sofia, et al.
Published: (2024)
by: Evans, Ana Sofia, et al.
Published: (2024)
CATER: Leveraging LLM to Pioneer a Multidimensional, Reference-Independent Paradigm in Translation Quality Evaluation
by: IIDA, Kurando, et al.
Published: (2024)
by: IIDA, Kurando, et al.
Published: (2024)
Multi-Model Synthetic Training for Mission-Critical Small Language Models
by: Platt, Nolan, et al.
Published: (2025)
by: Platt, Nolan, et al.
Published: (2025)
Task Complexity Matters: An Empirical Study of Reasoning in LLMs for Sentiment Analysis
by: Huang, Donghao, et al.
Published: (2026)
by: Huang, Donghao, et al.
Published: (2026)
Cognitive Workspace: Active Memory Management for LLMs -- An Empirical Study of Functional Infinite Context
by: An, Tao
Published: (2025)
by: An, Tao
Published: (2025)
Evaluation of RAG Metrics for Question Answering in the Telecom Domain
by: Roychowdhury, Sujoy, et al.
Published: (2024)
by: Roychowdhury, Sujoy, et al.
Published: (2024)
Transforming and Combining Rewards for Aligning Large Language Models
by: Wang, Zihao, et al.
Published: (2024)
by: Wang, Zihao, et al.
Published: (2024)
Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology
by: Da, Longchao, et al.
Published: (2025)
by: Da, Longchao, et al.
Published: (2025)
AIPsy-Affect: A Keyword-Free Clinical Stimulus Battery for Mechanistic Interpretability of Emotion in Language Models
by: Keeman, Michael
Published: (2026)
by: Keeman, Michael
Published: (2026)
Model Misalignment and Language Change: Traces of AI-Associated Language in Unscripted Spoken English
by: Anderson, Bryce, et al.
Published: (2025)
by: Anderson, Bryce, et al.
Published: (2025)
The CLEF-2025 CheckThat! Lab: Subjectivity, Fact-Checking, Claim Normalization, and Retrieval
by: Alam, Firoj, et al.
Published: (2025)
by: Alam, Firoj, et al.
Published: (2025)
Textual Data Bias Detection and Mitigation -- An Extensible Pipeline with Experimental Evaluation
by: Görge, Rebekka, et al.
Published: (2025)
by: Görge, Rebekka, et al.
Published: (2025)
lmfaoooo at SemEval-2026 Task 1: Humor Is an Audience. Preference Modeling for Constrained Humor Generation
by: Tikhonov, Alexey, et al.
Published: (2026)
by: Tikhonov, Alexey, et al.
Published: (2026)
Large Language Models Report Subjective Experience Under Self-Referential Processing
by: Berg, Cameron, et al.
Published: (2025)
by: Berg, Cameron, et al.
Published: (2025)
Product-of-Experts Training Reduces Dataset Artifacts in Natural Language Inference
by: Mathew, Aby Mammen
Published: (2026)
by: Mathew, Aby Mammen
Published: (2026)
No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models
by: Sela, Omer
Published: (2026)
by: Sela, Omer
Published: (2026)
PennyCoder: Efficient Domain-Specific LLMs for PennyLane-Based Quantum Code Generation
by: Basit, Abdul, et al.
Published: (2025)
by: Basit, Abdul, et al.
Published: (2025)
A Comprehensive Survey of Compression Algorithms for Language Models
by: Park, Seungcheol, et al.
Published: (2024)
by: Park, Seungcheol, et al.
Published: (2024)
MORABLES: A Benchmark for Assessing Abstract Moral Reasoning in LLMs with Fables
by: Marcuzzo, Matteo, et al.
Published: (2025)
by: Marcuzzo, Matteo, et al.
Published: (2025)
Revealing the Parametric Knowledge of Language Models: A Unified Framework for Attribution Methods
by: Yu, Haeun, et al.
Published: (2024)
by: Yu, Haeun, et al.
Published: (2024)
A Graph-based Approach for Multi-Modal Question Answering from Flowcharts in Telecom Documents
by: Soman, Sumit, et al.
Published: (2025)
by: Soman, Sumit, et al.
Published: (2025)
Similar Items
-
Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports
by: Aida, Hayato, et al.
Published: (2025) -
Exploring Design of Multi-Agent LLM Dialogues for Research Ideation
by: Ueda, Keisuke, et al.
Published: (2025) -
Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval
by: Haque, Md. Asraful, et al.
Published: (2026) -
Prompting Encoder Models for Zero-Shot Classification: A Cross-Domain Study in Italian
by: Auriemma, Serena, et al.
Published: (2024) -
From Prompt to Graph: Comparing LLM-Based Information Extraction Strategies in Domain-Specific Ontology Development
by: Liu, Xuan, et al.
Published: (2026)