Saved in:
| Main Authors: | Lu, Pengcheng, Poesio, Massimo |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.10696 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A LLM Benchmark based on the Minecraft Builder Dialog Agent Task
by: Madge, Chris, et al.
Published: (2024)
by: Madge, Chris, et al.
Published: (2024)
Review of coreference resolution in English and Persian
by: Mohammadi, Hassan Haji, et al.
Published: (2022)
by: Mohammadi, Hassan Haji, et al.
Published: (2022)
Understanding The Effect Of Temperature On Alignment With Human Opinions
by: Pavlovic, Maja, et al.
Published: (2024)
by: Pavlovic, Maja, et al.
Published: (2024)
Large Language Models as Minecraft Agents
by: Madge, Chris, et al.
Published: (2024)
by: Madge, Chris, et al.
Published: (2024)
Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains
by: Liu, Ming, et al.
Published: (2025)
by: Liu, Ming, et al.
Published: (2025)
The Effectiveness of LLMs as Annotators: A Comparative Overview and Empirical Analysis of Direct Representation
by: Pavlovic, Maja, et al.
Published: (2024)
by: Pavlovic, Maja, et al.
Published: (2024)
Can LLMs Detect Ambiguous Plural Reference? An Analysis of Split-Antecedent and Mereological Reference
by: Anh, Dang, et al.
Published: (2025)
by: Anh, Dang, et al.
Published: (2025)
Referential ambiguity and clarification requests: comparing human and LLM behaviour
by: Madge, Chris, et al.
Published: (2025)
by: Madge, Chris, et al.
Published: (2025)
Grounded Misunderstandings in Asymmetric Dialogue: A Perspectivist Annotation Scheme for MapTask
by: Li, Nan, et al.
Published: (2025)
by: Li, Nan, et al.
Published: (2025)
An Assessment of Human vs. Model Uncertainty in Soft-Label Learning and Calibration
by: Pavlovic, Maja, et al.
Published: (2026)
by: Pavlovic, Maja, et al.
Published: (2026)
Extending Activation Steering to Broad Skills and Multiple Behaviours
by: van der Weij, Teun, et al.
Published: (2024)
by: van der Weij, Teun, et al.
Published: (2024)
Improving LLMs' Learning for Coreference Resolution
by: Gan, Yujian, et al.
Published: (2025)
by: Gan, Yujian, et al.
Published: (2025)
Human Label Variation in Implicit Discourse Relation Recognition
by: Yung, Frances, et al.
Published: (2026)
by: Yung, Frances, et al.
Published: (2026)
ClarQ-LLM: A Benchmark for Models Clarifying and Requesting Information in Task-Oriented Dialog
by: Gan, Yujian, et al.
Published: (2024)
by: Gan, Yujian, et al.
Published: (2024)
Assessing the Reliability of LLMs Annotations in the Context of Demographic Bias and Model Explanation
by: Mohammadi, Hadi, et al.
Published: (2025)
by: Mohammadi, Hadi, et al.
Published: (2025)
Making Dialogue Grounding Data Rich: A Three-Tier Data Synthesis Framework for Generalized Referring Expression Comprehension
by: Shao, Juexi, et al.
Published: (2025)
by: Shao, Juexi, et al.
Published: (2025)
MDC-R: The Minecraft Dialogue Corpus with Reference
by: Madge, Chris, et al.
Published: (2025)
by: Madge, Chris, et al.
Published: (2025)
Matching domain experts by training from scratch on domain knowledge
by: Luo, Xiaoliang, et al.
Published: (2024)
by: Luo, Xiaoliang, et al.
Published: (2024)
LeWiDi-2025 at NLPerspectives: Third Edition of the Learning with Disagreements Shared Task
by: Leonardelli, Elisa, et al.
Published: (2025)
by: Leonardelli, Elisa, et al.
Published: (2025)
Augmenting speech transcripts of VR recordings with gaze, pointing, and visual context for multimodal coreference resolution
by: Bovo, Riccardo, et al.
Published: (2025)
by: Bovo, Riccardo, et al.
Published: (2025)
Large language models as oracles for instantiating ontologies with domain-specific knowledge
by: Ciatto, Giovanni, et al.
Published: (2024)
by: Ciatto, Giovanni, et al.
Published: (2024)
MLRIP: Pre-training a military language representation model with informative factual knowledge and professional knowledge base
by: Li, Hui, et al.
Published: (2022)
by: Li, Hui, et al.
Published: (2022)
Seeded Poisson Factorization: leveraging domain knowledge to fit topic models
by: Prostmaier, Bernd, et al.
Published: (2025)
by: Prostmaier, Bernd, et al.
Published: (2025)
Question answering system of bridge design specification based on large language model
by: Zhang, Leye, et al.
Published: (2024)
by: Zhang, Leye, et al.
Published: (2024)
Emission-GPT: A domain-specific language model agent for knowledge retrieval, emission inventory and data analysis
by: Ye, Jiashu, et al.
Published: (2025)
by: Ye, Jiashu, et al.
Published: (2025)
Reasoning based on symbolic and parametric knowledge bases: a survey
by: Xu, Mayi, et al.
Published: (2025)
by: Xu, Mayi, et al.
Published: (2025)
Math anxiety and associative knowledge structure are entwined in psychology students but not in Large Language Models like GPT-3.5 and GPT-4o
by: Ciringione, Luciana, et al.
Published: (2025)
by: Ciringione, Luciana, et al.
Published: (2025)
Topic-to-essay generation with knowledge-based content selection
by: Wang, Jieyong, et al.
Published: (2024)
by: Wang, Jieyong, et al.
Published: (2024)
RAG-Boost: Retrieval-Augmented Generation Enhanced LLM-based Speech Recognition
by: Wang, Pengcheng, et al.
Published: (2025)
by: Wang, Pengcheng, et al.
Published: (2025)
cantnlp@DravidianLangTech 2026: organic domain adaptation improves multi-class hope speech detection in Tulu
by: Li, Andrew, et al.
Published: (2026)
by: Li, Andrew, et al.
Published: (2026)
Towards Better Graph-based Cross-document Relation Extraction via Non-bridge Entity Enhancement and Prediction Debiasing
by: Yue, Hao, et al.
Published: (2024)
by: Yue, Hao, et al.
Published: (2024)
Global joint models for coreference resolution and named entity classification
by: Pascal Denis
Published: (2009)
by: Pascal Denis
Published: (2009)
Topic Coverage-based Demonstration Retrieval for In-Context Learning
by: Kweon, Wonbin, et al.
Published: (2025)
by: Kweon, Wonbin, et al.
Published: (2025)
RITFIS: Robust input testing framework for LLMs-based intelligent software
by: Xiao, Mingxuan, et al.
Published: (2024)
by: Xiao, Mingxuan, et al.
Published: (2024)
Performance of large language models in numerical vs. semantic medical knowledge: Benchmarking on evidence-based Q&As
by: Avnat, Eden, et al.
Published: (2024)
by: Avnat, Eden, et al.
Published: (2024)
Adapter-based Selective Knowledge Distillation for Federated Multi-domain Meeting Summarization
by: Feng, Xiachong, et al.
Published: (2023)
by: Feng, Xiachong, et al.
Published: (2023)
Augmenting Biomedical Named Entity Recognition with General-domain Resources
by: Yin, Yu, et al.
Published: (2024)
by: Yin, Yu, et al.
Published: (2024)
Structure-Augmented Reasoning Generation
by: Parekh, Jash Rajesh, et al.
Published: (2025)
by: Parekh, Jash Rajesh, et al.
Published: (2025)
Harmony in Diversity: Multi-domain Contrastive Policy Optimization for Large Reasoning Models
by: Yu, Zongji, et al.
Published: (2026)
by: Yu, Zongji, et al.
Published: (2026)
Multi-domain Multilingual Sentiment Analysis in Industry: Predicting Aspect-based Opinion Quadruples
by: White, Benjamin, et al.
Published: (2025)
by: White, Benjamin, et al.
Published: (2025)
Similar Items
-
A LLM Benchmark based on the Minecraft Builder Dialog Agent Task
by: Madge, Chris, et al.
Published: (2024) -
Review of coreference resolution in English and Persian
by: Mohammadi, Hassan Haji, et al.
Published: (2022) -
Understanding The Effect Of Temperature On Alignment With Human Opinions
by: Pavlovic, Maja, et al.
Published: (2024) -
Large Language Models as Minecraft Agents
by: Madge, Chris, et al.
Published: (2024) -
Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains
by: Liu, Ming, et al.
Published: (2025)