Saved in:
| Main Authors: | Go, Gregory Hok Tjoan, Ly, Khang, Søgaard, Anders, Tabatabaei, Amin, de Rijke, Maarten, Chen, Xinyi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.05138 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AnalyticsGPT: An LLM Workflow for Scientometric Question Answering
by: Ly, Khang, et al.
Published: (2026)
by: Ly, Khang, et al.
Published: (2026)
Lost at the Beginning of Reasoning
by: Liao, Baohao, et al.
Published: (2025)
by: Liao, Baohao, et al.
Published: (2025)
Can Small Agents Collaborate to Beat a Single Large Language Model?
by: Żywot, Agata, et al.
Published: (2026)
by: Żywot, Agata, et al.
Published: (2026)
Multi-Step Semantic Reasoning in Generative Retrieval
by: Dong, Steven, et al.
Published: (2026)
by: Dong, Steven, et al.
Published: (2026)
A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
by: Wang, Zihan, et al.
Published: (2025)
by: Wang, Zihan, et al.
Published: (2025)
Revisiting the LiRA Membership Inference Attack Under Realistic Assumptions
by: Jebreel, Najeeb, et al.
Published: (2026)
by: Jebreel, Najeeb, et al.
Published: (2026)
Evaluation of Attribution Bias in Generator-Aware Retrieval-Augmented Large Language Models
by: Abolghasemi, Amin, et al.
Published: (2024)
by: Abolghasemi, Amin, et al.
Published: (2024)
What if Othello-Playing Language Models Could See?
by: Chen, Xinyi, et al.
Published: (2025)
by: Chen, Xinyi, et al.
Published: (2025)
Evaluation Revisited: A Taxonomy of Evaluation Concerns in Natural Language Processing
by: Dhar, Ruchira, et al.
Published: (2026)
by: Dhar, Ruchira, et al.
Published: (2026)
Concept Space Alignment in Multilingual LLMs
by: Peng, Qiwei, et al.
Published: (2024)
by: Peng, Qiwei, et al.
Published: (2024)
Revisiting the Othello World Model Hypothesis
by: Yuan, Yifei, et al.
Published: (2025)
by: Yuan, Yifei, et al.
Published: (2025)
Measuring Bias in a Ranked List using Term-based Representations
by: Abolghasemi, Amin, et al.
Published: (2024)
by: Abolghasemi, Amin, et al.
Published: (2024)
Factual Consistency of Multilingual Pretrained Language Models
by: Fierro, Constanza, et al.
Published: (2022)
by: Fierro, Constanza, et al.
Published: (2022)
LiRA: Light-Robust Adversary for Model-based Reinforcement Learning in Real World
by: Kobayashi, Taisuke
Published: (2024)
by: Kobayashi, Taisuke
Published: (2024)
Exponential-Family Membership Inference: From LiRA and RMIA to BaVarIA
by: Brännvall, Rickard
Published: (2026)
by: Brännvall, Rickard
Published: (2026)
QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs
by: Zhang, Weijia, et al.
Published: (2024)
by: Zhang, Weijia, et al.
Published: (2024)
Multi-Agent Collaborative Framework For Math Problem Generation
by: Karbasi, Kia, et al.
Published: (2025)
by: Karbasi, Kia, et al.
Published: (2025)
The Use of Readability Metrics in Legal Text: A Systematic Literature Review
by: Han, Yu, et al.
Published: (2024)
by: Han, Yu, et al.
Published: (2024)
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems
by: Abolghasemi, Amin, et al.
Published: (2024)
by: Abolghasemi, Amin, et al.
Published: (2024)
From Words to Worlds: Compositionality for Cognitive Architectures
by: Dhar, Ruchira, et al.
Published: (2024)
by: Dhar, Ruchira, et al.
Published: (2024)
MACPO: Weak-to-Strong Alignment via Multi-Agent Contrastive Preference Optimization
by: Lyu, Yougang, et al.
Published: (2024)
by: Lyu, Yougang, et al.
Published: (2024)
Does Instruction Tuning Make LLMs More Consistent?
by: Fierro, Constanza, et al.
Published: (2024)
by: Fierro, Constanza, et al.
Published: (2024)
Article Classification with Graph Neural Networks and Multigraphs
by: Ly, Khang, et al.
Published: (2023)
by: Ly, Khang, et al.
Published: (2023)
Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs
by: Siro, Clemencia, et al.
Published: (2024)
by: Siro, Clemencia, et al.
Published: (2024)
AGENT-CQ: Automatic Generation and Evaluation of Clarifying Questions for Conversational Search with LLMs
by: Siro, Clemencia, et al.
Published: (2024)
by: Siro, Clemencia, et al.
Published: (2024)
Evaluating Adjective-Noun Compositionality in LLMs: Functional vs Representational Perspectives
by: Dhar, Ruchira, et al.
Published: (2026)
by: Dhar, Ruchira, et al.
Published: (2026)
MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning
by: Ren, Pengjie, et al.
Published: (2024)
by: Ren, Pengjie, et al.
Published: (2024)
The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models
by: Chen, Xinyi, et al.
Published: (2024)
by: Chen, Xinyi, et al.
Published: (2024)
SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval
by: Petcu, Roxana, et al.
Published: (2026)
by: Petcu, Roxana, et al.
Published: (2026)
A Parametric Memory Head for Continual Generative Retrieval
by: Mekonnen, Kidist Amde, et al.
Published: (2026)
by: Mekonnen, Kidist Amde, et al.
Published: (2026)
CorpusBrain++: A Continual Generative Pre-Training Framework for Knowledge-Intensive Language Tasks
by: Guo, Jiafeng, et al.
Published: (2024)
by: Guo, Jiafeng, et al.
Published: (2024)
Understanding Subword Compositionality of Large Language Models
by: Peng, Qiwei, et al.
Published: (2025)
by: Peng, Qiwei, et al.
Published: (2025)
Comprehensive Reassessment of Large-Scale Evaluation Outcomes in LLMs: A Multifaceted Statistical Approach
by: Sun, Kun, et al.
Published: (2024)
by: Sun, Kun, et al.
Published: (2024)
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems
by: Siro, Clemencia, et al.
Published: (2024)
by: Siro, Clemencia, et al.
Published: (2024)
Table Question Answering for Low-resourced Indic Languages
by: Pal, Vaishali, et al.
Published: (2024)
by: Pal, Vaishali, et al.
Published: (2024)
How Do Multilingual Language Models Remember Facts?
by: Fierro, Constanza, et al.
Published: (2024)
by: Fierro, Constanza, et al.
Published: (2024)
Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering
by: Yuan, Yifei, et al.
Published: (2024)
by: Yuan, Yifei, et al.
Published: (2024)
WebQAmGaze: A Multilingual Webcam Eye-Tracking-While-Reading Dataset
by: Ribeiro, Tiago, et al.
Published: (2023)
by: Ribeiro, Tiago, et al.
Published: (2023)
AQA: Adaptive Question Answering in a Society of LLMs via Contextual Multi-Armed Bandit
by: Hoveyda, Mohanna, et al.
Published: (2024)
by: Hoveyda, Mohanna, et al.
Published: (2024)
Information Discovery in e-Commerce
by: Ren, Zhaochun, et al.
Published: (2024)
by: Ren, Zhaochun, et al.
Published: (2024)
Similar Items
-
AnalyticsGPT: An LLM Workflow for Scientometric Question Answering
by: Ly, Khang, et al.
Published: (2026) -
Lost at the Beginning of Reasoning
by: Liao, Baohao, et al.
Published: (2025) -
Can Small Agents Collaborate to Beat a Single Large Language Model?
by: Żywot, Agata, et al.
Published: (2026) -
Multi-Step Semantic Reasoning in Generative Retrieval
by: Dong, Steven, et al.
Published: (2026) -
A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
by: Wang, Zihan, et al.
Published: (2025)