| Main Authors: | Wischounig, Laurin, Abdallah, Abdelrahman, Jatowt, Adam |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.18005 |
Similar Items
Are LLM-Based Retrievers Worth Their Cost? An Empirical Study of Efficiency, Robustness, and Reasoning Overhead
by: Abdallah, Abdelrahman, et al.
Published: (2026)
SustainableQA: A Comprehensive Question Answering Dataset for Corporate Sustainability and EU Taxonomy Reporting
by: Ali, Mohammed, et al.
Published: (2025)
TEMPO: A Realistic Multi-Domain Benchmark for Temporal Reasoning-Intensive Retrieval
by: Abdallah, Abdelrahman, et al.
Published: (2026)
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
by: Abdallah, Abdelrahman, et al.
Published: (2025)
TempRetriever: Fusion-based Temporal Dense Passage Retrieval for Time-Sensitive Questions
by: Abdallah, Abdelrahman, et al.
Published: (2025)
RECOR: Reasoning-focused Multi-turn Conversational Retrieval Benchmark
by: Ali, Mohammed, et al.
Published: (2026)
It's High Time: A Survey of Temporal Question Answering
by: Piryani, Bhawna, et al.
Published: (2025)
BracketRank: Large Language Model Document Ranking via Reasoning-based Competitive Elimination
by: Abdallah, Abdelrahman, et al.
Published: (2026)
RankArena: A Unified Platform for Evaluating Retrieval, Reranking and RAG with Human and LLM Feedback
by: Abdallah, Abdelrahman, et al.
Published: (2025)
HintEval: A Comprehensive Framework for Hint Generation and Evaluation for Questions
by: Mozafari, Jamshid, et al.
Published: (2025)
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores
by: Mozafari, Jamshid, et al.
Published: (2025)
DeAR: Dual-Stage Document Reranking with Reasoning Agents via LLM Distillation
by: Abdallah, Abdelrahman, et al.
Published: (2025)
Exploring Hint Generation Approaches in Open-Domain Question Answering
by: Mozafari, Jamshid, et al.
Published: (2024)
How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models
by: Abdallah, Abdelrahman, et al.
Published: (2025)
A Study into Investigating Temporal Robustness of LLMs
by: Wallat, Jonas, et al.
Published: (2025)
ArabicaQA: A Comprehensive Dataset for Arabic Question Answering
by: Abdallah, Abdelrahman, et al.
Published: (2024)
MM-BRIGHT: A Multi-Task Multimodal Benchmark for Reasoning-Intensive Retrieval
by: Abdallah, Abdelrahman, et al.
Published: (2026)
LLMTemporalComparator: A Tool for Analysing Differences in Temporal Adaptations of Large Language Models
by: Fritsch, Reinhard Friedrich, et al.
Published: (2024)
Analyzing the Role of Context in Forecasting with Large Language Models
by: Mutschlechner, Gerrit, et al.
Published: (2025)
Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction
by: Nako, Petraq, et al.
Published: (2025)
The Impact of International Collaborations with Highly Publishing Countries in Computer Science
by: Espes, Alberto Gomez, et al.
Published: (2025)
WikiHint: A Human-Annotated Dataset for Hint Ranking and Generation
by: Mozafari, Jamshid, et al.
Published: (2024)
Wisdom of the Crowds in Forecasting: Forecast Summarization for Supporting Future Event Prediction
by: Saha, Anisha, et al.
Published: (2025)
Negative Sampling in Recommendation: A Survey and Future Directions
by: Ma, Haokai, et al.
Published: (2024)
Context Convergence Improves Answering Inferential Questions
by: Mozafari, Jamshid, et al.
Published: (2026)
Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring
by: Mozafari, Jamshid, et al.
Published: (2026)
Evaluating Answer Reranking Strategies in Time-sensitive Question Answering
by: Kardan, Mehmet, et al.
Published: (2025)
HIVE: Query, Hypothesize, Verify An LLM Framework for Multimodal Reasoning-Intensive Retrieval
by: Abdalla, Mahmoud, et al.
Published: (2026)
TriSampler: A Better Negative Sampling Principle for Dense Retrieval
by: Yang, Zhen, et al.
Published: (2024)
PARSE: An Open-Domain Reasoning Question Answering Benchmark for Persian
by: Mozafari, Jamshid, et al.
Published: (2026)
Detecting Future-related Contexts of Entity Mentions
by: Prashar, Puneet, et al.
Published: (2025)
Reproducing NevIR: Negation in Neural Information Retrieval
by: Elsen, Coen van den, et al.
Published: (2025)
Multi-hop Question Answering
by: Mavi, Vaibhav, et al.
Published: (2022)
Inferential Question Answering
by: Mozafari, Jamshid, et al.
Published: (2026)
A Survey of Model Architectures in Information Retrieval
by: Xu, Zhichao, et al.
Published: (2025)
NevIR: Negation in Neural Information Retrieval
by: Weller, Orion, et al.
Published: (2023)
Logical Consistency is Vital: Neural-Symbolic Information Retrieval for Negative-Constraint Queries
by: Xu, Ganlin, et al.
Published: (2025)
Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI
by: Ghali, Mohammed-Khalil, et al.
Published: (2024)
A Survey of Generative Information Retrieval
by: Kuo, Tzu-Lin, et al.
Published: (2024)
Differentially Private Datastore Generation for Retrieval-Augmented Inference
by: Abouelenein, Abdelrahman, et al.
Published: (2026)