Saved in:
| Main Authors: | Fritsch, Reinhard Friedrich, Jatowt, Adam |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.04195 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Analyzing the Role of Context in Forecasting with Large Language Models
by: Mutschlechner, Gerrit, et al.
Published: (2025)
by: Mutschlechner, Gerrit, et al.
Published: (2025)
Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction
by: Nako, Petraq, et al.
Published: (2025)
by: Nako, Petraq, et al.
Published: (2025)
Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
BracketRank: Large Language Model Document Ranking via Reasoning-based Competitive Elimination
by: Abdallah, Abdelrahman, et al.
Published: (2026)
by: Abdallah, Abdelrahman, et al.
Published: (2026)
TEMPO: A Realistic Multi-Domain Benchmark for Temporal Reasoning-Intensive Retrieval
by: Abdallah, Abdelrahman, et al.
Published: (2026)
by: Abdallah, Abdelrahman, et al.
Published: (2026)
Negative Sampling Techniques in Information Retrieval: A Survey
by: Wischounig, Laurin, et al.
Published: (2026)
by: Wischounig, Laurin, et al.
Published: (2026)
SustainableQA: A Comprehensive Question Answering Dataset for Corporate Sustainability and EU Taxonomy Reporting
by: Ali, Mohammed, et al.
Published: (2025)
by: Ali, Mohammed, et al.
Published: (2025)
It's High Time: A Survey of Temporal Question Answering
by: Piryani, Bhawna, et al.
Published: (2025)
by: Piryani, Bhawna, et al.
Published: (2025)
Wrong Answers Can Also Be Useful: PlausibleQA -- A Large-Scale QA Dataset with Answer Plausibility Scores
by: Mozafari, Jamshid, et al.
Published: (2025)
by: Mozafari, Jamshid, et al.
Published: (2025)
The Impact of International Collaborations with Highly Publishing Countries in Computer Science
by: Espes, Alberto Gomez, et al.
Published: (2025)
by: Espes, Alberto Gomez, et al.
Published: (2025)
TempRetriever: Fusion-based Temporal Dense Passage Retrieval for Time-Sensitive Questions
by: Abdallah, Abdelrahman, et al.
Published: (2025)
by: Abdallah, Abdelrahman, et al.
Published: (2025)
WikiHint: A Human-Annotated Dataset for Hint Ranking and Generation
by: Mozafari, Jamshid, et al.
Published: (2024)
by: Mozafari, Jamshid, et al.
Published: (2024)
Wisdom of the Crowds in Forecasting: Forecast Summarization for Supporting Future Event Prediction
by: Saha, Anisha, et al.
Published: (2025)
by: Saha, Anisha, et al.
Published: (2025)
A Study into Investigating Temporal Robustness of LLMs
by: Wallat, Jonas, et al.
Published: (2025)
by: Wallat, Jonas, et al.
Published: (2025)
Are LLM-Based Retrievers Worth Their Cost? An Empirical Study of Efficiency, Robustness, and Reasoning Overhead
by: Abdallah, Abdelrahman, et al.
Published: (2026)
by: Abdallah, Abdelrahman, et al.
Published: (2026)
Context Convergence Improves Answering Inferential Questions
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
Evaluating Answer Reranking Strategies in Time-sensitive Question Answering
by: Kardan, Mehmet, et al.
Published: (2025)
by: Kardan, Mehmet, et al.
Published: (2025)
PARSE: An Open-Domain Reasoning Question Answering Benchmark for Persian
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
Detecting Future-related Contexts of Entity Mentions
by: Prashar, Puneet, et al.
Published: (2025)
by: Prashar, Puneet, et al.
Published: (2025)
HintEval: A Comprehensive Framework for Hint Generation and Evaluation for Questions
by: Mozafari, Jamshid, et al.
Published: (2025)
by: Mozafari, Jamshid, et al.
Published: (2025)
Multi-hop Question Answering
by: Mavi, Vaibhav, et al.
Published: (2022)
by: Mavi, Vaibhav, et al.
Published: (2022)
How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models
by: Abdallah, Abdelrahman, et al.
Published: (2025)
by: Abdallah, Abdelrahman, et al.
Published: (2025)
Exploring Hint Generation Approaches in Open-Domain Question Answering
by: Mozafari, Jamshid, et al.
Published: (2024)
by: Mozafari, Jamshid, et al.
Published: (2024)
Inferential Question Answering
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
DeAR: Dual-Stage Document Reranking with Reasoning Agents via LLM Distillation
by: Abdallah, Abdelrahman, et al.
Published: (2025)
by: Abdallah, Abdelrahman, et al.
Published: (2025)
RankArena: A Unified Platform for Evaluating Retrieval, Reranking and RAG with Human and LLM Feedback
by: Abdallah, Abdelrahman, et al.
Published: (2025)
by: Abdallah, Abdelrahman, et al.
Published: (2025)
RECOR: Reasoning-focused Multi-turn Conversational Retrieval Benchmark
by: Ali, Mohammed, et al.
Published: (2026)
by: Ali, Mohammed, et al.
Published: (2026)
Rankify: A Comprehensive Python Toolkit for Retrieval, Re-Ranking, and Retrieval-Augmented Generation
by: Abdallah, Abdelrahman, et al.
Published: (2025)
by: Abdallah, Abdelrahman, et al.
Published: (2025)
A Two-Stage Adaptation of Large Language Models for Text Ranking
by: Zhang, Longhui, et al.
Published: (2023)
by: Zhang, Longhui, et al.
Published: (2023)
Structural and Disentangled Adaptation of Large Vision Language Models for Multimodal Recommendation
by: Rao, Zhongtao, et al.
Published: (2025)
by: Rao, Zhongtao, et al.
Published: (2025)
Efficient Temporal-aware Matryoshka Adaptation for Temporal Information Retrieval
by: Huynh, Tuan-Luc, et al.
Published: (2026)
by: Huynh, Tuan-Luc, et al.
Published: (2026)
ArabicaQA: A Comprehensive Dataset for Arabic Question Answering
by: Abdallah, Abdelrahman, et al.
Published: (2024)
by: Abdallah, Abdelrahman, et al.
Published: (2024)
Temporal-Aware User Behaviour Simulation with Large Language Models for Recommender Systems
by: Wanyan, Xinye, et al.
Published: (2025)
by: Wanyan, Xinye, et al.
Published: (2025)
MassTool: A Multi-Task Search-Based Tool Retrieval Framework for Large Language Models
by: Lin, Jianghao, et al.
Published: (2025)
by: Lin, Jianghao, et al.
Published: (2025)
Towards Completeness-Oriented Tool Retrieval for Large Language Models
by: Qu, Changle, et al.
Published: (2024)
by: Qu, Changle, et al.
Published: (2024)
Tool Graph Retriever: Exploring Dependency Graph-based Tool Retrieval for Large Language Models
by: Gao, Linfeng, et al.
Published: (2025)
by: Gao, Linfeng, et al.
Published: (2025)
A Comprehensive Evaluation of Large Language Models on Temporal Event Forecasting
by: Chang, He, et al.
Published: (2024)
by: Chang, He, et al.
Published: (2024)
Comparative Analysis of Large Language Models in Generating Telugu Responses for Maternal Health Queries
by: Bhanusree, Anagani, et al.
Published: (2026)
by: Bhanusree, Anagani, et al.
Published: (2026)
Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation
by: Zhu, Jiachen, et al.
Published: (2024)
by: Zhu, Jiachen, et al.
Published: (2024)
MedDoc-Bot: A Chat Tool for Comparative Analysis of Large Language Models in the Context of the Pediatric Hypertension Guideline
by: Jabarulla, Mohamed Yaseen, et al.
Published: (2024)
by: Jabarulla, Mohamed Yaseen, et al.
Published: (2024)
Similar Items
-
Analyzing the Role of Context in Forecasting with Large Language Models
by: Mutschlechner, Gerrit, et al.
Published: (2025) -
Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction
by: Nako, Petraq, et al.
Published: (2025) -
Question Difficulty Estimation for Large Language Models via Answer Plausibility Scoring
by: Mozafari, Jamshid, et al.
Published: (2026) -
BracketRank: Large Language Model Document Ranking via Reasoning-based Competitive Elimination
by: Abdallah, Abdelrahman, et al.
Published: (2026) -
TEMPO: A Realistic Multi-Domain Benchmark for Temporal Reasoning-Intensive Retrieval
by: Abdallah, Abdelrahman, et al.
Published: (2026)