| Main Authors: | McKechnie, Jack, McDonald, Graham |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.05144 |
Similar Items
Measuring Hypothesis Testing Errors in the Evaluation of Retrieval Systems
by: McKechnie, Jack, et al.
Published: (2025)
Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias
by: Dehghan, Mahdi, et al.
Published: (2026)
Query Exposure Prediction for Groups of Documents in Rankings
by: Jaenich, Thomas, et al.
Published: (2024)
Document Similarity Enhanced IPS Estimation for Unbiased Learning to Rank
by: Liang, Zeyan, et al.
Published: (2025)
Temporal Fact Conflicts in LLMs: Reproducibility Insights from Unifying DYNAMICQA and MULAN
by: Dey, Ritajit, et al.
Published: (2026)
Quantifying Query Fairness Under Unawareness
by: Jaenich, Thomas, et al.
Published: (2025)
Behind Closed Doors: An Exploratory Study of the Perceptions of Librarians and the Hidden Intellectual Work of Collection Development in Canadian Public Libraries.
by: Nilsen, Kirsti, et al.
Published: (2002)
The Cranfield II Relevance Assessments: A Critical Evaluation
by: Harter, Stephen P.
Published: (1971)
Self-Service Circulation: An Exploratory Study.
by: Carey, Robert F., et al.
Published: (1998)
Judging the Judges: A Collection of LLM-Generated Relevance Judgements
by: Rahmani, Hossein A., et al.
Published: (2025)
Variations in Relevance Judgments and the Shelf Life of Test Collections
by: Parry, Andrew, et al.
Published: (2025)
Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
by: Chen, Catherine, et al.
Published: (2024)
JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
by: Rahmani, Hossein A., et al.
Published: (2024)
Batched Self-Consistency Improves LLM Relevance Assessment and Ranking
by: Korikov, Anton, et al.
Published: (2025)
When LLM Judges Inflate Scores: Exploring Overrating in Relevance Assessment
by: Yu, Chuting, et al.
Published: (2026)
Searching Personal Collections
by: Bendersky, Michael, et al.
Published: (2024)
SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression
by: Jin, Yiqiao, et al.
Published: (2025)
All Eyes on the Ranker: Participatory Auditing to Surface Blind Spots in Ranked Search Results
by: Rezk, Anna Marie, et al.
Published: (2026)
Limitations of Automatic Relevance Assessments with Large Language Models for Fair and Reliable Retrieval Evaluation
by: Otero, David, et al.
Published: (2024)
SelRoute: Query-Type-Aware Routing for Long-Term Conversational Memory Retrieval
by: McKee, Matthew
Published: (2026)
Calibration-Disentangled Learning and Relevance-Prioritized Reranking for Calibrated Sequential Recommendation
by: Jeon, Hyunsik, et al.
Published: (2024)
A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
by: Arabzadeh, Negar, et al.
Published: (2025)
Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness
by: Zhao, Xinran, et al.
Published: (2024)
Judging with Personality and Confidence: A Study on Personality-Conditioned LLM Relevance Assessment
by: Chen, Nuo, et al.
Published: (2026)
SPECTRA: Synthetic IR Test Collections with Relevance Oracles and Controlled Distractor Diagnostics
by: Liang, Eric
Published: (2026)
LLM-based Relevance Assessment for Web-Scale Search Evaluation at Pinterest
by: Wang, Han, et al.
Published: (2025)
A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look
by: Upadhyay, Shivani, et al.
Published: (2024)
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
by: Wang, Yuhao, et al.
Published: (2024)
LLMJudge: LLMs for Relevance Judgments
by: Rahmani, Hossein A., et al.
Published: (2024)
Generalized Pseudo-Relevance Feedback
by: Tu, Yiteng, et al.
Published: (2025)
R3A: Reinforced Reasoning for Relevance Assessment for RAG in User-Generated Content Platforms
by: Yuan, Xiaowei, et al.
Published: (2025)
A Deep Learning Approach for Selective Relevance Feedback
by: Datta, Suchana, et al.
Published: (2024)
LLM-Assisted Pseudo-Relevance Feedback
by: Otero, David, et al.
Published: (2026)
LUCid: Redefining Relevance For Lifelong Personalization
by: Okite, Chimaobi, et al.
Published: (2026)
Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions
by: Savolainen, Oliver, et al.
Published: (2025)
Sensitivity-Aware Retrieval-Augmented Intent Clarification
by: Larooij, Maik
Published: (2026)
REGENT: Relevance-Guided Attention for Entity-Aware Multi-Vector Neural Re-Ranking
by: Chatterjee, Shubham
Published: (2025)
Recent Relevance Research: Implications for Information Professionals.
by: Greisdorf, Howard, et al.
Published: (2000)
NeuCLIRBench: A Modern Evaluation Collection for Monolingual, Cross-Language, and Multilingual Information Retrieval
by: Lawrie, Dawn, et al.
Published: (2025)
"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation
by: Thakur, Nandan, et al.
Published: (2023)