:: Library Catalog

Bibliographic Details
Main Authors:	McKechnie, Jack; McDonald, Graham
Format:	Preprint
Published:	2024
Subjects:	Information Retrieval
Online Access:	https://arxiv.org/abs/2401.05144

Similar Items

Measuring Hypothesis Testing Errors in the Evaluation of Retrieval Systems
by: McKechnie, Jack, et al.
Published: (2025)

Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias
by: Dehghan, Mahdi, et al.
Published: (2026)

Query Exposure Prediction for Groups of Documents in Rankings
by: Jaenich, Thomas, et al.
Published: (2024)

Document Similarity Enhanced IPS Estimation for Unbiased Learning to Rank
by: Liang, Zeyan, et al.
Published: (2025)

Temporal Fact Conflicts in LLMs: Reproducibility Insights from Unifying DYNAMICQA and MULAN
by: Dey, Ritajit, et al.
Published: (2026)

Quantifying Query Fairness Under Unawareness
by: Jaenich, Thomas, et al.
Published: (2025)

Behind Closed Doors: An Exploratory Study of the Perceptions of Librarians and the Hidden Intellectual Work of Collection Development in Canadian Public Libraries.
by: Nilsen, Kirsti, et al.
Published: (2002)

The Cranfield II Relevance Assessments: A Critical Evaluation
by: Harter, Stephen P.
Published: (1971)

Self-Service Circulation: An Exploratory Study.
by: Carey, Robert F., et al.
Published: (1998)

Judging the Judges: A Collection of LLM-Generated Relevance Judgements
by: Rahmani, Hossein A., et al.
Published: (2025)

Variations in Relevance Judgments and the Shelf Life of Test Collections
by: Parry, Andrew, et al.
Published: (2025)

Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models
by: Chen, Catherine, et al.
Published: (2024)

JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
by: Rahmani, Hossein A., et al.
Published: (2024)

Batched Self-Consistency Improves LLM Relevance Assessment and Ranking
by: Korikov, Anton, et al.
Published: (2025)

When LLM Judges Inflate Scores: Exploring Overrating in Relevance Assessment
by: Yu, Chuting, et al.
Published: (2026)

Searching Personal Collections
by: Bendersky, Michael, et al.
Published: (2024)

SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression
by: Jin, Yiqiao, et al.
Published: (2025)

All Eyes on the Ranker: Participatory Auditing to Surface Blind Spots in Ranked Search Results
by: Rezk, Anna Marie, et al.
Published: (2026)

Limitations of Automatic Relevance Assessments with Large Language Models for Fair and Reliable Retrieval Evaluation
by: Otero, David, et al.
Published: (2024)

SelRoute: Query-Type-Aware Routing for Long-Term Conversational Memory Retrieval
by: McKee, Matthew
Published: (2026)

Calibration-Disentangled Learning and Relevance-Prioritized Reranking for Calibrated Sequential Recommendation
by: Jeon, Hyunsik, et al.
Published: (2024)

A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
by: Arabzadeh, Negar, et al.
Published: (2025)

Beyond Relevance: Evaluate and Improve Retrievers on Perspective Awareness
by: Zhao, Xinran, et al.
Published: (2024)

Judging with Personality and Confidence: A Study on Personality-Conditioned LLM Relevance Assessment
by: Chen, Nuo, et al.
Published: (2026)

SPECTRA: Synthetic IR Test Collections with Relevance Oracles and Controlled Distractor Diagnostics
by: Liang, Eric
Published: (2026)

LLM-based Relevance Assessment for Web-Scale Search Evaluation at Pinterest
by: Wang, Han, et al.
Published: (2025)

A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look
by: Upadhyay, Shivani, et al.
Published: (2024)

REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
by: Wang, Yuhao, et al.
Published: (2024)

LLMJudge: LLMs for Relevance Judgments
by: Rahmani, Hossein A., et al.
Published: (2024)

Generalized Pseudo-Relevance Feedback
by: Tu, Yiteng, et al.
Published: (2025)

R3A: Reinforced Reasoning for Relevance Assessment for RAG in User-Generated Content Platforms
by: Yuan, Xiaowei, et al.
Published: (2025)

A Deep Learning Approach for Selective Relevance Feedback
by: Datta, Suchana, et al.
Published: (2024)

LLM-Assisted Pseudo-Relevance Feedback
by: Otero, David, et al.
Published: (2026)

LUCid: Redefining Relevance For Lifelong Personalization
by: Okite, Chimaobi, et al.
Published: (2026)

Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions
by: Savolainen, Oliver, et al.
Published: (2025)

Sensitivity-Aware Retrieval-Augmented Intent Clarification
by: Larooij, Maik
Published: (2026)

REGENT: Relevance-Guided Attention for Entity-Aware Multi-Vector Neural Re-Ranking
by: Chatterjee, Shubham
Published: (2025)

Recent Relevance Research: Implications for Information Professionals.
by: Greisdorf, Howard, et al.
Published: (2000)

NeuCLIRBench: A Modern Evaluation Collection for Monolingual, Cross-Language, and Multilingual Information Retrieval
by: Lawrie, Dawn, et al.
Published: (2025)

"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented Generation
by: Thakur, Nandan, et al.
Published: (2023)