Saved in:
| Main Author: | Diaz, Fernando |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.13680 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Pessimistic Off-Policy Optimization for Learning to Rank
by: Cief, Matej, et al.
Published: (2022)
by: Cief, Matej, et al.
Published: (2022)
Recall, Robustness, and Lexicographic Evaluation
by: Diaz, Fernando, et al.
Published: (2023)
by: Diaz, Fernando, et al.
Published: (2023)
Evaluation of Agents under Simulated AI Marketplace Dynamics
by: Kim, To Eun, et al.
Published: (2026)
by: Kim, To Eun, et al.
Published: (2026)
Offline Evaluation of Set-Based Text-to-Image Generation
by: Arabzadeh, Negar, et al.
Published: (2024)
by: Arabzadeh, Negar, et al.
Published: (2024)
Multilingual and Domain-Agnostic Tip-of-the-Tongue Query Generation for Simulated Evaluation
by: He, Xuhong, et al.
Published: (2026)
by: He, Xuhong, et al.
Published: (2026)
LTRR: Learning To Rank Retrievers for LLMs
by: Kim, To Eun, et al.
Published: (2025)
by: Kim, To Eun, et al.
Published: (2025)
The Impact of Group Membership Bias on the Quality and Fairness of Exposure in Ranking
by: Vardasbi, Ali, et al.
Published: (2023)
by: Vardasbi, Ali, et al.
Published: (2023)
Overview of the TREC 2025 Tip-of-the-Tongue track
by: Arguello, Jaime, et al.
Published: (2026)
by: Arguello, Jaime, et al.
Published: (2026)
Comprehensive Evaluation of Matrix Factorization Models for Collaborative Filtering Recommender Systems
by: Bobadilla, Jesús, et al.
Published: (2024)
by: Bobadilla, Jesús, et al.
Published: (2024)
Density-based User Representation using Gaussian Process Regression for Multi-interest Personalized Retrieval
by: Wu, Haolun, et al.
Published: (2023)
by: Wu, Haolun, et al.
Published: (2023)
An Intrinsic Framework of Information Retrieval Evaluation Measures
by: Giner, Fernando
Published: (2023)
by: Giner, Fernando
Published: (2023)
Tip of the Tongue Query Elicitation for Simulated Evaluation
by: He, Yifan, et al.
Published: (2025)
by: He, Yifan, et al.
Published: (2025)
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation
by: Kim, To Eun, et al.
Published: (2024)
by: Kim, To Eun, et al.
Published: (2024)
An Evaluation Study of Generative Adversarial Networks for Collaborative Filtering
by: Maurera, Fernando Benjamín Pérez, et al.
Published: (2022)
by: Maurera, Fernando Benjamín Pérez, et al.
Published: (2022)
The Cranfield II Relevance Assessments: A Critical Evaluation
by: Harter, Stephen P.
Published: (1971)
by: Harter, Stephen P.
Published: (1971)
Author Unknown: Evaluating Performance of Author Extraction Libraries on Global Online News Articles
by: Hatwar, Sriharsha, et al.
Published: (2024)
by: Hatwar, Sriharsha, et al.
Published: (2024)
On the Evaluation Metric for Hashing
by: Jiang, Qing-Yuan, et al.
Published: (2019)
by: Jiang, Qing-Yuan, et al.
Published: (2019)
Human-Computer Interaction as a basis for assessing Geographic Information Retrieval Systems.
by: Manuel Enrique Puebla Martínez
Published: (2018)
by: Manuel Enrique Puebla Martínez
Published: (2018)
Evaluating the Explainability of Neural Rankers
by: Pandian, Saran, et al.
Published: (2024)
by: Pandian, Saran, et al.
Published: (2024)
Generative Information Retrieval Evaluation
by: Alaofi, Marwah, et al.
Published: (2024)
by: Alaofi, Marwah, et al.
Published: (2024)
Pointwise Metrics for Clustering Evaluation
by: van Staden, Stephan
Published: (2024)
by: van Staden, Stephan
Published: (2024)
The Viability of Crowdsourcing for RAG Evaluation
by: Gienapp, Lukas, et al.
Published: (2025)
by: Gienapp, Lukas, et al.
Published: (2025)
Online and Offline Evaluation in Search Clarification
by: Tavakoli, Leila, et al.
Published: (2024)
by: Tavakoli, Leila, et al.
Published: (2024)
Beyond Utility: Evaluating LLM as Recommender
by: Jiang, Chumeng, et al.
Published: (2024)
by: Jiang, Chumeng, et al.
Published: (2024)
Sustainability Evaluation Metrics for Recommender Systems
by: Felfernig, Alexander, et al.
Published: (2025)
by: Felfernig, Alexander, et al.
Published: (2025)
Leveraging LLMs to Evaluate Usefulness of Document
by: Wang, Xingzhu, et al.
Published: (2025)
by: Wang, Xingzhu, et al.
Published: (2025)
Application of the Variety-Generator Approach to Searches of Personal Names in Bibliographic Data Bases - Part 2. Optimization of Key-Sets, and Evaluation of Their Retrieval Efficiency
by: Fokker, Dirk W., et al.
Published: (1974)
by: Fokker, Dirk W., et al.
Published: (1974)
ASPIRE: Assistive System for Performance Evaluation in IR
by: Peikos, Georgios, et al.
Published: (2024)
by: Peikos, Georgios, et al.
Published: (2024)
Replicability Measures for Longitudinal Information Retrieval Evaluation
by: Keller, Jüri, et al.
Published: (2024)
by: Keller, Jüri, et al.
Published: (2024)
A Comparison of Methods for Evaluating Generative IR
by: Arabzadeh, Negar, et al.
Published: (2024)
by: Arabzadeh, Negar, et al.
Published: (2024)
Evaluation of Cluster Id Assignment Schemes with ABCDE
by: van Staden, Stephan
Published: (2024)
by: van Staden, Stephan
Published: (2024)
Evaluation of Temporal Change in IR Test Collections
by: Keller, Jüri, et al.
Published: (2024)
by: Keller, Jüri, et al.
Published: (2024)
Offline Evaluation Measures of Fairness in Recommender Systems
by: Rampisela, Theresia Veronika
Published: (2026)
by: Rampisela, Theresia Veronika
Published: (2026)
Evaluating Search System Explainability with Psychometrics and Crowdsourcing
by: Chen, Catherine, et al.
Published: (2022)
by: Chen, Catherine, et al.
Published: (2022)
LLM-Driven Usefulness Labeling for IR Evaluation
by: Dewan, Mouly, et al.
Published: (2025)
by: Dewan, Mouly, et al.
Published: (2025)
Experimental Evaluation of Dynamic Topic Modeling Algorithms
by: Onah, Ngozichukwuka, et al.
Published: (2025)
by: Onah, Ngozichukwuka, et al.
Published: (2025)
Retrieval Augmented Generation Evaluation for Health Documents
by: Ceresa, Mario, et al.
Published: (2025)
by: Ceresa, Mario, et al.
Published: (2025)
The Hidden Cost of Defaults in Recommender System Evaluation
by: Berling, Hannah, et al.
Published: (2025)
by: Berling, Hannah, et al.
Published: (2025)
CF4J: Collaborative Filtering for Java
by: Ortega, Fernando, et al.
Published: (2024)
by: Ortega, Fernando, et al.
Published: (2024)
Reliability quality measures for recommender systems
by: Bobadilla, Jesús, et al.
Published: (2024)
by: Bobadilla, Jesús, et al.
Published: (2024)
Similar Items
-
Pessimistic Off-Policy Optimization for Learning to Rank
by: Cief, Matej, et al.
Published: (2022) -
Recall, Robustness, and Lexicographic Evaluation
by: Diaz, Fernando, et al.
Published: (2023) -
Evaluation of Agents under Simulated AI Marketplace Dynamics
by: Kim, To Eun, et al.
Published: (2026) -
Offline Evaluation of Set-Based Text-to-Image Generation
by: Arabzadeh, Negar, et al.
Published: (2024) -
Multilingual and Domain-Agnostic Tip-of-the-Tongue Query Generation for Simulated Evaluation
by: He, Xuhong, et al.
Published: (2026)