Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Savolainen, Oliver, Amjad, Dur e Najaf, Petcu, Roxana
Format:	Preprint
Published:	2025
Subjects:	Information Retrieval Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.02154
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918008727797760
author	Savolainen, Oliver Amjad, Dur e Najaf Petcu, Roxana
author_facet	Savolainen, Oliver Amjad, Dur e Najaf Petcu, Roxana
contents	This reproducibility study analyzes and extends the paper "Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models," which investigates how neural retrieval models encode task-relevant properties such as term frequency. We reproduce key experiments from the original paper, confirming that information on query terms is captured in the model encoding. We extend this work by applying activation patching to Spanish and Chinese datasets and by exploring whether document-length information is encoded in the model as well. Our results confirm that the designed activation patching method can isolate the behavior to specific components and tokens in neural retrieval models. Moreover, our findings indicate that the location of term frequency generalizes across languages and that in later layers, the information for sequence-level tasks is represented in the CLS token. The results highlight the need for further research into interpretability in information retrieval and reproducibility in machine learning research. Our code is available at https://github.com/OliverSavolainen/axiomatic-ir-reproduce.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_02154
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions Savolainen, Oliver Amjad, Dur e Najaf Petcu, Roxana Information Retrieval Artificial Intelligence This reproducibility study analyzes and extends the paper "Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models," which investigates how neural retrieval models encode task-relevant properties such as term frequency. We reproduce key experiments from the original paper, confirming that information on query terms is captured in the model encoding. We extend this work by applying activation patching to Spanish and Chinese datasets and by exploring whether document-length information is encoded in the model as well. Our results confirm that the designed activation patching method can isolate the behavior to specific components and tokens in neural retrieval models. Moreover, our findings indicate that the location of term frequency generalizes across languages and that in later layers, the information for sequence-level tasks is represented in the CLS token. The results highlight the need for further research into interpretability in information retrieval and reproducibility in machine learning research. Our code is available at https://github.com/OliverSavolainen/axiomatic-ir-reproduce.
title	Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions
topic	Information Retrieval Artificial Intelligence
url	https://arxiv.org/abs/2505.02154

Similar Items