Saved in:
Bibliographic Details
Main Authors: Savolainen, Oliver, Amjad, Dur e Najaf, Petcu, Roxana
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.02154
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918008727797760
author Savolainen, Oliver
Amjad, Dur e Najaf
Petcu, Roxana
author_facet Savolainen, Oliver
Amjad, Dur e Najaf
Petcu, Roxana
contents This reproducibility study analyzes and extends the paper "Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models," which investigates how neural retrieval models encode task-relevant properties such as term frequency. We reproduce key experiments from the original paper, confirming that information on query terms is captured in the model encoding. We extend this work by applying activation patching to Spanish and Chinese datasets and by exploring whether document-length information is encoded in the model as well. Our results confirm that the designed activation patching method can isolate the behavior to specific components and tokens in neural retrieval models. Moreover, our findings indicate that the location of term frequency generalizes across languages and that in later layers, the information for sequence-level tasks is represented in the CLS token. The results highlight the need for further research into interpretability in information retrieval and reproducibility in machine learning research. Our code is available at https://github.com/OliverSavolainen/axiomatic-ir-reproduce.
format Preprint
id arxiv_https___arxiv_org_abs_2505_02154
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions
Savolainen, Oliver
Amjad, Dur e Najaf
Petcu, Roxana
Information Retrieval
Artificial Intelligence
This reproducibility study analyzes and extends the paper "Axiomatic Causal Interventions for Reverse Engineering Relevance Computation in Neural Retrieval Models," which investigates how neural retrieval models encode task-relevant properties such as term frequency. We reproduce key experiments from the original paper, confirming that information on query terms is captured in the model encoding. We extend this work by applying activation patching to Spanish and Chinese datasets and by exploring whether document-length information is encoded in the model as well. Our results confirm that the designed activation patching method can isolate the behavior to specific components and tokens in neural retrieval models. Moreover, our findings indicate that the location of term frequency generalizes across languages and that in later layers, the information for sequence-level tasks is represented in the CLS token. The results highlight the need for further research into interpretability in information retrieval and reproducibility in machine learning research. Our code is available at https://github.com/OliverSavolainen/axiomatic-ir-reproduce.
title Interpreting Multilingual and Document-Length Sensitive Relevance Computations in Neural Retrieval Models through Axiomatic Causal Interventions
topic Information Retrieval
Artificial Intelligence
url https://arxiv.org/abs/2505.02154