Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Junhao, Yu, Haonan, Yan, Zhenyu, Zhang, Xin
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2602.04607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910011641298944
author	Liu, Junhao Yu, Haonan Yan, Zhenyu Zhang, Xin
author_facet	Liu, Junhao Yu, Haonan Yan, Zhenyu Zhang, Xin
contents	As Large Language Models (LLMs) scale to handle massive context windows, achieving surgical feature-level interpretation is essential for high-stakes tasks like legal auditing and code debugging. However, existing local model-agnostic explanation methods face a critical dilemma in these scenarios: feature-based methods suffer from attribution dilution due to high feature dimensionality, thus failing to provide faithful explanations. In this paper, we propose Focus-LIME, a coarse-to-fine framework designed to restore the tractability of surgical interpretation. Focus-LIME utilizes a proxy model to curate the perturbation neighborhood, allowing the target model to perform fine-grained attribution exclusively within the optimized context. Empirical evaluations on long-context benchmarks demonstrate that our method makes surgical explanations practicable and provides faithful explanations to users.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_04607
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Focus-LIME: Surgical Interpretation of Long-Context Large Language Models via Proxy-Based Neighborhood Selection Liu, Junhao Yu, Haonan Yan, Zhenyu Zhang, Xin Computation and Language Machine Learning As Large Language Models (LLMs) scale to handle massive context windows, achieving surgical feature-level interpretation is essential for high-stakes tasks like legal auditing and code debugging. However, existing local model-agnostic explanation methods face a critical dilemma in these scenarios: feature-based methods suffer from attribution dilution due to high feature dimensionality, thus failing to provide faithful explanations. In this paper, we propose Focus-LIME, a coarse-to-fine framework designed to restore the tractability of surgical interpretation. Focus-LIME utilizes a proxy model to curate the perturbation neighborhood, allowing the target model to perform fine-grained attribution exclusively within the optimized context. Empirical evaluations on long-context benchmarks demonstrate that our method makes surgical explanations practicable and provides faithful explanations to users.
title	Focus-LIME: Surgical Interpretation of Long-Context Large Language Models via Proxy-Based Neighborhood Selection
topic	Computation and Language Machine Learning
url	https://arxiv.org/abs/2602.04607

Similar Items