Saved in:
| Main Author: | Davies, Harry J |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.02688 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Specialised or Generic? Tokenization Choices for Radiology Language Models
by: Warr, Hermione, et al.
Published: (2025)
by: Warr, Hermione, et al.
Published: (2025)
FlashDecoding++: Faster Large Language Model Inference on GPUs
by: Hong, Ke, et al.
Published: (2023)
by: Hong, Ke, et al.
Published: (2023)
SlimSpec: Low-Rank Draft LM-Head for Accelerated Speculative Decoding
by: Plaksin, Anton, et al.
Published: (2026)
by: Plaksin, Anton, et al.
Published: (2026)
Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models
by: Davies, Harry J., et al.
Published: (2024)
by: Davies, Harry J., et al.
Published: (2024)
Replicating ReLM Results: Validating Large Language Models with ReLM
by: Adamson, Reece, et al.
Published: (2025)
by: Adamson, Reece, et al.
Published: (2025)
Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
by: Yu, Zeping, et al.
Published: (2024)
by: Yu, Zeping, et al.
Published: (2024)
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model
by: Huo, Jiahao, et al.
Published: (2024)
by: Huo, Jiahao, et al.
Published: (2024)
Preference Heads in Large Language Models: A Mechanistic Framework for Interpretable Personalization
by: Zhang, Weixu, et al.
Published: (2026)
by: Zhang, Weixu, et al.
Published: (2026)
CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models
by: Liang, Yihao, et al.
Published: (2026)
by: Liang, Yihao, et al.
Published: (2026)
MzansiText and MzansiLM: An Open Corpus and Decoder-Only Language Model for South African Languages
by: Lombard, Anri, et al.
Published: (2026)
by: Lombard, Anri, et al.
Published: (2026)
DFlash: Block Diffusion for Flash Speculative Decoding
by: Chen, Jian, et al.
Published: (2026)
by: Chen, Jian, et al.
Published: (2026)
Constructing Interpretable Features from Compositional Neuron Groups
by: Shafran, Or, et al.
Published: (2025)
by: Shafran, Or, et al.
Published: (2025)
Automatically Interpreting Millions of Features in Large Language Models
by: Paulo, Gonçalo, et al.
Published: (2024)
by: Paulo, Gonçalo, et al.
Published: (2024)
Lost in Backpropagation: The LM Head is a Gradient Bottleneck
by: Godey, Nathan, et al.
Published: (2026)
by: Godey, Nathan, et al.
Published: (2026)
Interpreting Bias in Large Language Models: A Feature-Based Approach
by: Prakash, Nirmalendu, et al.
Published: (2024)
by: Prakash, Nirmalendu, et al.
Published: (2024)
Unveiling Language Competence Neurons: A Psycholinguistic Approach to Model Interpretability
by: Duan, Xufeng, et al.
Published: (2024)
by: Duan, Xufeng, et al.
Published: (2024)
On Relation-Specific Neurons in Large Language Models
by: Liu, Yihong, et al.
Published: (2025)
by: Liu, Yihong, et al.
Published: (2025)
DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models
by: Jiao, Cathy, et al.
Published: (2025)
by: Jiao, Cathy, et al.
Published: (2025)
Improving Neuron-level Interpretability with White-box Language Models
by: Bai, Hao, et al.
Published: (2024)
by: Bai, Hao, et al.
Published: (2024)
LM2: Large Memory Models
by: Kang, Jikun, et al.
Published: (2025)
by: Kang, Jikun, et al.
Published: (2025)
DarwinLM: Evolutionary Structured Pruning of Large Language Models
by: Tang, Shengkun, et al.
Published: (2025)
by: Tang, Shengkun, et al.
Published: (2025)
CogLM: Tracking Cognitive Development of Large Language Models
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Pre-trained Language Models
by: Liu, Yan, et al.
Published: (2024)
by: Liu, Yan, et al.
Published: (2024)
Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model
by: Xu, Haoyun, et al.
Published: (2024)
by: Xu, Haoyun, et al.
Published: (2024)
InternLM-Law: An Open Source Chinese Legal Large Language Model
by: Fei, Zhiwei, et al.
Published: (2024)
by: Fei, Zhiwei, et al.
Published: (2024)
SaulLM-7B: A pioneering Large Language Model for Law
by: Colombo, Pierre, et al.
Published: (2024)
by: Colombo, Pierre, et al.
Published: (2024)
ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting
by: Cheng, Xiaoxue, et al.
Published: (2024)
by: Cheng, Xiaoxue, et al.
Published: (2024)
On the Multilingual Ability of Decoder-based Pre-trained Language Models: Finding and Controlling Language-Specific Neurons
by: Kojima, Takeshi, et al.
Published: (2024)
by: Kojima, Takeshi, et al.
Published: (2024)
From Heads to Neurons: Causal Attribution and Steering in Multi-Task Vision-Language Models
by: Wang, Qidong, et al.
Published: (2026)
by: Wang, Qidong, et al.
Published: (2026)
Isolating Culture Neurons in Multilingual Large Language Models
by: Namazifard, Danial, et al.
Published: (2025)
by: Namazifard, Danial, et al.
Published: (2025)
Neuron-Level Sequential Editing for Large Language Models
by: Jiang, Houcheng, et al.
Published: (2024)
by: Jiang, Houcheng, et al.
Published: (2024)
JudgeLM: Fine-tuned Large Language Models are Scalable Judges
by: Zhu, Lianghui, et al.
Published: (2023)
by: Zhu, Lianghui, et al.
Published: (2023)
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
by: Vashurin, Roman, et al.
Published: (2024)
by: Vashurin, Roman, et al.
Published: (2024)
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
by: Zhang, Biao, et al.
Published: (2025)
by: Zhang, Biao, et al.
Published: (2025)
Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models
by: Tang, Tianyi, et al.
Published: (2024)
by: Tang, Tianyi, et al.
Published: (2024)
SEKE: Specialised Experts for Keyword Extraction
by: Martinc, Matej, et al.
Published: (2024)
by: Martinc, Matej, et al.
Published: (2024)
FaithLM: Towards Faithful Explanations for Large Language Models
by: Chuang, Yu-Neng, et al.
Published: (2024)
by: Chuang, Yu-Neng, et al.
Published: (2024)
KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model
by: Zhang, Kai, et al.
Published: (2025)
by: Zhang, Kai, et al.
Published: (2025)
SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement
by: Ge, Yuan, et al.
Published: (2025)
by: Ge, Yuan, et al.
Published: (2025)
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning
by: Ying, Huaiyuan, et al.
Published: (2024)
by: Ying, Huaiyuan, et al.
Published: (2024)
Similar Items
-
Specialised or Generic? Tokenization Choices for Radiology Language Models
by: Warr, Hermione, et al.
Published: (2025) -
FlashDecoding++: Faster Large Language Model Inference on GPUs
by: Hong, Ke, et al.
Published: (2023) -
SlimSpec: Low-Rank Draft LM-Head for Accelerated Speculative Decoding
by: Plaksin, Anton, et al.
Published: (2026) -
Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models
by: Davies, Harry J., et al.
Published: (2024) -
Replicating ReLM Results: Validating Large Language Models with ReLM
by: Adamson, Reece, et al.
Published: (2025)