Saved in:
| Main Authors: | Möller, Lucas, Nikolaev, Dmitry, Padó, Sebastian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.02883 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Understanding the Relationship between In-context Learning and Compositional Generalization
by: Han, Sungjun, et al.
Published: (2024)
by: Han, Sungjun, et al.
Published: (2024)
Regular-pattern-sensitive CRFs for Distant Label Interactions
by: Papay, Sean, et al.
Published: (2024)
by: Papay, Sean, et al.
Published: (2024)
Explaining Caption-Image Interactions in CLIP Models with Second-Order Attributions
by: Möller, Lucas, et al.
Published: (2024)
by: Möller, Lucas, et al.
Published: (2024)
Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs
by: Ceron, Tanise, et al.
Published: (2024)
by: Ceron, Tanise, et al.
Published: (2024)
Off-the-Shelf LLMs as Process Scorers: Training-Free Alternative to PRMs for Mathematical Reasoning
by: Chegini, Atoosa, et al.
Published: (2026)
by: Chegini, Atoosa, et al.
Published: (2026)
An Analysis of Embedding Layers and Similarity Scores using Siamese Neural Networks
by: Bingi, Yash, et al.
Published: (2023)
by: Bingi, Yash, et al.
Published: (2023)
Interpretable Text Embeddings and Text Similarity Explanation: A Survey
by: Opitz, Juri, et al.
Published: (2025)
by: Opitz, Juri, et al.
Published: (2025)
Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation
by: Jantsch, Lasse Marten, et al.
Published: (2026)
by: Jantsch, Lasse Marten, et al.
Published: (2026)
Do Language Models Encode Knowledge of Linguistic Constraint Violations?
by: Hardy, et al.
Published: (2026)
by: Hardy, et al.
Published: (2026)
Qwen Goes Brrr: Off-the-Shelf RAG for Ukrainian Multi-Domain Document Understanding
by: Bazdyrev, Anton, et al.
Published: (2026)
by: Bazdyrev, Anton, et al.
Published: (2026)
Pseudo-Siamese Network for Planning in Target-Oriented Proactive Dialogues
by: Kang, Xinyue, et al.
Published: (2026)
by: Kang, Xinyue, et al.
Published: (2026)
Efficient Language Modeling for Low-Resource Settings with Hybrid RNN-Transformer Architectures
by: Lindenmaier, Gabriel, et al.
Published: (2025)
by: Lindenmaier, Gabriel, et al.
Published: (2025)
Learning to Explain: Supervised Token Attribution from Transformer Attention Patterns
by: Mihaila, George
Published: (2026)
by: Mihaila, George
Published: (2026)
Diverging Transformer Predictions for Human Sentence Processing: A Comprehensive Analysis of Agreement Attraction Effects
by: von der Malsburg, Titus, et al.
Published: (2026)
by: von der Malsburg, Titus, et al.
Published: (2026)
Evaluating the Robustness of Adverse Drug Event Classification Models Using Templates
by: MacPhail, Dorothea, et al.
Published: (2024)
by: MacPhail, Dorothea, et al.
Published: (2024)
Hyperloop Transformers
by: Zeitoun, Abbas, et al.
Published: (2026)
by: Zeitoun, Abbas, et al.
Published: (2026)
Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean
by: Park, Dojun, et al.
Published: (2024)
by: Park, Dojun, et al.
Published: (2024)
Finding Sense in Nonsense with Generated Contexts: Perspectives from Humans and Language Models
by: Olsen, Katrina, et al.
Published: (2026)
by: Olsen, Katrina, et al.
Published: (2026)
Refining GPT-3 Embeddings with a Siamese Structure for Technical Post Duplicate Detection
by: Wu, Xingfang, et al.
Published: (2023)
by: Wu, Xingfang, et al.
Published: (2023)
Transformer-VQ: Linear-Time Transformers via Vector Quantization
by: Lingle, Lucas D.
Published: (2023)
by: Lingle, Lucas D.
Published: (2023)
Cross-Refine: Improving Natural Language Explanation Generation by Learning in Tandem
by: Wang, Qianli, et al.
Published: (2024)
by: Wang, Qianli, et al.
Published: (2024)
Artwork Interpretation with Vision Language Models: A Case Study on Emotions and Emotion Symbols
by: Padó, Sebastian, et al.
Published: (2025)
by: Padó, Sebastian, et al.
Published: (2025)
On the Duality between Gradient Transformations and Adapters
by: Torroba-Hennigen, Lucas, et al.
Published: (2025)
by: Torroba-Hennigen, Lucas, et al.
Published: (2025)
Transformer-based Joint Modelling for Automatic Essay Scoring and Off-Topic Detection
by: Das, Sourya Dipta, et al.
Published: (2024)
by: Das, Sourya Dipta, et al.
Published: (2024)
Learning to Attribute with Attention
by: Cohen-Wang, Benjamin, et al.
Published: (2025)
by: Cohen-Wang, Benjamin, et al.
Published: (2025)
Understanding Gated Neurons in Transformers from Their Input-Output Functionality
by: Gerstner, Sebastian, et al.
Published: (2025)
by: Gerstner, Sebastian, et al.
Published: (2025)
GLUScope: A Tool for Analyzing GLU Neurons in Transformer Language Models
by: Gerstner, Sebastian, et al.
Published: (2026)
by: Gerstner, Sebastian, et al.
Published: (2026)
Hidden Heroes and Gradient Bloats: Layer-Wise Redundancy Inverts Attribution in Transformers
by: Ye, Donald
Published: (2026)
by: Ye, Donald
Published: (2026)
FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation
by: Wang, Qianli, et al.
Published: (2025)
by: Wang, Qianli, et al.
Published: (2025)
Transformers represent belief state geometry in their residual stream
by: Shai, Adam S., et al.
Published: (2024)
by: Shai, Adam S., et al.
Published: (2024)
AttributionBench: How Hard is Automatic Attribution Evaluation?
by: Li, Yifei, et al.
Published: (2024)
by: Li, Yifei, et al.
Published: (2024)
Through a Compressed Lens: Investigating The Impact of Quantization on Factual Knowledge Recall
by: Wang, Qianli, et al.
Published: (2025)
by: Wang, Qianli, et al.
Published: (2025)
Universal Approximation of Visual Autoregressive Transformers
by: Chen, Yifang, et al.
Published: (2025)
by: Chen, Yifang, et al.
Published: (2025)
ContextCite: Attributing Model Generation to Context
by: Cohen-Wang, Benjamin, et al.
Published: (2024)
by: Cohen-Wang, Benjamin, et al.
Published: (2024)
The Impact of Automatic Speech Transcription on Speaker Attribution
by: Aggazzotti, Cristina, et al.
Published: (2025)
by: Aggazzotti, Cristina, et al.
Published: (2025)
Attribution analysis of legal language as used by LLM
by: Belew, Richard K.
Published: (2025)
by: Belew, Richard K.
Published: (2025)
Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time
by: Liang, Yingyu, et al.
Published: (2024)
by: Liang, Yingyu, et al.
Published: (2024)
Confidence Preservation Property in Knowledge Distillation Abstractions
by: Vengertsev, Dmitry, et al.
Published: (2024)
by: Vengertsev, Dmitry, et al.
Published: (2024)
Neuron-Level Knowledge Attribution in Large Language Models
by: Yu, Zeping, et al.
Published: (2023)
by: Yu, Zeping, et al.
Published: (2023)
Off-Policy Value-Based Reinforcement Learning for Large Language Models
by: Wang, Peng-Yuan, et al.
Published: (2026)
by: Wang, Peng-Yuan, et al.
Published: (2026)
Similar Items
-
Towards Understanding the Relationship between In-context Learning and Compositional Generalization
by: Han, Sungjun, et al.
Published: (2024) -
Regular-pattern-sensitive CRFs for Distant Label Interactions
by: Papay, Sean, et al.
Published: (2024) -
Explaining Caption-Image Interactions in CLIP Models with Second-Order Attributions
by: Möller, Lucas, et al.
Published: (2024) -
Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs
by: Ceron, Tanise, et al.
Published: (2024) -
Off-the-Shelf LLMs as Process Scorers: Training-Free Alternative to PRMs for Mathematical Reasoning
by: Chegini, Atoosa, et al.
Published: (2026)