| Main Authors: | Nielsen, Beatrix M. G., Marconato, Emanuele, Gresele, Luigi, Dittadi, Andrea, Buchholz, Simon |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.15438 |
Similar Items
When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective
by: Nielsen, Beatrix M. G., et al.
Published: (2025)
All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling
by: Marconato, Emanuele, et al.
Published: (2024)
Relational Linear Properties in Language Models: An Empirical Investigation
by: Valer, Giovanni, et al.
Published: (2026)
Causal Component Analysis
by: Wendong, Liang, et al.
Published: (2023)
Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models
by: Bortolotti, Samuele, et al.
Published: (2026)
What is causal about causal models and representations?
by: Jørgensen, Frederik Hytting, et al.
Published: (2025)
Shortcuts and Identifiability in Concept-based Models from a Neuro-Symbolic Lens
by: Bortolotti, Samuele, et al.
Published: (2025)
DiffEnc: Variational Diffusion with a Learned Encoder
by: Nielsen, Beatrix M. G., et al.
Published: (2023)
BEARS Make Neuro-Symbolic Models Aware of their Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2024)
Sparse Shift Autoencoders for Identifying Concepts from Large Language Model Activations
by: Joshi, Shruti, et al.
Published: (2025)
A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts
by: Bortolotti, Samuele, et al.
Published: (2024)
Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
by: Emadi, Seyed Morteza
Published: (2026)
Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)
Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models
by: Rajendran, Goutham, et al.
Published: (2024)
From Logits to Hierarchies: Hierarchical Clustering made Simple
by: Palumbo, Emanuele, et al.
Published: (2024)
Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation
by: Zhong, Yisheng, et al.
Published: (2026)
Measuring Time-Series Dataset Similarity using Wasserstein Distance
by: Chen, Hongjie, et al.
Published: (2025)
Certified Robustness Under Bounded Levenshtein Distance
by: Rocamora, Elias Abad, et al.
Published: (2025)
Logit Dynamics in Softmax Policy Gradient Methods
by: Li, Yingru
Published: (2025)
SMART: Relation-Aware Learning of Geometric Representations for Knowledge Graphs
by: Amouzouvi, Kossi, et al.
Published: (2025)
Spectral Logit Sculpting: Adaptive Low-Rank Logit Transformation for Controlled Text Generation
by: Li, Jin, et al.
Published: (2025)
Logit Distillation on Manifolds: Mapping by Learning
by: Yang, Yiru, et al.
Published: (2026)
Algorithmic causal structure emerging through compression
by: Wendong, Liang, et al.
Published: (2025)
Peak-Controlled Logits Poisoning Attack in Federated Distillation
by: Tang, Yuhan, et al.
Published: (2024)
Model-Level GNN Explanations via Rule-to-Graph Readout for Logit Reconstruction
by: Lu, Shengyao, et al.
Published: (2025)
Molecular Graph Representation Learning via Structural Similarity Information
by: Yao, Chengyu, et al.
Published: (2024)
What Representational Similarity Measures Imply about Decodable Information
by: Harvey, Sarah E., et al.
Published: (2024)
An Explainable Multi-Task Similarity Measure: Integrating Accumulated Local Effects and Weighted Fréchet Distance
by: Hidalgo, Pablo, et al.
Published: (2026)
Auxiliary Reward Generation with Transition Distance Representation Learning
by: Li, Siyuan, et al.
Published: (2024)
SCALA: Split Federated Learning with Concatenated Activations and Logit Adjustments
by: Yang, Jiarong, et al.
Published: (2024)
From Projection to Prediction: Beyond Logits for Scalable Language Models
by: Dong, Jianbing, et al.
Published: (2025)
CLadder: Assessing Causal Reasoning in Language Models
by: Jin, Zhijing, et al.
Published: (2023)
From Molecules to Mixtures: Learning Representations of Olfactory Mixture Similarity using Inductive Biases
by: Tom, Gary, et al.
Published: (2025)
Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations
by: Escudero-Arnanz, Óscar, et al.
Published: (2025)
Formalising the Logit Shift Induced by LoRA: A Technical Note
by: Shi, Xiang, et al.
Published: (2026)
The Triangle of Similarity: A Multi-Faceted Framework for Comparing Neural Network Representations
by: Sirikova, Olha, et al.
Published: (2026)
Multiclass Local Calibration with the Jensen-Shannon Distance
by: Barbera, Cesare, et al.
Published: (2025)
SHRED: Retain-Set-Free Unlearning via Self-Distillation with Logit Demotion
by: Hu, Zizhao, et al.
Published: (2026)
Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization
by: Luo, Haocheng, et al.
Published: (2026)
An Adversarial Example for Direct Logit Attribution: Memory Management in GELU-4L
by: Janiak, Jett, et al.
Published: (2023)