:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Nielsen, Beatrix M. G., Marconato, Emanuele, Gresele, Luigi, Dittadi, Andrea, Buchholz, Simon
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.15438
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective
by: Nielsen, Beatrix M. G., et al.
Published: (2025)

All or None: Identifiable Linear Properties of Next-token Predictors in Language Modeling
by: Marconato, Emanuele, et al.
Published: (2024)

Relational Linear Properties in Language Models: An Empirical Investigation
by: Valer, Giovanni, et al.
Published: (2026)

Causal Component Analysis
by: Wendong, Liang, et al.
Published: (2023)

Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models
by: Bortolotti, Samuele, et al.
Published: (2026)

What is causal about causal models and representations?
by: Jørgensen, Frederik Hytting, et al.
Published: (2025)

Shortcuts and Identifiability in Concept-based Models from a Neuro-Symbolic Lens
by: Bortolotti, Samuele, et al.
Published: (2025)

DiffEnc: Variational Diffusion with a Learned Encoder
by: Nielsen, Beatrix M. G., et al.
Published: (2023)

BEARS Make Neuro-Symbolic Models Aware of their Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2024)

Sparse Shift Autoencoders for Identifying Concepts from Large Language Model Activations
by: Joshi, Shruti, et al.
Published: (2025)

A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts
by: Bortolotti, Samuele, et al.
Published: (2024)

Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
by: Emadi, Seyed Morteza
Published: (2026)

Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models
by: Rajendran, Goutham, et al.
Published: (2024)

From Logits to Hierarchies: Hierarchical Clustering made Simple
by: Palumbo, Emanuele, et al.
Published: (2024)

Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation
by: Zhong, Yisheng, et al.
Published: (2026)

Measuring Time-Series Dataset Similarity using Wasserstein Distance
by: Chen, Hongjie, et al.
Published: (2025)

Certified Robustness Under Bounded Levenshtein Distance
by: Rocamora, Elias Abad, et al.
Published: (2025)

Logit Dynamics in Softmax Policy Gradient Methods
by: Li, Yingru
Published: (2025)

SMART: Relation-Aware Learning of Geometric Representations for Knowledge Graphs
by: Amouzouvi, Kossi, et al.
Published: (2025)

Spectral Logit Sculpting: Adaptive Low-Rank Logit Transformation for Controlled Text Generation
by: Li, Jin, et al.
Published: (2025)

Logit Distillation on Manifolds: Mapping by Learning
by: Yang, Yiru, et al.
Published: (2026)

Algorithmic causal structure emerging through compression
by: Wendong, Liang, et al.
Published: (2025)

Peak-Controlled Logits Poisoning Attack in Federated Distillation
by: Tang, Yuhan, et al.
Published: (2024)

Model-Level GNN Explanations via Rule-to-Graph Readout for Logit Reconstruction
by: Lu, Shengyao, et al.
Published: (2025)

Molecular Graph Representation Learning via Structural Similarity Information
by: Yao, Chengyu, et al.
Published: (2024)

What Representational Similarity Measures Imply about Decodable Information
by: Harvey, Sarah E., et al.
Published: (2024)

An Explainable Multi-Task Similarity Measure: Integrating Accumulated Local Effects and Weighted Fréchet Distance
by: Hidalgo, Pablo, et al.
Published: (2026)

Auxiliary Reward Generation with Transition Distance Representation Learning
by: Li, Siyuan, et al.
Published: (2024)

SCALA: Split Federated Learning with Concatenated Activations and Logit Adjustments
by: Yang, Jiarong, et al.
Published: (2024)

From Projection to Prediction: Beyond Logits for Scalable Language Models
by: Dong, Jianbing, et al.
Published: (2025)

CLadder: Assessing Causal Reasoning in Language Models
by: Jin, Zhijing, et al.
Published: (2023)

From Molecules to Mixtures: Learning Representations of Olfactory Mixture Similarity using Inductive Biases
by: Tom, Gary, et al.
Published: (2025)

Early Detection of Multidrug Resistance Using Multivariate Time Series Analysis and Interpretable Patient-Similarity Representations
by: Escudero-Arnanz, Óscar, et al.
Published: (2025)

Formalising the Logit Shift Induced by LoRA: A Technical Note
by: Shi, Xiang, et al.
Published: (2026)

The Triangle of Similarity: A Multi-Faceted Framework for Comparing Neural Network Representations
by: Sirikova, Olha, et al.
Published: (2026)

Multiclass Local Calibration with the Jensen-Shannon Distance
by: Barbera, Cesare, et al.
Published: (2025)

SHRED: Retain-Set-Free Unlearning via Self-Distillation with Logit Demotion
by: Hu, Zizhao, et al.
Published: (2026)

Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization
by: Luo, Haocheng, et al.
Published: (2026)

An Adversarial Example for Direct Logit Attribution: Memory Management in GELU-4L
by: Janiak, Jett, et al.
Published: (2023)