| Main Authors: | Piotrowski, Mateusz, Riechers, Paul M., Filan, Daniel, Shai, Adam S. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.01954 |
Similar Items
Neural networks leverage nominally quantum and post-quantum representations
by: Riechers, Paul M., et al.
Published: (2025)
Transformers represent belief state geometry in their residual stream
by: Shai, Adam S., et al.
Published: (2024)
Rank-1 LoRAs Encode Interpretable Reasoning Signals
by: Ward, Jake, et al.
Published: (2025)
Next-token pretraining implies in-context learning
by: Riechers, Paul M., et al.
Published: (2025)
Transformers learn factored representations
by: Shai, Adam, et al.
Published: (2026)
Geometry and Dynamics of LayerNorm
by: Riechers, Paul M.
Published: (2024)
In-context learning agents are asymmetric belief updaters
by: Schubert, Johannes A., et al.
Published: (2024)
Positive concave deep equilibrium models
by: Gabor, Mateusz, et al.
Published: (2024)
Fixed points of nonnegative neural networks
by: Piotrowski, Tomasz J., et al.
Published: (2021)
Concept-based explainability for an EEG transformer model
by: Gjølbye, Anders, et al.
Published: (2023)
An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)
A multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers
by: Stępka, Ignacy, et al.
Published: (2024)
Learning the greatest common divisor: explaining transformer predictions
by: Charton, François
Published: (2023)
Accurate estimation of feature importance faithfulness for tree models
by: Gajewski, Mateusz, et al.
Published: (2024)
survex: an R package for explaining machine learning survival models
by: Spytek, Mikołaj, et al.
Published: (2023)
LLMs are not (consistently) Bayesian: Quantifying internal (in)consistencies of LLMs' probabilistic beliefs
by: Chen, Chacha, et al.
Published: (2026)
MolPILE -- large-scale, diverse dataset for molecular representation learning
by: Adamczyk, Jakub, et al.
Published: (2025)
Balancing Expressivity and Robustness: Constrained Rational Activations for Reinforcement Learning
by: Surdej, Rafał, et al.
Published: (2025)
Robust Conformal Prediction Using Privileged Information
by: Feldman, Shai, et al.
Published: (2024)
How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation
by: Feldman, Shai, et al.
Published: (2026)
Deep Dreams Are Made of This: Visualizing Monosemantic Features in Diffusion Models
by: Szokalski, Adam, et al.
Published: (2026)
Baseflow identification via explainable AI with Kolmogorov-Arnold networks
by: Liu, Chuyang, et al.
Published: (2024)
Do LLMs have core beliefs?
by: Sokol, Anna, et al.
Published: (2026)
Statistical and structural identifiability in representation learning
by: Nelson, Walter, et al.
Published: (2026)
Learning from positive and unlabeled examples -Finite size sample bounds
by: Mansouri, Farnam, et al.
Published: (2025)
Online Learning with Improving Agents: Multiclass, Budgeted Agents and Bandit Learners
by: Ashkezari, Sajad, et al.
Published: (2026)
Multinomial belief networks for healthcare data
by: Donker, H. C., et al.
Published: (2023)
LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes
by: Suk, Julian, et al.
Published: (2024)
DeepShare: Sharing ReLU Across Channels and Layers for Efficient Private Inference
by: Bornfeld, Yonathan, et al.
Published: (2025)
Learning to Score
by: Kriger, Yogev, et al.
Published: (2025)
Global explainability of a deep abstaining classifier
by: Dhaubhadel, Sayera, et al.
Published: (2025)
Contrastive representations of high-dimensional, structured treatments
by: Andreu, Oriol Corcoll, et al.
Published: (2024)
VARSHAP: Addressing Global Dependency Problems in Explainable AI with Variance-Based Local Feature Attribution
by: Gajewski, Mateusz, et al.
Published: (2025)
Amortized Causal Discovery with Prior-Fitted Networks
by: Sypniewski, Mateusz, et al.
Published: (2025)
Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting
by: Feldman, Shai, et al.
Published: (2025)
A Novel Data-Dependent Learning Paradigm for Large Hypothesis Classes
by: Pour, Alireza F., et al.
Published: (2025)
Federated style aware transformer aggregation of representations
by: Jeon, Mincheol, et al.
Published: (2025)
Evaluating representation learning on the protein structure universe
by: Jamasb, Arian R., et al.
Published: (2024)
Universal crystal material property prediction via multi-view geometric fusion in graph transformers
by: Zhang, Liang, et al.
Published: (2025)
Characterizing higher-order representations through generative diffusion models explains human decoded neurofeedback performance
by: Asrari, Hojjat Azimi, et al.
Published: (2025)