:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Piotrowski, Mateusz, Riechers, Paul M., Filan, Daniel, Shai, Adam S.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.01954
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Neural networks leverage nominally quantum and post-quantum representations
by: Riechers, Paul M., et al.
Published: (2025)

Transformers represent belief state geometry in their residual stream
by: Shai, Adam S., et al.
Published: (2024)

Rank-1 LoRAs Encode Interpretable Reasoning Signals
by: Ward, Jake, et al.
Published: (2025)

Next-token pretraining implies in-context learning
by: Riechers, Paul M., et al.
Published: (2025)

Transformers learn factored representations
by: Shai, Adam, et al.
Published: (2026)

Geometry and Dynamics of LayerNorm
by: Riechers, Paul M.
Published: (2024)

In-context learning agents are asymmetric belief updaters
by: Schubert, Johannes A., et al.
Published: (2024)

Positive concave deep equilibrium models
by: Gabor, Mateusz, et al.
Published: (2024)

Fixed points of nonnegative neural networks
by: Piotrowski, Tomasz J., et al.
Published: (2021)

Concept-based explainability for an EEG transformer model
by: Gjølbye, Anders, et al.
Published: (2023)

An explainable transformer circuit for compositional generalization
by: Tang, Cheng, et al.
Published: (2025)

A multi-criteria approach for selecting an explanation from the set of counterfactuals produced by an ensemble of explainers
by: Stępka, Ignacy, et al.
Published: (2024)

Learning the greatest common divisor: explaining transformer predictions
by: Charton, François
Published: (2023)

Accurate estimation of feature importance faithfulness for tree models
by: Gajewski, Mateusz, et al.
Published: (2024)

survex: an R package for explaining machine learning survival models
by: Spytek, Mikołaj, et al.
Published: (2023)

LLMs are not (consistently) Bayesian: Quantifying internal (in)consistencies of LLMs' probabilistic beliefs
by: Chen, Chacha, et al.
Published: (2026)

MolPILE -- large-scale, diverse dataset for molecular representation learning
by: Adamczyk, Jakub, et al.
Published: (2025)

Balancing Expressivity and Robustness: Constrained Rational Activations for Reinforcement Learning
by: Surdej, Rafał, et al.
Published: (2025)

Robust Conformal Prediction Using Privileged Information
by: Feldman, Shai, et al.
Published: (2024)

How Many Iterations to Jailbreak? Dynamic Budget Allocation for Multi-Turn LLM Evaluation
by: Feldman, Shai, et al.
Published: (2026)

Deep Dreams Are Made of This: Visualizing Monosemantic Features in Diffusion Models
by: Szokalski, Adam, et al.
Published: (2026)

Baseflow identification via explainable AI with Kolmogorov-Arnold networks
by: Liu, Chuyang, et al.
Published: (2024)

Do LLMs have core beliefs?
by: Sokol, Anna, et al.
Published: (2026)

Statistical and structural identifiability in representation learning
by: Nelson, Walter, et al.
Published: (2026)

Learning from positive and unlabeled examples -Finite size sample bounds
by: Mansouri, Farnam, et al.
Published: (2025)

Online Learning with Improving Agents: Multiclass, Budgeted Agents and Bandit Learners
by: Ashkezari, Sajad, et al.
Published: (2026)

Multinomial belief networks for healthcare data
by: Donker, H. C., et al.
Published: (2023)

LaB-GATr: geometric algebra transformers for large biomedical surface and volume meshes
by: Suk, Julian, et al.
Published: (2024)

DeepShare: Sharing ReLU Across Channels and Layers for Efficient Private Inference
by: Bornfeld, Yonathan, et al.
Published: (2025)

Learning to Score
by: Kriger, Yogev, et al.
Published: (2025)

Global explainability of a deep abstaining classifier
by: Dhaubhadel, Sayera, et al.
Published: (2025)

Contrastive representations of high-dimensional, structured treatments
by: Andreu, Oriol Corcoll, et al.
Published: (2024)

VARSHAP: Addressing Global Dependency Problems in Explainable AI with Variance-Based Local Feature Attribution
by: Gajewski, Mateusz, et al.
Published: (2025)

Amortized Causal Discovery with Prior-Fitted Networks
by: Sypniewski, Mateusz, et al.
Published: (2025)

Conformal Prediction with Corrupted Labels: Uncertain Imputation and Robust Re-weighting
by: Feldman, Shai, et al.
Published: (2025)

A Novel Data-Dependent Learning Paradigm for Large Hypothesis Classes
by: Pour, Alireza F., et al.
Published: (2025)

Federated style aware transformer aggregation of representations
by: Jeon, Mincheol, et al.
Published: (2025)

Evaluating representation learning on the protein structure universe
by: Jamasb, Arian R., et al.
Published: (2024)

Universal crystal material property prediction via multi-view geometric fusion in graph transformers
by: Zhang, Liang, et al.
Published: (2025)

Characterizing higher-order representations through generative diffusion models explains human decoded neurofeedback performance
by: Asrari, Hojjat Azimi, et al.
Published: (2025)