| Main Author: | Vessio, Gennaro |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.12723 |
Similar Items
What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)?
by: Loconte, Lorenzo, et al.
Published: (2024)
Neural network modelling of kinematic and dynamic features for signature verification
by: Diaz, Moises, et al.
Published: (2024)
Layer-wise Update Aggregation with Recycling for Communication-Efficient Federated Learning
by: Kim, Jisoo, et al.
Published: (2025)
Layer-wise Derivative Controlled Networks
by: Martnishn, Rowan, et al.
Published: (2026)
Modeling and benchmarking quantum optical neurons for efficient neural computation
by: Andrisani, Andrea, et al.
Published: (2025)
Is Aggregation the Only Choice? Federated Learning via Layer-wise Model Recombination
by: Hu, Ming, et al.
Published: (2023)
Layer-wise Weight Selection for Power-Efficient Neural Network Acceleration
by: Fang, Jiaxun, et al.
Published: (2025)
DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation
by: Shing, Makoto, et al.
Published: (2025)
Element-wise Modulation of Random Matrices for Efficient Neural Layers
by: Szorc, Maksymilian
Published: (2025)
Theoretical Learning Performance of Graph Neural Networks: The Impact of Jumping Connections and Layer-wise Sparsification
by: Sun, Jiawei, et al.
Published: (2025)
Interpreting Attention Layer Outputs with Sparse Autoencoders
by: Kissane, Connor, et al.
Published: (2024)
Depth-Aware Initialization for Stable and Efficient Neural Network Training
by: Pandey, Vijay
Published: (2025)
Geometric Layer-wise Approximation Rates for Deep Networks
by: Zhang, Shijun, et al.
Published: (2026)
A Generalized Tikhonov Layer for Interpretable-by-design Graph Neural Networks
by: Tremblay, Nicolas, et al.
Published: (2026)
LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging
by: Kim, Jinuk, et al.
Published: (2024)
Attention Consistency Regularization for Interpretable Early-Exit Neural Networks
by: Zhao, Yanhua
Published: (2026)
Layer-wise Linear Mode Connectivity
by: Adilova, Linara, et al.
Published: (2023)
Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis
by: Fartale, Harshwardhan, et al.
Published: (2025)
CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill
by: McDanel, Bradley, et al.
Published: (2026)
Memory-adaptive Depth-wise Heterogeneous Federated Learning
by: Zhang, Kai, et al.
Published: (2023)
LEAP: Layer-wise Exit-Aware Pretraining for Efficient Transformer Inference
by: Kapadia, Shashank, et al.
Published: (2026)
Efficient and Flexible Neural Network Training through Layer-wise Feedback Propagation
by: Weber, Leander, et al.
Published: (2023)
Predicting Drivers' Route Trajectories in Last-Mile Delivery Using A Pair-wise Attention-based Pointer Neural Network
by: Mo, Baichuan, et al.
Published: (2023)
LayerBoost: Layer-Aware Attention Reduction for Efficient LLMs
by: Souibgui, Mohamed Ali, et al.
Published: (2026)
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers
by: Bozic, Vukasin, et al.
Published: (2023)
LI-DSN: A Layer-wise Interactive Dual-Stream Network for EEG Decoding
by: Yue, Chenghao, et al.
Published: (2026)
How Interpretable Are Interpretable Graph Neural Networks?
by: Chen, Yongqiang, et al.
Published: (2024)
Massive Activations in Graph Neural Networks: Decoding Attention for Domain-Dependent Interpretability
by: Bini, Lorenzo, et al.
Published: (2024)
Fast Tensorization of Neural Networks via Slice-wise Feature Distillation
by: Hamreras, Safa, et al.
Published: (2026)
Scalable Model Merging with Progressive Layer-wise Distillation
by: Xu, Jing, et al.
Published: (2025)
LoaQ: Layer-wise Output Approximation Quantization
by: Lin, Li, et al.
Published: (2025)
LOTOS: Layer-wise Orthogonalization for Training Robust Ensembles
by: Ebrahimpour-Boroojeny, Ali, et al.
Published: (2024)
Optimal Depth of Neural Networks
by: Qi, Qian
Published: (2025)
On the Effect of Uncertainty on Layer-wise Inference Dynamics
by: Kim, Sunwoo, et al.
Published: (2025)
Controlled Model Debiasing through Minimal and Interpretable Updates
by: Di Gennaro, Federico, et al.
Published: (2025)
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
by: Wang, Zihao, et al.
Published: (2024)
Towards Federated Clustering: A Client-wise Private Graph Aggregation Framework
by: He, Guanxiong, et al.
Published: (2025)
FlowMixer: A Depth-Agnostic Neural Architecture for Interpretable Spatiotemporal Forecasting
by: Mehouachi, Fares B., et al.
Published: (2025)
Element-wise Attention Is All You Need
by: Feng, Guoxin
Published: (2025)
Node-wise Filtering in Graph Neural Networks: A Mixture of Experts Approach
by: Han, Haoyu, et al.
Published: (2024)