Saved in:
| Main Authors: | Abohwo, Jason, Mosen, Thomas |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.01874 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Weight-based Decomposition: A Case for Bilinear MLPs
by: Pearce, Michael T., et al.
Published: (2024)
by: Pearce, Michael T., et al.
Published: (2024)
Beyond Gaussian Initializations: Signal Preserving Weight Initialization for Odd-Sigmoid Activations
by: Lee, Hyunwoo, et al.
Published: (2025)
by: Lee, Hyunwoo, et al.
Published: (2025)
Network-Aware Bilinear Tokenization for Brain Functional Connectivity Representation Learning
by: Milecki, Leo, et al.
Published: (2026)
by: Milecki, Leo, et al.
Published: (2026)
NegMerge: Sign-Consensual Weight Merging for Machine Unlearning
by: Kim, Hyo Seo, et al.
Published: (2024)
by: Kim, Hyo Seo, et al.
Published: (2024)
Bilinear Convolution Decomposition for Causal RL Interpretability
by: Oozeer, Narmeen, et al.
Published: (2024)
by: Oozeer, Narmeen, et al.
Published: (2024)
Learning Equivariant Functions via Quadratic Forms
by: Karjol, Pavan, et al.
Published: (2025)
by: Karjol, Pavan, et al.
Published: (2025)
Beyond Johnson-Lindenstrauss: Uniform Bounds for Sketched Bilinear Forms
by: Deb, Rohan, et al.
Published: (2025)
by: Deb, Rohan, et al.
Published: (2025)
Bilinear representation mitigates reversal curse and enables consistent model editing
by: Kim, Dong-Kyum, et al.
Published: (2025)
by: Kim, Dong-Kyum, et al.
Published: (2025)
Geometric Regularization in Mixture-of-Experts: The Disconnect Between Weights and Activations
by: Kim, Hyunjun
Published: (2026)
by: Kim, Hyunjun
Published: (2026)
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
by: Lin, Haohong, et al.
Published: (2024)
by: Lin, Haohong, et al.
Published: (2024)
Mining Generalizable Activation Functions
by: Vitvitskyi, Alex, et al.
Published: (2026)
by: Vitvitskyi, Alex, et al.
Published: (2026)
Spectra-to-Structure and Structure-to-Spectra Inference Across the Periodic Table
by: Wang, Yufeng, et al.
Published: (2025)
by: Wang, Yufeng, et al.
Published: (2025)
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs
by: Cheng, Wenhua, et al.
Published: (2023)
by: Cheng, Wenhua, et al.
Published: (2023)
MIBP-Cert: Certified Training against Data Perturbations with Mixed-Integer Bilinear Programs
by: Lorenz, Tobias, et al.
Published: (2024)
by: Lorenz, Tobias, et al.
Published: (2024)
Towards a More Complete Theory of Function Preserving Transforms
by: Painter, Michael
Published: (2024)
by: Painter, Michael
Published: (2024)
ANAct: Adaptive Normalization for Activation Functions
by: Peiwen, Yuan, et al.
Published: (2022)
by: Peiwen, Yuan, et al.
Published: (2022)
WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference
by: Chen, Sihan, et al.
Published: (2025)
by: Chen, Sihan, et al.
Published: (2025)
A Method on Searching Better Activation Functions
by: Sun, Haoyuan, et al.
Published: (2024)
by: Sun, Haoyuan, et al.
Published: (2024)
ScoresActivation: A New Activation Function for Model Agnostic Global Explainability by Design
by: Covaci, Emanuel, et al.
Published: (2025)
by: Covaci, Emanuel, et al.
Published: (2025)
WiSparse: Boosting LLM Inference Efficiency with Weight-Aware Mixed Activation Sparsity
by: Chen, Lei, et al.
Published: (2026)
by: Chen, Lei, et al.
Published: (2026)
Intrinsic Structure as a Proxy for Saliency: SVD-Based Weight Preservation for Mixed-Precision Quantization in Large Language Models
by: Landge, Shashank, et al.
Published: (2025)
by: Landge, Shashank, et al.
Published: (2025)
Beyond Expression Similarity: Contrastive Learning Recovers Functional Gene Associations from Protein Interaction Structure
by: Dury, Jason
Published: (2026)
by: Dury, Jason
Published: (2026)
Global Convergence in Neural ODEs: Impact of Activation Functions
by: Gao, Tianxiang, et al.
Published: (2025)
by: Gao, Tianxiang, et al.
Published: (2025)
Efficient Search for Customized Activation Functions with Gradient Descent
by: Strack, Lukas, et al.
Published: (2024)
by: Strack, Lukas, et al.
Published: (2024)
Find A Winning Sign: Sign Is All We Need to Win the Lottery
by: Oh, Junghun, et al.
Published: (2025)
by: Oh, Junghun, et al.
Published: (2025)
Unsupervised Learning for Quadratic Assignment
by: Min, Yimeng, et al.
Published: (2025)
by: Min, Yimeng, et al.
Published: (2025)
Hallucination Detection via Activations of Open-Weight Proxy Analyzers
by: Singh, Akshita, et al.
Published: (2026)
by: Singh, Akshita, et al.
Published: (2026)
Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function
by: Gai, Kuo, et al.
Published: (2024)
by: Gai, Kuo, et al.
Published: (2024)
Non-Interfering Weight Fields: Treating Model Parameters as a Continuously Extensible Function
by: Chaudhry, Sarim
Published: (2026)
by: Chaudhry, Sarim
Published: (2026)
Certified Signed Graph Unlearning
by: Zhao, Junpeng, et al.
Published: (2025)
by: Zhao, Junpeng, et al.
Published: (2025)
Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression
by: Sakai, Akira, et al.
Published: (2026)
by: Sakai, Akira, et al.
Published: (2026)
Towards Trustworthy Vital Sign Forecasting: Leveraging Uncertainty for Prediction Intervals
by: Wang, Li Rong, et al.
Published: (2025)
by: Wang, Li Rong, et al.
Published: (2025)
FLAIN: Mitigating Backdoor Attacks in Federated Learning via Flipping Weight Updates of Low-Activation Input Neurons
by: Ding, Binbin, et al.
Published: (2024)
by: Ding, Binbin, et al.
Published: (2024)
Geometry Preserving Loss Functions Promote Improved Adaptation of Blackbox Generative Model
by: Mitra, Sinjini, et al.
Published: (2026)
by: Mitra, Sinjini, et al.
Published: (2026)
Measuring What Matters: Intrinsic Distance Preservation as a Robust Metric for Embedding Quality
by: Hart, Steven N., et al.
Published: (2024)
by: Hart, Steven N., et al.
Published: (2024)
Spectra: Rethinking Optimizers for LLMs Under Spectral Anisotropy
by: Huang, Zhendong, et al.
Published: (2026)
by: Huang, Zhendong, et al.
Published: (2026)
Beyond Weighted Summation: Learnable Nonlinear Aggregation Functions for Robust Artificial Neurons
by: Bozyigit, Berke Deniz
Published: (2026)
by: Bozyigit, Berke Deniz
Published: (2026)
Catapult Dynamics and Phase Transitions in Quadratic Nets
by: Meltzer, David, et al.
Published: (2023)
by: Meltzer, David, et al.
Published: (2023)
Feature-Function Curvature Analysis: A Geometric Framework for Explaining Differentiable Models
by: Najafi, Hamed, et al.
Published: (2025)
by: Najafi, Hamed, et al.
Published: (2025)
COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection
by: Cheon, Jaewon, et al.
Published: (2025)
by: Cheon, Jaewon, et al.
Published: (2025)
Similar Items
-
Weight-based Decomposition: A Case for Bilinear MLPs
by: Pearce, Michael T., et al.
Published: (2024) -
Beyond Gaussian Initializations: Signal Preserving Weight Initialization for Odd-Sigmoid Activations
by: Lee, Hyunwoo, et al.
Published: (2025) -
Network-Aware Bilinear Tokenization for Brain Functional Connectivity Representation Learning
by: Milecki, Leo, et al.
Published: (2026) -
NegMerge: Sign-Consensual Weight Merging for Machine Unlearning
by: Kim, Hyo Seo, et al.
Published: (2024) -
Bilinear Convolution Decomposition for Causal RL Interpretability
by: Oozeer, Narmeen, et al.
Published: (2024)