| Main Authors: | da Silva, Marvin F., Dangel, Felix, Oore, Sageev |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.05409 |
Similar Items
Generalizing the Geometry of Model Merging Through Frechet Averages
by: da Silva, Marvin F., et al.
Published: (2026)
DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers
by: Sastry, Chandramouli, et al.
Published: (2023)
Test-Time Training for Depression Detection
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Seeing Syntax: Uncovering Syntactic Learning Limitations in Vision-Language Models
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Self-Distillation of Hidden Layers for Self-Supervised Representation Learning
by: Lowe, Scott C., et al.
Published: (2026)
Convolutions and More as Einsum: A Tensor Network Perspective with Advances for Second-Order Methods
by: Dangel, Felix
Published: (2023)
Self-Supervised Embeddings for Detecting Individual Symptoms of Depression
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Lowering PyTorch's Memory Consumption for Selective Differentiation
by: Bhatia, Samarth, et al.
Published: (2024)
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
by: Ormaniec, Weronika, et al.
Published: (2024)
Predicting Individual Depression Symptoms from Acoustic Features During Speech
by: Rodriguez, Sebastian, et al.
Published: (2024)
Sensitivity of Generative VLMs to Semantically and Lexically Altered Prompts
by: Dumpala, Sri Harsha, et al.
Published: (2024)
An Empirical Study into Clustering of Unseen Datasets with Self-Supervised Encoders
by: Lowe, Scott C., et al.
Published: (2024)
On the Disconnect Between Theory and Practice of Neural Networks: Limits of the NTK Perspective
by: Wenger, Jonathan, et al.
Published: (2023)
Efficient Bilevel Optimization with KFAC-Based Hypergradients
by: Liao, Disen, et al.
Published: (2026)
SUGARCREPE++ Dataset: Vision-Language Model Sensitivity to Semantic and Lexical Alterations
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Kronecker-Factored Approximate Curvature for Physics-Informed Neural Networks
by: Dangel, Felix, et al.
Published: (2024)
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion
by: Huang, Yujia, et al.
Published: (2024)
Kronecker-factored Approximate Curvature (KFAC) From Scratch
by: Dangel, Felix, et al.
Published: (2025)
Collapsing Taylor Mode Automatic Differentiation
by: Dangel, Felix, et al.
Published: (2025)
Hiding in Plain Sight: Reframing Hardware Trojan Benchmarking as a Hide&Seek Modification
by: Sarihi, Amin, et al.
Published: (2024)
Improving Energy Natural Gradient Descent through Woodbury, Momentum, and Randomization
by: Guzmán-Cordero, Andrés, et al.
Published: (2025)
Fishers for Free? Approximating the Fisher Information Matrix by Recycling the Squared Gradient Accumulator
by: Li, YuXin, et al.
Published: (2025)
Sketching Low-Rank Plus Diagonal Matrices
by: Fernandez, Andres, et al.
Published: (2025)
Hide and Seek: Investigating Redundancy in Earth Observation Imagery
by: Papazafeiropoulos, Tasos, et al.
Published: (2026)
Robust Optimization Approach and Learning Based Hide-and-Seek Game for Resilient Network Design
by: Khosravi, Mohammad, et al.
Published: (2026)
Sharpness-Aware Teleportation on Riemannian Manifolds
by: Truong, Tuan, et al.
Published: (2023)
Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning
by: Elsayed, Mohamed, et al.
Published: (2024)
Hide-and-Seek Attribution: Weakly Supervised Segmentation of Vertebral Metastases in CT
by: Atad, Matan, et al.
Published: (2025)
Reparametrizing Shampoo and SOAP for Subspace Basis Updates and BFloat16 Storage
by: Milligan, Alan, et al.
Published: (2026)
Hide and Find: A Distributed Adversarial Attack on Federated Graph Learning
by: Liu, Jinshan, et al.
Published: (2026)
FedHide: Federated Learning by Hiding in the Neighbors
by: Park, Hyunsin, et al.
Published: (2024)
Position: Curvature Matrices Should Be Democratized via Linear Operators
by: Dangel, Felix, et al.
Published: (2025)
Hide and Seek in Noise Labels: Noise-Robust Collaborative Active Learning with LLM-Powered Assistance
by: Yuan, Bo, et al.
Published: (2025)
Isometric Immersion Learning with Riemannian Geometry
by: Chen, Zihao, et al.
Published: (2024)
Does SGD Seek Flatness or Sharpness? An Exactly Solvable Model
by: Xu, Yizhou, et al.
Published: (2026)
A High-Throughput Compute-Efficient POMDP Hide-And-Seek-Engine (HASE) for Multi-Agent Operations
by: Flavin, Timothy, et al.
Published: (2026)
Spectral-factorized Positive-definite Curvature Learning for NN Training
by: Lin, Wu, et al.
Published: (2025)
Understanding and Improving Shampoo and SOAP via Kullback-Leibler Minimization
by: Lin, Wu, et al.
Published: (2025)
Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective
by: Lin, Wu, et al.
Published: (2024)
Structured Inverse-Free Natural Gradient: Memory-Efficient & Numerically-Stable KFAC
by: Lin, Wu, et al.
Published: (2023)