| Main Authors: | Racz, Daniel, Gonzalez, Martin, Petreczky, Mihaly, Benczur, Andras, Daroczy, Balint |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.10054 |
Similar Items
Length independent generalization bounds for deep SSM architectures via Rademacher contraction and stability constraints
by: Rácz, Dániel, et al.
Published: (2024)
A finite-sample bound for identifying partially observed linear switched systems from a single trajectory
by: Racz, Daniel, et al.
Published: (2025)
Meta-model Neural Process for Probabilistic Power Flow under Varying N-1 System Topologies
by: Ly, Sel, et al.
Published: (2025)
Beyond validation loss: Clinically-tailored optimization metrics improve a model's clinical performance
by: Delahunt, Charles B., et al.
Published: (2026)
Intelligent Routing for Sparse Demand Forecasting: A Comparative Evaluation of Selection Strategies
by: Zhang, Qiwen
Published: (2025)
Knowledge Abstraction for Knowledge-based Semantic Communication: A Generative Causality Invariant Approach
by: Nguyen, Minh-Duong, et al.
Published: (2025)
Does DQN Learn?
by: Gopalan, Aditya, et al.
Published: (2022)
Progressive Feedforward Collapse of ResNet Training
by: Wang, Sicong, et al.
Published: (2024)
ECG-FM: An Open Electrocardiogram Foundation Model
by: McKeen, Kaden, et al.
Published: (2024)
Are Transformers with One Layer Self-Attention Using Low-Rank Weight Matrices Universal Approximators?
by: Kajitsuka, Tokio, et al.
Published: (2023)
Enhancing PyKEEN with Multiple Negative Sampling Solutions for Knowledge Graph Embedding Models
by: d'Amato, Claudia, et al.
Published: (2025)
Understanding Input Selectivity in Mamba: Impact on Approximation Power, Memorization, and Associative Recall Capacity
by: Huang, Ningyuan, et al.
Published: (2025)
Driving down Poisson error can offset classification error in clinical tasks
by: Delahunt, Charles B., et al.
Published: (2024)
Machine Learning for Physical Simulation Challenge Results and Retrospective Analysis: Power Grid Use Case
by: Leyli-Abadi, Milad, et al.
Published: (2025)
Learning Actionable World Models for Industrial Process Control
by: Yan, Peng, et al.
Published: (2025)
A ZeNN architecture to avoid the Gaussian trap
by: Carvalho, Luís, et al.
Published: (2025)
Variance Reduced Policy Gradient Method for Multi-Objective Reinforcement Learning
by: Guidobene, Davide, et al.
Published: (2025)
A Hormetic Approach to the Value-Loading Problem: Preventing the Paperclip Apocalypse?
by: Henry, Nathan I. N., et al.
Published: (2024)
The Curious Case of In-Training Compression of State Space Models
by: Chahine, Makram, et al.
Published: (2025)
GenAIOps for GenAI Model-Agility
by: Ueno, Ken, et al.
Published: (2024)
Regime Change Hypothesis: Foundations for Decoupled Dynamics in Neural Network Training
by: Pérez-Corral, Cristian, et al.
Published: (2026)
Enhancing Feature Selection and Interpretability in AI Regression Tasks Through Feature Attribution
by: Hinterleitner, Alexander, et al.
Published: (2024)
Customizing Graph Neural Networks using Path Reweighting
by: Chen, Jianpeng, et al.
Published: (2021)
Correcting Stochastic Update Bias in Preconditioned Language Model Optimizers
by: Nayak, Nikhil, et al.
Published: (2026)
On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics
by: Shafieinejad, Masoumeh, et al.
Published: (2026)
TraXion: Rethinking Pre-training Frameworks for Mobility and Beyond
by: Hsu, Shang-Ling, et al.
Published: (2026)
Approximating Discrimination Within Models When Faced With Several Non-Binary Sensitive Attributes
by: Bian, Yijun, et al.
Published: (2024)
Does Machine Bring in Extra Bias in Learning? Approximating Fairness in Models Promptly
by: Bian, Yijun, et al.
Published: (2024)
Autonomous Cyber Resilience via a Co-Evolutionary Arms Race within a Fortified Digital Twin Sandbox
by: Malikussaid, et al.
Published: (2025)
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning
by: Bellinger, Colin, et al.
Published: (2023)
On the Invariants of Softmax Attention
by: Lee, Wonsuk
Published: (2026)
Interpretability Can Be Actionable
by: Orgad, Hadas, et al.
Published: (2026)
Achieving Distributive Justice in Federated Learning via Uncertainty Quantification
by: Carey, Alycia, et al.
Published: (2025)
Return of the Schema: Building Complete Datasets for Machine Learning and Reasoning on Knowledge Graphs
by: Diliso, Ivan, et al.
Published: (2026)
ATEX-CF: Attack-Informed Counterfactual Explanations for Graph Neural Networks
by: Zhang, Yu, et al.
Published: (2026)
Improving Time Series Classification with Representation Soft Label Smoothing
by: Ma, Hengyi, et al.
Published: (2024)
Energy-Efficient Deep Learning Without Backpropagation: A Rigorous Evaluation of Forward-Only Algorithms
by: Spyra, Przemysław, et al.
Published: (2025)
Self-Directed Task Identification
by: Gould, Timothy, et al.
Published: (2026)
A Rapid Review of Clustering Algorithms
by: Yin, Hui, et al.
Published: (2024)
Graph Neural Networks Need Cluster-Normalize-Activate Modules
by: Skryagin, Arseny, et al.
Published: (2024)