:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autore principale:	Park, Jin Hyun
Natura:	Preprint
Pubblicazione:	2022
Soggetti:	Machine Learning Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2201.11653
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Adaptive multiple optimal learning factors for neural network training
di: Challagundla, Jeshwanth
Pubblicazione: (2024)

Anon: Extrapolating Adaptivity Beyond SGD and Adam
di: Zhang, Yiheng, et al.
Pubblicazione: (2026)

Flow reconstruction in time-varying geometries using graph neural networks
di: Danciu, Bogdan A., et al.
Pubblicazione: (2024)

LayerCollapse: Adaptive compression of neural networks
di: Shabgahi, Soheil Zibakhsh, et al.
Pubblicazione: (2023)

Conditional computation in neural networks: principles and research trends
di: Scardapane, Simone, et al.
Pubblicazione: (2024)

Understanding the learned look-ahead behavior of chess neural networks
di: Cruz, Diogo
Pubblicazione: (2025)

Time-varying Interaction Graph ODE for Dynamic Graph Representation Learning
di: Wang, Xiaoyi, et al.
Pubblicazione: (2026)

Discrete Dictionary-based Decomposition Layer for Structured Representation Learning
di: Park, Taewon, et al.
Pubblicazione: (2024)

Enhancing DP-SGD through Non-monotonous Adaptive Scaling Gradient Weight
di: Huang, Tao, et al.
Pubblicazione: (2024)

Adaptive Selection of LoRA Components in Privacy-Preserving Federated Learning
di: Kim, Myoungjun, et al.
Pubblicazione: (2026)

Why Adam Can Beat SGD: Second-Moment Normalization Yields Sharper Tails
di: Jin, Ruinan, et al.
Pubblicazione: (2026)

Less is more: Embracing sparsity and interpolation with Esiformer for time series forecasting
di: Guo, Yangyang, et al.
Pubblicazione: (2024)

Interpolating neural network: A novel unification of machine learning and interpolation theory
di: Park, Chanwook, et al.
Pubblicazione: (2024)

TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
di: Bae, Junik, et al.
Pubblicazione: (2024)

Multiverse: Language-Conditioned Multi-Game Level Blending via Shared Representation
di: Baek, In-Chang, et al.
Pubblicazione: (2026)

PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations
di: Kang, Namgyu, et al.
Pubblicazione: (2024)

MORE-CLEAR: Multimodal Offline Reinforcement learning for Clinical notes Leveraged Enhanced State Representation
di: Lim, Yooseok, et al.
Pubblicazione: (2025)

Bootstrap SGD: Algorithmic Stability and Robustness
di: Christmann, Andreas, et al.
Pubblicazione: (2024)

MoE-PHDS: One MoE checkpoint for flexible runtime sparsity
di: Hannah, Lauren. A, et al.
Pubblicazione: (2025)

Feature learning as alignment: a structural property of gradient descent in non-linear neural networks
di: Beaglehole, Daniel, et al.
Pubblicazione: (2024)

On permutation-invariant neural networks
di: Kimura, Masanari, et al.
Pubblicazione: (2024)

Sobolev acceleration for neural networks
di: Oh, Jong Kwon, et al.
Pubblicazione: (2025)

Attention mechanisms in neural networks
di: Hays, Hasi
Pubblicazione: (2026)

APOLLO: SGD-like Memory, AdamW-level Performance
di: Zhu, Hanqing, et al.
Pubblicazione: (2024)

Target noise: A pre-training based neural network initialization for efficient high resolution learning
di: Wang, Shaowen, et al.
Pubblicazione: (2026)

Zero-Incentive Dynamics: a look at reward sparsity through the lens of unrewarded subgoals
di: Molinghen, Yannick, et al.
Pubblicazione: (2025)

Accumulative SGD Influence Estimation for Data Attribution
di: Shi, Yunxiao, et al.
Pubblicazione: (2025)

Scaling Laws of SignSGD in Linear Regression: When Does It Outperform SGD?
di: Kim, Jihwan, et al.
Pubblicazione: (2026)

Principles of Lipschitz continuity in neural networks
di: Luo, Róisín
Pubblicazione: (2026)

Linearity-based neural network compression
di: Dobler, Silas, et al.
Pubblicazione: (2025)

Dynamic sparsity in tree-structured feed-forward layers at scale
di: Sedghi, Reza, et al.
Pubblicazione: (2026)

On-site estimation of battery electrochemical parameters via transfer learning based physics-informed neural network approach
di: Yeregui, Josu, et al.
Pubblicazione: (2025)

RQP-SGD: Differential Private Machine Learning through Noisy SGD and Randomized Quantization
di: Feng, Ce, et al.
Pubblicazione: (2024)

Diagonalisation SGD: Fast & Convergent SGD for Non-Differentiable Models via Reparameterisation and Smoothing
di: Wagner, Dominik, et al.
Pubblicazione: (2024)

From 2:4 to 8:16 sparsity patterns in LLMs for Outliers and Weights with Variance Correction
di: Maximov, Egor, et al.
Pubblicazione: (2025)

Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs
di: Zhao, Kang, et al.
Pubblicazione: (2024)

Worker Disagreement Reveals Sharp Directions in Local SGD
di: Dimlioglu, Tolga, et al.
Pubblicazione: (2026)

Similarity-based context aware continual learning for spiking neural networks
di: Han, Bing, et al.
Pubblicazione: (2024)

Applying graph neural network to SupplyGraph for supply chain network
di: Han, Kihwan
Pubblicazione: (2024)

Active teacher selection for reward learning
di: Freedman, Rachel, et al.
Pubblicazione: (2023)