:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Li, Yuqing, Luo, Tao, Zhou, Qixuan
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Machine Learning
Online-Zugang:	https://arxiv.org/abs/2404.04859
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Quantifying Training Difficulty and Accelerating Convergence in Neural Network-Based PDE Solvers
von: Chen, Chuqi, et al.
Veröffentlicht: (2024)

Demystifying Distributed Training of Graph Neural Networks for Link Prediction
von: Huang, Xin, et al.
Veröffentlicht: (2025)

A priori Estimates for Deep Residual Network in Continuous-time Reinforcement Learning
von: Yin, Shuyu, et al.
Veröffentlicht: (2024)

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
von: Shen, Xuan, et al.
Veröffentlicht: (2024)

ProPINN: Demystifying Propagation Failures in Physics-Informed Neural Networks
von: Wu, Haixu, et al.
Veröffentlicht: (2025)

Residual Attention Physics-Informed Neural Networks for Robust Multiphysics Simulation of Steady-State Electrothermal Energy Systems
von: Zhou, Yuqing, et al.
Veröffentlicht: (2026)

Grokking as the Transition from Lazy to Rich Training Dynamics
von: Kumar, Tanishq, et al.
Veröffentlicht: (2023)

Demystifying Higher-Order Graph Neural Networks
von: Besta, Maciej, et al.
Veröffentlicht: (2024)

Demystifying Oversmoothing in Attention-Based Graph Neural Networks
von: Wu, Xinyi, et al.
Veröffentlicht: (2023)

Convergence of Stochastic Gradient Langevin Dynamics in the Lazy Training Regime
von: Oberweis, Noah, et al.
Veröffentlicht: (2025)

PRISM: Demystifying Retention and Interaction in Mid-Training
von: Runwal, Bharat, et al.
Veröffentlicht: (2026)

AutoSGNN: Automatic Propagation Mechanism Discovery for Spectral Graph Neural Networks
von: Mo, Shibing, et al.
Veröffentlicht: (2024)

Has the Deep Neural Network learned the Stochastic Process? An Evaluation Viewpoint
von: Kumar, Harshit, et al.
Veröffentlicht: (2024)

Phase Diagram of Initial Condensation for Two-layer Neural Networks
von: Chen, Zhengan, et al.
Veröffentlicht: (2023)

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks
von: Dominé, Clémentine C. J., et al.
Veröffentlicht: (2024)

Lazy FSCA for Unsupervised Variable Selection
von: Zocco, Federico, et al.
Veröffentlicht: (2021)

Mixed Dynamics In Linear Networks: Unifying the Lazy and Active Regimes
von: Tu, Zhenfeng, et al.
Veröffentlicht: (2024)

Enhancing Trustworthiness of Graph Neural Networks with Rank-Based Conformal Training
von: Wang, Ting, et al.
Veröffentlicht: (2025)

Demystifying Network Foundation Models
von: Beltiukov, Sylee, et al.
Veröffentlicht: (2025)

DR-CircuitGNN: Training Acceleration of Heterogeneous Circuit Graph Neural Network on GPUs
von: Luo, Yuebo, et al.
Veröffentlicht: (2025)

Provable Acceleration of Nesterov's Accelerated Gradient for Rectangular Matrix Factorization and Linear Neural Networks
von: Xu, Zhenghao, et al.
Veröffentlicht: (2024)

APEX: Probing Neural Networks via Activation Perturbation
von: Ren, Tao, et al.
Veröffentlicht: (2026)

Partially Lazy Gradient Descent for Smoothed Online Learning
von: Mhaisen, Naram, et al.
Veröffentlicht: (2026)

John Ellipsoids via Lazy Updates
von: Woodruff, David P., et al.
Veröffentlicht: (2025)

Sharpened Lazy Incremental Quasi-Newton Method
von: Lahoti, Aakash, et al.
Veröffentlicht: (2023)

Why Are DMD Students Lazy? Understanding the Copying Behavior in Few-Step Distillation
von: Li, Shucheng, et al.
Veröffentlicht: (2026)

Loss Spike in Training Neural Networks
von: Li, Xiaolong, et al.
Veröffentlicht: (2023)

LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models
von: Lim, Juntaek, et al.
Veröffentlicht: (2024)

Demystify Protein Generation with Hierarchical Conditional Diffusion Models
von: Ling, Zinan, et al.
Veröffentlicht: (2025)

Lazy Data Practices Harm Fairness Research
von: Simson, Jan, et al.
Veröffentlicht: (2024)

Three-Class Text Sentiment Analysis Based on LSTM
von: Qixuan, Yin
Veröffentlicht: (2024)

Demystifying Prediction Powered Inference
von: Song, Yilin, et al.
Veröffentlicht: (2026)

Uncertainty in Graph Neural Networks: A Survey
von: Wang, Fangxin, et al.
Veröffentlicht: (2024)

The Importance of Being Lazy: Scaling Limits of Continual Learning
von: Graldi, Jacopo, et al.
Veröffentlicht: (2025)

Flow Matching from Viewpoint of Proximal Operators
von: Fukumizu, Kenji, et al.
Veröffentlicht: (2026)

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
von: He, Jianliang, et al.
Veröffentlicht: (2025)

Frequency-adaptive Multi-scale Deep Neural Networks
von: Huang, Jizu, et al.
Veröffentlicht: (2024)

BSFA: Leveraging the Subspace Dichotomy to Accelerate Neural Network Training
von: Zhou, Wenjie, et al.
Veröffentlicht: (2025)

Uncovering Critical Sets of Deep Neural Networks via Sample-Independent Critical Lifting
von: Zhang, Leyang, et al.
Veröffentlicht: (2025)

Policy Compatible Skill Incremental Learning via Lazy Learning Interface
von: Lee, Daehee, et al.
Veröffentlicht: (2025)