:: Library Catalog

Bibliographic Details
Main Authors:	Li, Hongkang; Zhang, Shuai; Zhang, Yihua; Wang, Meng; Liu, Sijia; Chen, Pin-Yu
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2403.07310

Similar Items

When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
by: Li, Hongkang, et al.
Published: (2025)

Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis
by: Li, Hongkang, et al.
Published: (2024)

Visual prompting reimagined: The power of the Activation Prompts
by: Zhang, Yihua, et al.
Published: (2026)

Large deviations of one-hidden-layer neural networks
by: Hirsch, Christian, et al.
Published: (2024)

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
by: Li, Hongkang, et al.
Published: (2024)

How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
by: Li, Hongkang, et al.
Published: (2024)

LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data
by: Wang, Changsheng, et al.
Published: (2025)

Two-hidden-layer ReLU neural networks and finite elements
by: Jin, Pengzhan
Published: (2024)

Predictive power of a Bayesian effective action for fully-connected one hidden layer neural networks in the proportional limit
by: Baglioni, P., et al.
Published: (2024)

Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
by: Li, Hongkang, et al.
Published: (2024)

Can Mamba Learn In Context with Outliers? A Theoretical Generalization Analysis
by: Li, Hongkang, et al.
Published: (2025)

Generalization performance of narrow one-hidden layer networks in the teacher-student setting
by: Ortiz, Rodrigo Pérez, et al.
Published: (2025)

A Theoretical Analysis of Mamba's Training Dynamics: Filtering Relevant Features for Generalization in State Space Models
by: Shandirasegaran, Mugunthan, et al.
Published: (2026)

Exact capacity of the wide hidden layer treelike neural networks with generic activations
by: Stojnic, Mihailo
Published: (2024)

Enforcing hidden physics in physics-informed neural networks
by: Chen, Nanxi, et al.
Published: (2025)

How does online shopping affect offline price sensitivity?
by: Biswas, Shirsho, et al.
Published: (2025)

Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning
by: Chen, Yiwei, et al.
Published: (2025)

How does communication affect breastfeeding?
by: Tripdatabase
Published: (2025)

The Power of Few: Accelerating and Enhancing Data Reweighting with Coreset Selection
by: Jafari, Mohammad, et al.
Published: (2024)

How does training shape the Riemannian geometry of neural network representations?
by: Zavatone-Veth, Jacob A., et al.
Published: (2023)

Neutron star envelopes with machine learning: a single-hidden-layer neural network application
by: Kovlakas, K., et al.
Published: (2025)

How does temperature affect rural income: Channels and implication of adaptation
by: Qingen Gai, et al.
Published: (2024)

Probabilistic forecasting of power system imbalance using neural network-based ensembles
by: Van Gompel, Jonas, et al.
Published: (2024)

Essentially degenerate hidden nodal lines in two-dimensional magnetic layer groups
by: Li, Xiao-Ping, et al.
Published: (2025)

How does node centrality in a financial network affect asset price prediction?
by: Xu, Yuhong, et al.
Published: (2023)

How does zinc affect wound care?
by: Tripdatabase
Published: (2026)

How does urbanization affect natural selection?
by: Anne Charmantier, et al.
Published: (2024)

Kernel shape renormalization explains output-output correlations in finite Bayesian one-hidden-layer networks
by: Baglioni, P., et al.
Published: (2024)

Unlearners Can Lie: Evaluating and Improving Honesty in LLM Unlearning
by: Gu, Renjie, et al.
Published: (2026)

Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models
by: Li, Yize, et al.
Published: (2024)

Event triggered synchronization of generalized variable‐order fractional neural networks with time delay
by: Weiwei Zhang, et al.
Published: (2025)

Forgetting to Forget: Attention Sink as A Gateway for Backdooring LLM Unlearning
by: Shang, Bingqi, et al.
Published: (2025)

How does over-squashing affect the power of GNNs?
by: Di Giovanni, Francesco, et al.
Published: (2023)

Unlocking potential: How flexibility i‐deals promote job crafting through social interaction among persons with disabilities
by: Xue Zhang, et al.
Published: (2025)

Excitation and inhibition imbalance affects dynamical complexity through symmetries
by: Ouellet, Mathieu, et al.
Published: (2022)

One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)

Physics-informed neural networks for hidden boundary detection and flow field reconstruction
by: Zhu, Yongzheng, et al.
Published: (2025)

An unsupervised tour through the hidden pathways of deep neural networks
by: Doimo, Diego
Published: (2025)

GDP nowcasting with artificial neural networks: How much does long-term memory matter?
by: Németh, Kristóf, et al.
Published: (2023)

On the uncertainty principle of neural networks
by: Zhang, Jun-Jie, et al.
Published: (2022)