Saved in:
| Main Authors: | Li, Hongkang, Zhang, Shuai, Zhang, Yihua, Wang, Meng, Liu, Sijia, Chen, Pin-Yu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.07310 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
by: Li, Hongkang, et al.
Published: (2025)
by: Li, Hongkang, et al.
Published: (2025)
Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis
by: Li, Hongkang, et al.
Published: (2024)
by: Li, Hongkang, et al.
Published: (2024)
Visual prompting reimagined: The power of the Activation Prompts
by: Zhang, Yihua, et al.
Published: (2026)
by: Zhang, Yihua, et al.
Published: (2026)
Large deviations of one-hidden-layer neural networks
by: Hirsch, Christian, et al.
Published: (2024)
by: Hirsch, Christian, et al.
Published: (2024)
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
by: Li, Hongkang, et al.
Published: (2024)
by: Li, Hongkang, et al.
Published: (2024)
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
by: Li, Hongkang, et al.
Published: (2024)
by: Li, Hongkang, et al.
Published: (2024)
LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data
by: Wang, Changsheng, et al.
Published: (2025)
by: Wang, Changsheng, et al.
Published: (2025)
Two-hidden-layer ReLU neural networks and finite elements
by: Jin, Pengzhan
Published: (2024)
by: Jin, Pengzhan
Published: (2024)
Predictive power of a Bayesian effective action for fully-connected one hidden layer neural networks in the proportional limit
by: Baglioni, P., et al.
Published: (2024)
by: Baglioni, P., et al.
Published: (2024)
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
by: Li, Hongkang, et al.
Published: (2024)
by: Li, Hongkang, et al.
Published: (2024)
Can Mamba Learn In Context with Outliers? A Theoretical Generalization Analysis
by: Li, Hongkang, et al.
Published: (2025)
by: Li, Hongkang, et al.
Published: (2025)
Generalization performance of narrow one-hidden layer networks in the teacher-student setting
by: Ortiz, Rodrigo Pérez, et al.
Published: (2025)
by: Ortiz, Rodrigo Pérez, et al.
Published: (2025)
A Theoretical Analysis of Mamba's Training Dynamics: Filtering Relevant Features for Generalization in State Space Models
by: Shandirasegaran, Mugunthan, et al.
Published: (2026)
by: Shandirasegaran, Mugunthan, et al.
Published: (2026)
Exact capacity of the \emph{wide} hidden layer treelike neural networks with generic activations
by: Stojnic, Mihailo
Published: (2024)
by: Stojnic, Mihailo
Published: (2024)
Enforcing hidden physics in physics-informed neural networks
by: Chen, Nanxi, et al.
Published: (2025)
by: Chen, Nanxi, et al.
Published: (2025)
How does online shopping affect offline price sensitivity?
by: Biswas, Shirsho, et al.
Published: (2025)
by: Biswas, Shirsho, et al.
Published: (2025)
Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-Tuning and Can Be Mitigated by Machine Unlearning
by: Chen, Yiwei, et al.
Published: (2025)
by: Chen, Yiwei, et al.
Published: (2025)
How does communication affect breastfeeding?
by: Tripdatabase
Published: (2025)
by: Tripdatabase
Published: (2025)
The Power of Few: Accelerating and Enhancing Data Reweighting with Coreset Selection
by: Jafari, Mohammad, et al.
Published: (2024)
by: Jafari, Mohammad, et al.
Published: (2024)
How does training shape the Riemannian geometry of neural network representations?
by: Zavatone-Veth, Jacob A., et al.
Published: (2023)
by: Zavatone-Veth, Jacob A., et al.
Published: (2023)
Neutron star envelopes with machine learning: a single-hidden-layer neural network application
by: Kovlakas, K., et al.
Published: (2025)
by: Kovlakas, K., et al.
Published: (2025)
How does temperature affect rural income: Channels and implication of adaptation
by: Qingen Gai, et al.
Published: (2024)
by: Qingen Gai, et al.
Published: (2024)
Probabilistic forecasting of power system imbalance using neural network-based ensembles
by: Van Gompel, Jonas, et al.
Published: (2024)
by: Van Gompel, Jonas, et al.
Published: (2024)
Essentially degenerate hidden nodal lines in two-dimensional magnetic layer groups
by: Li, Xiao-Ping, et al.
Published: (2025)
by: Li, Xiao-Ping, et al.
Published: (2025)
How does node centrality in a financial network affect asset price prediction?
by: Xu, Yuhong, et al.
Published: (2023)
by: Xu, Yuhong, et al.
Published: (2023)
How does zinc affect wound care?
by: Tripdatabase
Published: (2026)
by: Tripdatabase
Published: (2026)
How does urbanization affect natural selection?
by: Anne Charmantier, et al.
Published: (2024)
by: Anne Charmantier, et al.
Published: (2024)
Kernel shape renormalization explains output-output correlations in finite Bayesian one-hidden-layer networks
by: Baglioni, P., et al.
Published: (2024)
by: Baglioni, P., et al.
Published: (2024)
Unlearners Can Lie: Evaluating and Improving Honesty in LLM Unlearning
by: Gu, Renjie, et al.
Published: (2026)
by: Gu, Renjie, et al.
Published: (2026)
Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models
by: Li, Yize, et al.
Published: (2024)
by: Li, Yize, et al.
Published: (2024)
Event triggered synchronization of generalized variable‐order fractional neural networks with time delay
by: Weiwei Zhang, et al.
Published: (2025)
by: Weiwei Zhang, et al.
Published: (2025)
Forgetting to Forget: Attention Sink as A Gateway for Backdooring LLM Unlearning
by: Shang, Bingqi, et al.
Published: (2025)
by: Shang, Bingqi, et al.
Published: (2025)
How does over-squashing affect the power of GNNs?
by: Di Giovanni, Francesco, et al.
Published: (2023)
by: Di Giovanni, Francesco, et al.
Published: (2023)
Unlocking potential: How flexibility i‐deals promote job crafting through social interaction among persons with disabilities
by: Xue Zhang, et al.
Published: (2025)
by: Xue Zhang, et al.
Published: (2025)
Excitation and inhibition imbalance affects dynamical complexity through symmetries
by: Ouellet, Mathieu, et al.
Published: (2022)
by: Ouellet, Mathieu, et al.
Published: (2022)
One Token Embedding Is Enough to Deadlock Your Large Reasoning Model
by: Zhang, Mohan, et al.
Published: (2025)
by: Zhang, Mohan, et al.
Published: (2025)
Physics-informed neural networks for hidden boundary detection and flow field reconstruction
by: Zhu, Yongzheng, et al.
Published: (2025)
by: Zhu, Yongzheng, et al.
Published: (2025)
An unsupervised tour through the hidden pathways of deep neural networks
by: Doimo, Diego
Published: (2025)
by: Doimo, Diego
Published: (2025)
GDP nowcasting with artificial neural networks: How much does long-term memory matter?
by: Németh, Kristóf, et al.
Published: (2023)
by: Németh, Kristóf, et al.
Published: (2023)
On the uncertainty principle of neural networks
by: Zhang, Jun-Jie, et al.
Published: (2022)
by: Zhang, Jun-Jie, et al.
Published: (2022)
Similar Items
-
When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers
by: Li, Hongkang, et al.
Published: (2025) -
Learning on Transformers is Provable Low-Rank and Sparse: A One-layer Analysis
by: Li, Hongkang, et al.
Published: (2024) -
Visual prompting reimagined: The power of the Activation Prompts
by: Zhang, Yihua, et al.
Published: (2026) -
Large deviations of one-hidden-layer neural networks
by: Hirsch, Christian, et al.
Published: (2024) -
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
by: Li, Hongkang, et al.
Published: (2024)