Saved in:
| Main Authors: | Wang, Peihao, Yang, Shenghao, Li, Shu, Wang, Zhangyang, Li, Pan |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2307.04001 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning
by: Wang, Peihao, et al.
Published: (2025)
by: Wang, Peihao, et al.
Published: (2025)
Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study
by: Zhao, Jinze, et al.
Published: (2024)
by: Zhao, Jinze, et al.
Published: (2024)
Position: Weight Space Should Be a First-Class Generative AI Modality
by: Wang, Zhangyang, et al.
Published: (2026)
by: Wang, Zhangyang, et al.
Published: (2026)
Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
by: Wang, Peihao, et al.
Published: (2024)
by: Wang, Peihao, et al.
Published: (2024)
$\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space
by: Wang, Peihao, et al.
Published: (2026)
by: Wang, Peihao, et al.
Published: (2026)
When Do Graph Foundation Models Transfer? A Data-Centric Theory
by: Zhu, Jiajun, et al.
Published: (2026)
by: Zhu, Jiajun, et al.
Published: (2026)
Meta ControlNet: Enhancing Task Adaptation via Meta Learning
by: Yang, Junjie, et al.
Published: (2023)
by: Yang, Junjie, et al.
Published: (2023)
Seeking the Sufficiency and Necessity Causal Features in Multimodal Representation Learning
by: Chen, Boyu, et al.
Published: (2024)
by: Chen, Boyu, et al.
Published: (2024)
Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
by: Zhu, Jiajun, et al.
Published: (2025)
by: Zhu, Jiajun, et al.
Published: (2025)
Beyond Test-Time Memory: State-Space Optimal Control for LLM Reasoning
by: Wang, Peihao, et al.
Published: (2026)
by: Wang, Peihao, et al.
Published: (2026)
Linearly Separable Features in Shallow Nonlinear Networks: Width Scales Polynomially with Intrinsic Data Dimension
by: Xu, Alec S., et al.
Published: (2025)
by: Xu, Alec S., et al.
Published: (2025)
Unleashing the Potential of Acquisition Functions in High-Dimensional Bayesian Optimization
by: Zhao, Jiayu, et al.
Published: (2023)
by: Zhao, Jiayu, et al.
Published: (2023)
Sparse Mixture-of-Experts for Compositional Generalization: Empirical Evidence and Theoretical Foundations of Optimal Sparsity
by: Zhao, Jinze, et al.
Published: (2024)
by: Zhao, Jinze, et al.
Published: (2024)
Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning
by: Zhang, Zhen, et al.
Published: (2026)
by: Zhang, Zhen, et al.
Published: (2026)
Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
by: Cai, Ruisi, et al.
Published: (2024)
by: Cai, Ruisi, et al.
Published: (2024)
Minimal Sufficient Representations for Self-interpretable Deep Neural Networks
by: Tan, Zhiyao, et al.
Published: (2026)
by: Tan, Zhiyao, et al.
Published: (2026)
Estimating Conditional Average Treatment Effects via Sufficient Representation Learning
by: Shi, Pengfei, et al.
Published: (2024)
by: Shi, Pengfei, et al.
Published: (2024)
Action-Sufficient Goal Representations
by: Hyeon, Jinu, et al.
Published: (2026)
by: Hyeon, Jinu, et al.
Published: (2026)
Graph as Point Set
by: Wang, Xiyuan, et al.
Published: (2024)
by: Wang, Xiyuan, et al.
Published: (2024)
PIPA: Preference Alignment as Prior-Informed Statistical Estimation
by: Li, Junbo, et al.
Published: (2025)
by: Li, Junbo, et al.
Published: (2025)
Open-Set Fault Diagnosis in Multimode Processes via Fine-Grained Deep Feature Representation
by: Li, Guangqiang, et al.
Published: (2025)
by: Li, Guangqiang, et al.
Published: (2025)
Residual Feature Integration is Sufficient to Prevent Negative Transfer
by: Xu, Yichen, et al.
Published: (2025)
by: Xu, Yichen, et al.
Published: (2025)
Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk
by: Li, Zhangheng, et al.
Published: (2024)
by: Li, Zhangheng, et al.
Published: (2024)
Spectral Condition for $μ$P under Width-Depth Scaling
by: Zheng, Chenyu, et al.
Published: (2026)
by: Zheng, Chenyu, et al.
Published: (2026)
Taming Mode Collapse in Score Distillation for Text-to-3D Generation
by: Wang, Peihao, et al.
Published: (2023)
by: Wang, Peihao, et al.
Published: (2023)
LLM4XCE: Large Language Models for Extremely Large-Scale Massive MIMO Channel Estimation
by: Li, Renbin, et al.
Published: (2025)
by: Li, Renbin, et al.
Published: (2025)
Data-light Uncertainty Set Merging with Admissibility
by: Qin, Shenghao, et al.
Published: (2024)
by: Qin, Shenghao, et al.
Published: (2024)
Polynomial Threshold Functions of Bounded Tree-Width: Some Explainability and Complexity Aspects
by: Chubarian, Karine, et al.
Published: (2025)
by: Chubarian, Karine, et al.
Published: (2025)
Fair Sufficient Representation Learning
by: Zhou, Xueyu, et al.
Published: (2025)
by: Zhou, Xueyu, et al.
Published: (2025)
Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning
by: Li, Zijian, et al.
Published: (2025)
by: Li, Zijian, et al.
Published: (2025)
Virtual Width Networks
by: Seed, et al.
Published: (2025)
by: Seed, et al.
Published: (2025)
DeepSuM: Deep Sufficient Modality Learning Framework
by: Gao, Zhe, et al.
Published: (2025)
by: Gao, Zhe, et al.
Published: (2025)
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
by: Liang, Zhiyuan, et al.
Published: (2025)
by: Liang, Zhiyuan, et al.
Published: (2025)
Diffusion-aided Task-oriented Semantic Communications with Model Inversion Attack
by: Wang, Xuesong, et al.
Published: (2025)
by: Wang, Xuesong, et al.
Published: (2025)
Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation
by: Chen, Xuexin, et al.
Published: (2024)
by: Chen, Xuexin, et al.
Published: (2024)
Local Urysohn Width: A Topological Complexity Measure for Classification
by: Li, Xin
Published: (2026)
by: Li, Xin
Published: (2026)
Unifying Invariant and Variant Features for Graph Out-of-Distribution via Probability of Necessity and Sufficiency
by: Chen, Xuexin, et al.
Published: (2024)
by: Chen, Xuexin, et al.
Published: (2024)
Transfer Learning in Infinite Width Feature Learning Networks
by: Lauditi, Clarissa, et al.
Published: (2025)
by: Lauditi, Clarissa, et al.
Published: (2025)
Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis
by: Yang, Hongru, et al.
Published: (2024)
by: Yang, Hongru, et al.
Published: (2024)
Similar Items
-
Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning
by: Wang, Peihao, et al.
Published: (2025) -
Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study
by: Zhao, Jinze, et al.
Published: (2024) -
Position: Weight Space Should Be a First-Class Generative AI Modality
by: Wang, Zhangyang, et al.
Published: (2026) -
Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models
by: Wang, Haoyu, et al.
Published: (2025) -
Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
by: Wang, Peihao, et al.
Published: (2024)