:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Peihao, Yang, Shenghao, Li, Shu, Wang, Zhangyang, Li, Pan
Format:	Preprint
Published:	2023
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2307.04001
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning
by: Wang, Peihao, et al.
Published: (2025)

Generalization Error Analysis for Sparse Mixture-of-Experts: A Preliminary Study
by: Zhao, Jinze, et al.
Published: (2024)

Position: Weight Space Should Be a First-Class Generative AI Modality
by: Wang, Zhangyang, et al.
Published: (2026)

Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models
by: Wang, Haoyu, et al.
Published: (2025)

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing
by: Wang, Peihao, et al.
Published: (2024)

$\nabla$-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space
by: Wang, Peihao, et al.
Published: (2026)

When Do Graph Foundation Models Transfer? A Data-Centric Theory
by: Zhu, Jiajun, et al.
Published: (2026)

Meta ControlNet: Enhancing Task Adaptation via Meta Learning
by: Yang, Junjie, et al.
Published: (2023)

Seeking the Sufficiency and Necessity Causal Features in Multimodal Representation Learning
by: Chen, Boyu, et al.
Published: (2024)

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding
by: Zhu, Jiajun, et al.
Published: (2025)

Beyond Test-Time Memory: State-Space Optimal Control for LLM Reasoning
by: Wang, Peihao, et al.
Published: (2026)

Linearly Separable Features in Shallow Nonlinear Networks: Width Scales Polynomially with Intrinsic Data Dimension
by: Xu, Alec S., et al.
Published: (2025)

Unleashing the Potential of Acquisition Functions in High-Dimensional Bayesian Optimization
by: Zhao, Jiayu, et al.
Published: (2023)

Sparse Mixture-of-Experts for Compositional Generalization: Empirical Evidence and Theoretical Foundations of Optimal Sparsity
by: Zhao, Jinze, et al.
Published: (2024)

Causally Sufficient and Necessary Feature Expansion for Class-Incremental Learning
by: Zhang, Zhen, et al.
Published: (2026)

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design
by: Cai, Ruisi, et al.
Published: (2024)

Minimal Sufficient Representations for Self-interpretable Deep Neural Networks
by: Tan, Zhiyao, et al.
Published: (2026)

Estimating Conditional Average Treatment Effects via Sufficient Representation Learning
by: Shi, Pengfei, et al.
Published: (2024)

Action-Sufficient Goal Representations
by: Hyeon, Jinu, et al.
Published: (2026)

Graph as Point Set
by: Wang, Xiyuan, et al.
Published: (2024)

PIPA: Preference Alignment as Prior-Informed Statistical Estimation
by: Li, Junbo, et al.
Published: (2025)

Open-Set Fault Diagnosis in Multimode Processes via Fine-Grained Deep Feature Representation
by: Li, Guangqiang, et al.
Published: (2025)

Residual Feature Integration is Sufficient to Prevent Negative Transfer
by: Xu, Yichen, et al.
Published: (2025)

Shake to Leak: Fine-tuning Diffusion Models Can Amplify the Generative Privacy Risk
by: Li, Zhangheng, et al.
Published: (2024)

Spectral Condition for $μ$P under Width-Depth Scaling
by: Zheng, Chenyu, et al.
Published: (2026)

Taming Mode Collapse in Score Distillation for Text-to-3D Generation
by: Wang, Peihao, et al.
Published: (2023)

LLM4XCE: Large Language Models for Extremely Large-Scale Massive MIMO Channel Estimation
by: Li, Renbin, et al.
Published: (2025)

Data-light Uncertainty Set Merging with Admissibility
by: Qin, Shenghao, et al.
Published: (2024)

Polynomial Threshold Functions of Bounded Tree-Width: Some Explainability and Complexity Aspects
by: Chubarian, Karine, et al.
Published: (2025)

Fair Sufficient Representation Learning
by: Zhou, Xueyu, et al.
Published: (2025)

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning
by: Li, Zijian, et al.
Published: (2025)

Virtual Width Networks
by: Seed, et al.
Published: (2025)

DeepSuM: Deep Sufficient Modality Learning Framework
by: Gao, Zhe, et al.
Published: (2025)

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
by: Liang, Zhiyuan, et al.
Published: (2025)

Diffusion-aided Task-oriented Semantic Communications with Model Inversion Attack
by: Wang, Xuesong, et al.
Published: (2025)

Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation
by: Chen, Xuexin, et al.
Published: (2024)

Local Urysohn Width: A Topological Complexity Measure for Classification
by: Li, Xin
Published: (2026)

Unifying Invariant and Variant Features for Graph Out-of-Distribution via Probability of Necessity and Sufficiency
by: Chen, Xuexin, et al.
Published: (2024)

Transfer Learning in Infinite Width Feature Learning Networks
by: Lauditi, Clarissa, et al.
Published: (2025)

Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis
by: Yang, Hongru, et al.
Published: (2024)