Saved in:
| Main Authors: | Peiwen, Yuan, Liu, Henan, Changsheng, Zhu, Wang, Yuyi |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2208.13315 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FedAPA: Federated Learning with Adaptive Prototype Aggregation Toward Heterogeneous Wi-Fi CSI-based Crowd Counting
by: Guo, Jingtao, et al.
Published: (2025)
by: Guo, Jingtao, et al.
Published: (2025)
Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization
by: Chen, Xi, et al.
Published: (2026)
by: Chen, Xi, et al.
Published: (2026)
On the Mathematical Relationship Between Layer Normalization and Dynamic Activation Functions
by: Stollenwerk, Felix
Published: (2025)
by: Stollenwerk, Felix
Published: (2025)
A Method on Searching Better Activation Functions
by: Sun, Haoyuan, et al.
Published: (2024)
by: Sun, Haoyuan, et al.
Published: (2024)
MetaSymNet: A Tree-like Symbol Network with Adaptive Architecture and Activation Functions
by: Li, Yanjie, et al.
Published: (2023)
by: Li, Yanjie, et al.
Published: (2023)
RepAct: The Re-parameterizable Adaptive Activation Function
by: Wu, Xian, et al.
Published: (2024)
by: Wu, Xian, et al.
Published: (2024)
GRANOLA: Adaptive Normalization for Graph Neural Networks
by: Eliasof, Moshe, et al.
Published: (2024)
by: Eliasof, Moshe, et al.
Published: (2024)
Continuous-Time Analysis of Adaptive Optimization and Normalization
by: Gould, Rhys, et al.
Published: (2024)
by: Gould, Rhys, et al.
Published: (2024)
Learning to refine domain knowledge for biological network inference
by: Li, Peiwen, et al.
Published: (2024)
by: Li, Peiwen, et al.
Published: (2024)
CLeAN: Continual Learning Adaptive Normalization in Dynamic Environments
by: Marasco, Isabella, et al.
Published: (2026)
by: Marasco, Isabella, et al.
Published: (2026)
RealTCD: Temporal Causal Discovery from Interventional Data with Large Language Model
by: Li, Peiwen, et al.
Published: (2024)
by: Li, Peiwen, et al.
Published: (2024)
FusionCast: Enhancing Precipitation Nowcasting with Asymmetric Cross-Modal Fusion and Future Radar Priors
by: Wang, Henan, et al.
Published: (2026)
by: Wang, Henan, et al.
Published: (2026)
SmartMixed: A Two-Phase Training Strategy for Adaptive Activation Function Learning in Neural Networks
by: Omidvar, Amin
Published: (2025)
by: Omidvar, Amin
Published: (2025)
Global Convergence in Neural ODEs: Impact of Activation Functions
by: Gao, Tianxiang, et al.
Published: (2025)
by: Gao, Tianxiang, et al.
Published: (2025)
DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
by: Li, Rongqing, et al.
Published: (2024)
by: Li, Rongqing, et al.
Published: (2024)
Mining Generalizable Activation Functions
by: Vitvitskyi, Alex, et al.
Published: (2026)
by: Vitvitskyi, Alex, et al.
Published: (2026)
SCALA: Split Federated Learning with Concatenated Activations and Logit Adjustments
by: Yang, Jiarong, et al.
Published: (2024)
by: Yang, Jiarong, et al.
Published: (2024)
Causal-aware Graph Neural Architecture Search under Distribution Shifts
by: Li, Peiwen, et al.
Published: (2024)
by: Li, Peiwen, et al.
Published: (2024)
WiSparse: Boosting LLM Inference Efficiency with Weight-Aware Mixed Activation Sparsity
by: Chen, Lei, et al.
Published: (2026)
by: Chen, Lei, et al.
Published: (2026)
Frequency Adaptive Normalization For Non-stationary Time Series Forecasting
by: Ye, Weiwei, et al.
Published: (2024)
by: Ye, Weiwei, et al.
Published: (2024)
SpherE: Expressive and Interpretable Knowledge Graph Embedding for Set Retrieval
by: Li, Zihao, et al.
Published: (2024)
by: Li, Zihao, et al.
Published: (2024)
On the Expressive Power and Limitations of Multi-Layer SSMs
by: Zubić, Nikola, et al.
Published: (2026)
by: Zubić, Nikola, et al.
Published: (2026)
TimeGMM: Single-Pass Probabilistic Forecasting via Adaptive Gaussian Mixture Models with Reversible Normalization
by: Liu, Lei, et al.
Published: (2026)
by: Liu, Lei, et al.
Published: (2026)
More Expressive Feedforward Layers: Part I. Token-Adaptive Mixing of Activations
by: Wang, Mingze, et al.
Published: (2026)
by: Wang, Mingze, et al.
Published: (2026)
Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
by: Chen, Xi, et al.
Published: (2024)
by: Chen, Xi, et al.
Published: (2024)
Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
by: Zheng, Guanhua, et al.
Published: (2025)
by: Zheng, Guanhua, et al.
Published: (2025)
Revisiting Essential and Nonessential Settings of Evidential Deep Learning
by: Chen, Mengyuan, et al.
Published: (2024)
by: Chen, Mengyuan, et al.
Published: (2024)
ScoresActivation: A New Activation Function for Model Agnostic Global Explainability by Design
by: Covaci, Emanuel, et al.
Published: (2025)
by: Covaci, Emanuel, et al.
Published: (2025)
IBNorm: Information-Bottleneck Inspired Normalization for Representation Learning
by: Zou, Xiandong, et al.
Published: (2025)
by: Zou, Xiandong, et al.
Published: (2025)
DyTTP: Trajectory Prediction with Normalization-Free Transformers
by: Zhu, JianLin, et al.
Published: (2025)
by: Zhu, JianLin, et al.
Published: (2025)
TimeAPN: Adaptive Amplitude-Phase Non-Stationarity Normalization for Time Series Forecasting
by: Hu, Yue, et al.
Published: (2026)
by: Hu, Yue, et al.
Published: (2026)
Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding
by: Jeon, MinCheol
Published: (2025)
by: Jeon, MinCheol
Published: (2025)
Efficient Search for Customized Activation Functions with Gradient Descent
by: Strack, Lukas, et al.
Published: (2024)
by: Strack, Lukas, et al.
Published: (2024)
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2025)
by: Wang, Xinglin, et al.
Published: (2025)
FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression
by: Gao, Yifei, et al.
Published: (2025)
by: Gao, Yifei, et al.
Published: (2025)
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
by: Zhang, Zhengyan, et al.
Published: (2024)
by: Zhang, Zhengyan, et al.
Published: (2024)
Adaptive Transformer Modelling of Density Function for Nonparametric Survival Analysis
by: Zhang, Xin, et al.
Published: (2024)
by: Zhang, Xin, et al.
Published: (2024)
Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking
by: Qian, Shengsheng, et al.
Published: (2023)
by: Qian, Shengsheng, et al.
Published: (2023)
HoReN: Normalized Hopfield Retrieval for Large-Scale Sequential Model Editing
by: Fang, Yuan, et al.
Published: (2026)
by: Fang, Yuan, et al.
Published: (2026)
Similar Items
-
FedAPA: Federated Learning with Adaptive Prototype Aggregation Toward Heterogeneous Wi-Fi CSI-based Crowd Counting
by: Guo, Jingtao, et al.
Published: (2025) -
Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization
by: Chen, Xi, et al.
Published: (2026) -
On the Mathematical Relationship Between Layer Normalization and Dynamic Activation Functions
by: Stollenwerk, Felix
Published: (2025) -
A Method on Searching Better Activation Functions
by: Sun, Haoyuan, et al.
Published: (2024) -
MetaSymNet: A Tree-like Symbol Network with Adaptive Architecture and Activation Functions
by: Li, Yanjie, et al.
Published: (2023)