:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Peiwen, Yuan, Liu, Henan, Changsheng, Zhu, Wang, Yuyi
Format:	Preprint
Published:	2022
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2208.13315
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FedAPA: Federated Learning with Adaptive Prototype Aggregation Toward Heterogeneous Wi-Fi CSI-based Crowd Counting
by: Guo, Jingtao, et al.
Published: (2025)

Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization
by: Chen, Xi, et al.
Published: (2026)

On the Mathematical Relationship Between Layer Normalization and Dynamic Activation Functions
by: Stollenwerk, Felix
Published: (2025)

A Method on Searching Better Activation Functions
by: Sun, Haoyuan, et al.
Published: (2024)

MetaSymNet: A Tree-like Symbol Network with Adaptive Architecture and Activation Functions
by: Li, Yanjie, et al.
Published: (2023)

RepAct: The Re-parameterizable Adaptive Activation Function
by: Wu, Xian, et al.
Published: (2024)

GRANOLA: Adaptive Normalization for Graph Neural Networks
by: Eliasof, Moshe, et al.
Published: (2024)

Continuous-Time Analysis of Adaptive Optimization and Normalization
by: Gould, Rhys, et al.
Published: (2024)

Learning to refine domain knowledge for biological network inference
by: Li, Peiwen, et al.
Published: (2024)

CLeAN: Continual Learning Adaptive Normalization in Dynamic Environments
by: Marasco, Isabella, et al.
Published: (2026)

RealTCD: Temporal Causal Discovery from Interventional Data with Large Language Model
by: Li, Peiwen, et al.
Published: (2024)

FusionCast: Enhancing Precipitation Nowcasting with Asymmetric Cross-Modal Fusion and Future Radar Priors
by: Wang, Henan, et al.
Published: (2026)

SmartMixed: A Two-Phase Training Strategy for Adaptive Activation Function Learning in Neural Networks
by: Omidvar, Amin
Published: (2025)

Global Convergence in Neural ODEs: Impact of Activation Functions
by: Gao, Tianxiang, et al.
Published: (2025)

DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
by: Li, Rongqing, et al.
Published: (2024)

Mining Generalizable Activation Functions
by: Vitvitskyi, Alex, et al.
Published: (2026)

SCALA: Split Federated Learning with Concatenated Activations and Logit Adjustments
by: Yang, Jiarong, et al.
Published: (2024)

Causal-aware Graph Neural Architecture Search under Distribution Shifts
by: Li, Peiwen, et al.
Published: (2024)

WiSparse: Boosting LLM Inference Efficiency with Weight-Aware Mixed Activation Sparsity
by: Chen, Lei, et al.
Published: (2026)

Frequency Adaptive Normalization For Non-stationary Time Series Forecasting
by: Ye, Weiwei, et al.
Published: (2024)

SpherE: Expressive and Interpretable Knowledge Graph Embedding for Set Retrieval
by: Li, Zihao, et al.
Published: (2024)

On the Expressive Power and Limitations of Multi-Layer SSMs
by: Zubić, Nikola, et al.
Published: (2026)

TimeGMM: Single-Pass Probabilistic Forecasting via Adaptive Gaussian Mixture Models with Reversible Normalization
by: Liu, Lei, et al.
Published: (2026)

More Expressive Feedforward Layers: Part I. Token-Adaptive Mixing of Activations
by: Wang, Mingze, et al.
Published: (2026)

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
by: Chen, Xi, et al.
Published: (2024)

Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective
by: Zheng, Guanhua, et al.
Published: (2025)

Revisiting Essential and Nonessential Settings of Evidential Deep Learning
by: Chen, Mengyuan, et al.
Published: (2024)

ScoresActivation: A New Activation Function for Model Agnostic Global Explainability by Design
by: Covaci, Emanuel, et al.
Published: (2025)

IBNorm: Information-Bottleneck Inspired Normalization for Representation Learning
by: Zou, Xiandong, et al.
Published: (2025)

DyTTP: Trajectory Prediction with Normalization-Free Transformers
by: Zhu, JianLin, et al.
Published: (2025)

TimeAPN: Adaptive Amplitude-Phase Non-Stationarity Normalization for Time Series Forecasting
by: Hu, Yue, et al.
Published: (2026)

Adaptive Normalization Mamba with Multi Scale Trend Decomposition and Patch MoE Encoding
by: Jeon, MinCheol
Published: (2025)

Efficient Search for Customized Activation Functions with Gradient Descent
by: Strack, Lukas, et al.
Published: (2024)

Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
by: Yuan, Peiwen, et al.
Published: (2025)

Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2025)

FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression
by: Gao, Yifei, et al.
Published: (2025)

ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
by: Zhang, Zhengyan, et al.
Published: (2024)

Adaptive Transformer Modelling of Density Function for Nonparametric Survival Analysis
by: Zhang, Xin, et al.
Published: (2024)

Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking
by: Qian, Shengsheng, et al.
Published: (2023)

HoReN: Normalized Hopfield Retrieval for Large-Scale Sequential Model Editing
by: Fang, Yuan, et al.
Published: (2026)