| Main Authors: | Yan, Ruiqing, Du, Xingbo, Deng, Haoyu, Zheng, Linghan, Sun, Qiuzhuang, Hu, Jifang, Shao, Yuhang, Jiang, Penghao, Jiang, Jinrong, Zhao, Lian |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2407.01601 |
Similar Items
RecurFormer: Not All Transformer Heads Need Self-Attention
by: Yan, Ruiqing, et al.
Published: (2024)
UniPhy: Unifying Riemannian-Clifford Geometry and Biorthogonal Dynamics for Planetary-Scale Continuous Weather Modeling
by: Yan, Ruiqing, et al.
Published: (2026)
Multi-Modal Video Feature Extraction for Popularity Prediction
by: Liu, Haixu, et al.
Published: (2025)
Tighnari: Multi-modal Plant Species Prediction Based on Hierarchical Cross-Attention Using Graph-Based and Vision Backbone-Extracted Features
by: Liu, Haixu, et al.
Published: (2025)
The Effect of Attention Head Count on Transformer Approximation
by: Yu, Penghao, et al.
Published: (2025)
nnY-Net: Swin-NeXt with Cross-Attention for 3D Medical Images Segmentation
by: Liu, Haixu, et al.
Published: (2025)
Optimal Abort Policy for Mission-Critical Systems under Imperfect Condition Monitoring
by: Sun, Qiuzhuang, et al.
Published: (2025)
SARIMAX-Based Power Outage Prediction During Extreme Weather Events
by: Ye, Haoran, et al.
Published: (2025)
Anomalous shift and optical vorticity in the steady photovoltaic current
by: Zhu, Penghao, et al.
Published: (2023)
IPGPhormer: Interpretable Pathology Graph-Transformer for Survival Analysis
by: Tang, Guo, et al.
Published: (2025)
Anomalous Point-Gap Interactions Unveil the Mirage Bath
by: Sun, Yue, et al.
Published: (2025)
InfoFlow: A Framework for Multi-Layer Transformer Analysis
by: Yu, Penghao, et al.
Published: (2026)
Attentive Convolution: Unifying the Expressivity of Self-Attention with Convolutional Efficiency
by: Yu, Hao, et al.
Published: (2025)
RulePlanner: All-in-One Reinforcement Learner for Unifying Design Rules in 3D Floorplanning
by: Zhong, Ruizhe, et al.
Published: (2026)
The $L_{p}$-Brunn-Minkowski inequalities for variational functionals with $0\leq p<1$
by: Hu, Jinrong
Published: (2024)
Uniqueness of solutions to the isotropic $L_{p}$ Gaussian Minkowski problem
by: Hu, Jinrong
Published: (2024)
The generalized Gaussian log-Minkowski problem
by: Hu, Jinrong
Published: (2024)
The dual Minkowski problem for positive indices
by: Hu, Jinrong
Published: (2025)
Mixture of Distributions Matters: Dynamic Sparse Attention for Efficient Video Diffusion Transformers
by: Liu, Yuxi, et al.
Published: (2026)
IFViT: Interpretable Fixed-Length Representation for Fingerprint Matching via Vision Transformer
by: Qiu, Yuhang, et al.
Published: (2024)
Allocation of Parameters in Transformers
by: Yu, Ruoxi, et al.
Published: (2025)
Efficient Data-aware Distance Comparison Operations for High-Dimensional Approximate Nearest Neighbor Search
by: Deng, Liwei, et al.
Published: (2024)
Online Planning of Power Flows for Power Systems Against Bushfires Using Spatial Context
by: Xu, Jianyu, et al.
Published: (2024)
SG-OIF: A Stability-Guided Online Influence Framework for Reliable Vision Data
by: Rao, Penghao, et al.
Published: (2025)
Google is all you need: Semi-Supervised Transfer Learning Strategy For Light Multimodal Multi-Task Classification Model
by: Liu, Haixu, et al.
Published: (2025)
EatGAN: An Edge-Attention Guided Generative Adversarial Network for Single Image Super-Resolution
by: Rao, Penghao, et al.
Published: (2025)
Characterizing circle graphs with binomial partial Petrial polynomials
by: Feng, Ruiqing, et al.
Published: (2025)
Data Efficient Any Transformer-to-Mamba Distillation via Attention Bridge
by: Wang, Penghao, et al.
Published: (2025)
From the Brunn-Minkowski inequality to a class of generalized Poincaré-type inequalities for torsional rigidity
by: Fang, Niufa, et al.
Published: (2023)
Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA
by: Du, Yukun, et al.
Published: (2025)
Unveiling the Strong Interaction origin of Baryon Masses with Lattice QCD
by: Hu, Bolun, et al.
Published: (2024)
Anomalous Localization Crossovers From the Competition Between Disorder and Lattice Order in Moiré Lattices
by: Qingying Quan, et al.
Published: (2026)
End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction
by: Zhang, Haoyu, et al.
Published: (2026)
Dynamic Differential Linear Attention: Enhancing Linear Diffusion Transformer for High-Quality Image Generation
by: Cao, Boyuan, et al.
Published: (2026)
Capillary John ellipsoid theorem with applications to capillary curvature problems
by: Hu, Jinrong, et al.
Published: (2026)
Task Structure Reverses Layerwise State Encoding in Sequence Models
by: Jiang, Yuhang
Published: (2026)
PInVerify: An Offline Embodied Benchmark for Active Instance Verification
by: Jiang, Yuhang
Published: (2026)
Detection vs. Execution: Single-Bucket Probes Miss Half the Mamba-2 State Sink
by: Jiang, Yuhang
Published: (2026)
“Confined Eutectic” Strategy for Visual Refrigeration Responsive Fluorescent Materials with Easy Preparation and Multi‐Color Tunability
by: Jifang Zhao, et al.
Published: (2025)
Distributed Channel Estimation and Optimization for 6D Movable Antenna: Unveiling Directional Sparsity
by: Shao, Xiaodan, et al.
Published: (2024)