Saved in:
| Main Author: | Shi, Dai |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.17132 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EmoNeXt: an Adapted ConvNeXt for Facial Emotion Recognition
by: Boudouri, Yassine El, et al.
Published: (2025)
by: Boudouri, Yassine El, et al.
Published: (2025)
CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification
by: Yurdakul, Mustafa, et al.
Published: (2025)
by: Yurdakul, Mustafa, et al.
Published: (2025)
InceptionNeXt: When Inception Meets ConvNeXt
by: Yu, Weihao, et al.
Published: (2023)
by: Yu, Weihao, et al.
Published: (2023)
Ensemble of radiomics and ConvNeXt for breast cancer diagnosis
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2026)
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2026)
FourCastNeXt: Optimizing FourCastNet Training for Limited Compute
by: Guo, Edison, et al.
Published: (2024)
by: Guo, Edison, et al.
Published: (2024)
Reviving ConvNeXt for Efficient Convolutional Diffusion Models
by: Kwon, Taesung, et al.
Published: (2026)
by: Kwon, Taesung, et al.
Published: (2026)
Enhancing kelp forest detection in remote sensing images using crowdsourced labels with Mixed Vision Transformers and ConvNeXt segmentation models
by: Nasios, Ioannis
Published: (2025)
by: Nasios, Ioannis
Published: (2025)
Deep Learning-Based Rock Particulate Classification Using Attention-Enhanced ConvNeXt
by: Amankwah, Anthony, et al.
Published: (2025)
by: Amankwah, Anthony, et al.
Published: (2025)
MambaNeXt-YOLO: A Hybrid State Space Model for Real-time Object Detection
by: Lei, Xiaochun, et al.
Published: (2025)
by: Lei, Xiaochun, et al.
Published: (2025)
Multi-encoder ConvNeXt Network with Smooth Attentional Feature Fusion for Multispectral Semantic Segmentation
by: Ramos, Leo Thomas, et al.
Published: (2026)
by: Ramos, Leo Thomas, et al.
Published: (2026)
MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces
by: Low, Zhen Xuen Brandon, et al.
Published: (2025)
by: Low, Zhen Xuen Brandon, et al.
Published: (2025)
SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense
by: Liu, Jiayang, et al.
Published: (2025)
by: Liu, Jiayang, et al.
Published: (2025)
VLANeXt: Recipes for Building Strong VLA Models
by: Wu, Xiao-Ming, et al.
Published: (2026)
by: Wu, Xiao-Ming, et al.
Published: (2026)
FovealNet: Advancing AI-Driven Gaze Tracking Solutions for Optimized Foveated Rendering System Performance in Virtual Reality
by: Liu, Wenxuan, et al.
Published: (2024)
by: Liu, Wenxuan, et al.
Published: (2024)
3D Motion Perception of Binocular Vision Target with PID-CNN
by: Shi, Jiazhao, et al.
Published: (2025)
by: Shi, Jiazhao, et al.
Published: (2025)
MicroCrackAttentionNeXt: Advancing Microcrack Detection in Wave Field Analysis Using Deep Neural Networks through Feature Visualization
by: Moreh, Fatahlla, et al.
Published: (2024)
by: Moreh, Fatahlla, et al.
Published: (2024)
Beyond the Doors of Perception: Vision Transformers Represent Relations Between Objects
by: Lepori, Michael A., et al.
Published: (2024)
by: Lepori, Michael A., et al.
Published: (2024)
GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs
by: Zheng, Guanghao, et al.
Published: (2025)
by: Zheng, Guanghao, et al.
Published: (2025)
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows
by: Tang, Zhenggang, et al.
Published: (2024)
by: Tang, Zhenggang, et al.
Published: (2024)
EVCC: Enhanced Vision Transformer-ConvNeXt-CoAtNet Fusion for Classification
by: Hasan, Kazi Reyazul, et al.
Published: (2025)
by: Hasan, Kazi Reyazul, et al.
Published: (2025)
VisionCoach: Reinforcing Grounded Video Reasoning via Visual-Perception Prompting
by: Lee, Daeun, et al.
Published: (2026)
by: Lee, Daeun, et al.
Published: (2026)
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images
by: Zhou, Guanyu, et al.
Published: (2026)
by: Zhou, Guanyu, et al.
Published: (2026)
StrokeNeXt: A Siamese-encoder Approach for Brain Stroke Classification in Computed Tomography Imagery
by: Ramos, Leo Thomas, et al.
Published: (2026)
by: Ramos, Leo Thomas, et al.
Published: (2026)
LiteNeXt: A Novel Lightweight ConvMixer-based Model with Self-embedding Representation Parallel for Medical Image Segmentation
by: Tran, Ngoc-Du, et al.
Published: (2024)
by: Tran, Ngoc-Du, et al.
Published: (2024)
SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision
by: Vani, Ankit, et al.
Published: (2024)
by: Vani, Ankit, et al.
Published: (2024)
Towards Robust Vision Transformer via Masked Adaptive Ensemble
by: Lin, Fudong, et al.
Published: (2024)
by: Lin, Fudong, et al.
Published: (2024)
MedNeXt-v2: Scaling 3D ConvNeXts for Large-Scale Supervised Representation Learning in Medical Image Segmentation
by: Roy, Saikat, et al.
Published: (2025)
by: Roy, Saikat, et al.
Published: (2025)
Integrating ConvNeXt and Vision Transformers for Enhancing Facial Age Estimation
by: Maroun, Gaby, et al.
Published: (2025)
by: Maroun, Gaby, et al.
Published: (2025)
Perception Before Reasoning: Two-Stage Reinforcement Learning for Visual Reasoning in Vision-Language Models
by: Chen, Yan, et al.
Published: (2025)
by: Chen, Yan, et al.
Published: (2025)
Understanding Graphical Perception in Data Visualization through Zero-shot Prompting of Vision-Language Models
by: Guo, Grace, et al.
Published: (2024)
by: Guo, Grace, et al.
Published: (2024)
UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction
by: Nayak, Shravan, et al.
Published: (2025)
by: Nayak, Shravan, et al.
Published: (2025)
Not There Yet: Evaluating Vision Language Models in Simulating the Visual Perception of People with Low Vision
by: Natalie, Rosiana, et al.
Published: (2025)
by: Natalie, Rosiana, et al.
Published: (2025)
Linear Differential Vision Transformer: Learning Visual Contrasts via Pairwise Differentials
by: Pu, Yifan, et al.
Published: (2025)
by: Pu, Yifan, et al.
Published: (2025)
Attention Guided CAM: Visual Explanations of Vision Transformer Guided by Self-Attention
by: Leem, Saebom, et al.
Published: (2024)
by: Leem, Saebom, et al.
Published: (2024)
Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXt
by: Zhou, Han, et al.
Published: (2023)
by: Zhou, Han, et al.
Published: (2023)
Two-stage Vision Transformers and Hard Masking offer Robust Object Representations
by: Aniraj, Ananthu, et al.
Published: (2025)
by: Aniraj, Ananthu, et al.
Published: (2025)
MVP-Bench: Can Large Vision--Language Models Conduct Multi-level Visual Perception Like Humans?
by: Li, Guanzhen, et al.
Published: (2024)
by: Li, Guanzhen, et al.
Published: (2024)
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
by: Nguyen, Truong Thanh Hung, et al.
Published: (2024)
ViPER: Empowering the Self-Evolution of Visual Perception Abilities in Vision-Language Model
by: Zhang, Juntian, et al.
Published: (2025)
by: Zhang, Juntian, et al.
Published: (2025)
DinoTwins: Combining DINO and Barlow Twins for Robust, Label-Efficient Vision Transformers
by: Podsiadly, Michael, et al.
Published: (2025)
by: Podsiadly, Michael, et al.
Published: (2025)
Similar Items
-
EmoNeXt: an Adapted ConvNeXt for Facial Emotion Recognition
by: Boudouri, Yassine El, et al.
Published: (2025) -
CoAtNeXt:An Attention-Enhanced ConvNeXtV2-Transformer Hybrid Model for Gastric Tissue Classification
by: Yurdakul, Mustafa, et al.
Published: (2025) -
InceptionNeXt: When Inception Meets ConvNeXt
by: Yu, Weihao, et al.
Published: (2023) -
Ensemble of radiomics and ConvNeXt for breast cancer diagnosis
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2026) -
FourCastNeXt: Optimizing FourCastNet Training for Limited Compute
by: Guo, Edison, et al.
Published: (2024)