Saved in:
| Main Authors: | Li, Hulin, Ren, Qiliang, Li, Jun, Wei, Hanbing, Liu, Zheng, Fan, Linfang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.05012 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Slim-neck by GSConv: A lightweight-design for real-time detector architectures
by: Li, Hulin, et al.
Published: (2022)
by: Li, Hulin, et al.
Published: (2022)
A biological vision inspired framework for machine perception of abutting grating illusory contours
by: Zhang, Xiao, et al.
Published: (2025)
by: Zhang, Xiao, et al.
Published: (2025)
Rethinking Features-Fused-Pyramid-Neck for Object Detection
by: Li, Hulin
Published: (2025)
by: Li, Hulin
Published: (2025)
Quantitative evaluation of brain-inspired vision sensors in high-speed robotic perception
by: Wang, Taoyi, et al.
Published: (2025)
by: Wang, Taoyi, et al.
Published: (2025)
FOVI: A biologically-inspired foveated interface for deep vision models
by: Blauch, Nicholas M., et al.
Published: (2026)
by: Blauch, Nicholas M., et al.
Published: (2026)
Beyond conventional vision: RGB-event fusion for robust object detection in dynamic traffic scenarios
by: Liu, Zhanwen, et al.
Published: (2025)
by: Liu, Zhanwen, et al.
Published: (2025)
Unified modality separation: A vision-language framework for unsupervised domain adaptation
by: Li, Xinyao, et al.
Published: (2025)
by: Li, Xinyao, et al.
Published: (2025)
Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning
by: Shao, Xinlei, et al.
Published: (2025)
by: Shao, Xinlei, et al.
Published: (2025)
Brain-inspired spike-timing plasticity for reliable label-efficient event-camera vision
by: Sadoun, Mohamad Yazan, et al.
Published: (2026)
by: Sadoun, Mohamad Yazan, et al.
Published: (2026)
Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images
by: Jamebozorg, Mahdi, et al.
Published: (2024)
by: Jamebozorg, Mahdi, et al.
Published: (2024)
Object Gaussian for Monocular 6D Pose Estimation from Sparse Views
by: Luo, Luqing, et al.
Published: (2024)
by: Luo, Luqing, et al.
Published: (2024)
Enhancing medical vision-language contrastive learning via inter-matching relation modelling
by: Li, Mingjian, et al.
Published: (2024)
by: Li, Mingjian, et al.
Published: (2024)
Refining time-space traffic diagrams: A neighborhood-adaptive linear regression method
by: Yao, Zhihong, et al.
Published: (2026)
by: Yao, Zhihong, et al.
Published: (2026)
GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
by: Zheng, Linfang, et al.
Published: (2024)
by: Zheng, Linfang, et al.
Published: (2024)
Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation
by: Liu, Xiaohong, et al.
Published: (2024)
by: Liu, Xiaohong, et al.
Published: (2024)
Euler-inspired Decoupling Neural Operator for Efficient Pansharpening
by: Zhu, Anqi, et al.
Published: (2026)
by: Zhu, Anqi, et al.
Published: (2026)
Brain-inspired analogical mixture prototypes for few-shot class-incremental learning
by: Li, Wanyi, et al.
Published: (2025)
by: Li, Wanyi, et al.
Published: (2025)
Traffic Cameras to detect inland waterway barge traffic: An Application of machine learning
by: Agorku, Geoffery, et al.
Published: (2024)
by: Agorku, Geoffery, et al.
Published: (2024)
PIE: Physics-inspired Low-light Enhancement
by: Liang, Dong, et al.
Published: (2024)
by: Liang, Dong, et al.
Published: (2024)
On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
by: Zanella, Maxime, et al.
Published: (2024)
by: Zanella, Maxime, et al.
Published: (2024)
MiCo: Multiple Instance Learning with Context-Aware Clustering for Whole Slide Image Analysis
by: Li, Junjian, et al.
Published: (2025)
by: Li, Junjian, et al.
Published: (2025)
Multimodal joint prediction of traffic spatial-temporal data with graph sparse attention mechanism and bidirectional temporal convolutional network
by: Zhang, Dongran, et al.
Published: (2024)
by: Zhang, Dongran, et al.
Published: (2024)
Matrix-game 2.0: An open-source real-time and streaming interactive world model
by: He, Xianglong, et al.
Published: (2025)
by: He, Xianglong, et al.
Published: (2025)
A dynamic vision sensor object recognition model based on trainable event-driven convolution and spiking attention mechanism
by: Zheng, Peng, et al.
Published: (2024)
by: Zheng, Peng, et al.
Published: (2024)
Improving the perception of visual fiducial markers in the field using Adaptive Active Exposure Control
by: Ren, Ziang, et al.
Published: (2024)
by: Ren, Ziang, et al.
Published: (2024)
bi-modal textual prompt learning for vision-language models in remote sensing
by: Kashyap, Pankhi, et al.
Published: (2026)
by: Kashyap, Pankhi, et al.
Published: (2026)
Multi-scale frequency separation network for image deblurring
by: Zhang, Yanni, et al.
Published: (2022)
by: Zhang, Yanni, et al.
Published: (2022)
Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
by: Huang, Weijian, et al.
Published: (2024)
by: Huang, Weijian, et al.
Published: (2024)
DarkShot: Lighting Dark Images with Low-Compute and High-Quality
by: Zheng, Jiazhang, et al.
Published: (2023)
by: Zheng, Jiazhang, et al.
Published: (2023)
An Information Theory-inspired Strategy for Automatic Network Pruning
by: Zheng, Xiawu, et al.
Published: (2021)
by: Zheng, Xiawu, et al.
Published: (2021)
Bio-inspired fine-tuning for selective transfer learning in image classification
by: Davila, Ana, et al.
Published: (2026)
by: Davila, Ana, et al.
Published: (2026)
SpaAct: Spatially-Activated Transition Learning with Curriculum Adaptation for Vision-Language Navigation
by: Li, Pengna, et al.
Published: (2026)
by: Li, Pengna, et al.
Published: (2026)
When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis
by: Zhang, Ruixuan, et al.
Published: (2025)
by: Zhang, Ruixuan, et al.
Published: (2025)
Deep learning models are vulnerable, but adversarial examples are even more vulnerable
by: Li, Jun, et al.
Published: (2025)
by: Li, Jun, et al.
Published: (2025)
First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria
by: Schoder, Stefan
Published: (2023)
by: Schoder, Stefan
Published: (2023)
Do computer vision foundation models learn the low-level characteristics of the human visual system?
by: Cai, Yancheng, et al.
Published: (2025)
by: Cai, Yancheng, et al.
Published: (2025)
A multi-modal vision-language model for generalizable annotation-free pathology localization
by: Yang, Hao, et al.
Published: (2024)
by: Yang, Hao, et al.
Published: (2024)
Do vision models perceive illusory motion in static images like humans?
by: Rosario, Isabella Elaine, et al.
Published: (2026)
by: Rosario, Isabella Elaine, et al.
Published: (2026)
Enhancing seeding efficiency using a computer vision system to monitor furrow quality in real-time
by: Rai, Sidharth, et al.
Published: (2025)
by: Rai, Sidharth, et al.
Published: (2025)
Reasoning in machine vision: learning to think fast and slow
by: Saeed, Shaheer U., et al.
Published: (2025)
by: Saeed, Shaheer U., et al.
Published: (2025)
Similar Items
-
Slim-neck by GSConv: A lightweight-design for real-time detector architectures
by: Li, Hulin, et al.
Published: (2022) -
A biological vision inspired framework for machine perception of abutting grating illusory contours
by: Zhang, Xiao, et al.
Published: (2025) -
Rethinking Features-Fused-Pyramid-Neck for Object Detection
by: Li, Hulin
Published: (2025) -
Quantitative evaluation of brain-inspired vision sensors in high-speed robotic perception
by: Wang, Taoyi, et al.
Published: (2025) -
FOVI: A biologically-inspired foveated interface for deep vision models
by: Blauch, Nicholas M., et al.
Published: (2026)