:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Hulin, Ren, Qiliang, Li, Jun, Wei, Hanbing, Liu, Zheng, Fan, Linfang
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.05012
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Slim-neck by GSConv: A lightweight-design for real-time detector architectures
by: Li, Hulin, et al.
Published: (2022)

A biological vision inspired framework for machine perception of abutting grating illusory contours
by: Zhang, Xiao, et al.
Published: (2025)

Rethinking Features-Fused-Pyramid-Neck for Object Detection
by: Li, Hulin
Published: (2025)

Quantitative evaluation of brain-inspired vision sensors in high-speed robotic perception
by: Wang, Taoyi, et al.
Published: (2025)

FOVI: A biologically-inspired foveated interface for deep vision models
by: Blauch, Nicholas M., et al.
Published: (2026)

Beyond conventional vision: RGB-event fusion for robust object detection in dynamic traffic scenarios
by: Liu, Zhanwen, et al.
Published: (2025)

Unified modality separation: A vision-language framework for unsupervised domain adaptation
by: Li, Xinyao, et al.
Published: (2025)

Multi-label classification for multi-temporal, multi-spatial coral reef condition monitoring using vision foundation model with adapter learning
by: Shao, Xinlei, et al.
Published: (2025)

Brain-inspired spike-timing plasticity for reliable label-efficient event-camera vision
by: Sadoun, Mohamad Yazan, et al.
Published: (2026)

Traffic control using intelligent timing of traffic lights with reinforcement learning technique and real-time processing of surveillance camera images
by: Jamebozorg, Mahdi, et al.
Published: (2024)

Object Gaussian for Monocular 6D Pose Estimation from Sparse Views
by: Luo, Luqing, et al.
Published: (2024)

Enhancing medical vision-language contrastive learning via inter-matching relation modelling
by: Li, Mingjian, et al.
Published: (2024)

Refining time-space traffic diagrams: A neighborhood-adaptive linear regression method
by: Yao, Zhihong, et al.
Published: (2026)

GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement
by: Zheng, Linfang, et al.
Published: (2024)

Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation
by: Liu, Xiaohong, et al.
Published: (2024)

Euler-inspired Decoupling Neural Operator for Efficient Pansharpening
by: Zhu, Anqi, et al.
Published: (2026)

Brain-inspired analogical mixture prototypes for few-shot class-incremental learning
by: Li, Wanyi, et al.
Published: (2025)

Traffic Cameras to detect inland waterway barge traffic: An Application of machine learning
by: Agorku, Geoffery, et al.
Published: (2024)

PIE: Physics-inspired Low-light Enhancement
by: Liang, Dong, et al.
Published: (2024)

On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning?
by: Zanella, Maxime, et al.
Published: (2024)

MiCo: Multiple Instance Learning with Context-Aware Clustering for Whole Slide Image Analysis
by: Li, Junjian, et al.
Published: (2025)

Multimodal joint prediction of traffic spatial-temporal data with graph sparse attention mechanism and bidirectional temporal convolutional network
by: Zhang, Dongran, et al.
Published: (2024)

Matrix-game 2.0: An open-source real-time and streaming interactive world model
by: He, Xianglong, et al.
Published: (2025)

A dynamic vision sensor object recognition model based on trainable event-driven convolution and spiking attention mechanism
by: Zheng, Peng, et al.
Published: (2024)

Improving the perception of visual fiducial markers in the field using Adaptive Active Exposure Control
by: Ren, Ziang, et al.
Published: (2024)

bi-modal textual prompt learning for vision-language models in remote sensing
by: Kashyap, Pankhi, et al.
Published: (2026)

Multi-scale frequency separation network for image deblurring
by: Zhang, Yanni, et al.
Published: (2022)

Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
by: Huang, Weijian, et al.
Published: (2024)

DarkShot: Lighting Dark Images with Low-Compute and High-Quality
by: Zheng, Jiazhang, et al.
Published: (2023)

An Information Theory-inspired Strategy for Automatic Network Pruning
by: Zheng, Xiawu, et al.
Published: (2021)

Bio-inspired fine-tuning for selective transfer learning in image classification
by: Davila, Ana, et al.
Published: (2026)

SpaAct: Spatially-Activated Transition Learning with Curriculum Adaptation for Vision-Language Navigation
by: Li, Pengna, et al.
Published: (2026)

When language and vision meet road safety: leveraging multimodal large language models for video-based traffic accident analysis
by: Zhang, Ruixuan, et al.
Published: (2025)

Deep learning models are vulnerable, but adversarial examples are even more vulnerable
by: Li, Jun, et al.
Published: (2025)

First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria
by: Schoder, Stefan
Published: (2023)

Do computer vision foundation models learn the low-level characteristics of the human visual system?
by: Cai, Yancheng, et al.
Published: (2025)

A multi-modal vision-language model for generalizable annotation-free pathology localization
by: Yang, Hao, et al.
Published: (2024)

Do vision models perceive illusory motion in static images like humans?
by: Rosario, Isabella Elaine, et al.
Published: (2026)

Enhancing seeding efficiency using a computer vision system to monitor furrow quality in real-time
by: Rai, Sidharth, et al.
Published: (2025)

Reasoning in machine vision: learning to think fast and slow
by: Saeed, Shaheer U., et al.
Published: (2025)