:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mhdawi, Ammar K Al, Nnamoko, Nonso, Raafat, Safanah Mudheher, Al-Mhdawi, M. K. S., Humaidi, Amjad J
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2506.18924
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Intelligent Spatial Estimation for Fire Hazards in Engineering Sites: An Enhanced YOLOv8-Powered Proximity Analysis Framework
by: AlMhdawi, Ammar K., et al.
Published: (2026)

Predicting Road Crossing Behaviour using Pose Detection and Sequence Modelling
by: Dasgupta, Subhasis, et al.
Published: (2025)

AI-Powered Deepfake Detection Using CNN and Vision Transformer Architectures
by: Urmi, Sifatullah Sheikh, et al.
Published: (2026)

When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models
by: Al-Hamadani, Samer
Published: (2025)

Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
by: Pramanick, Shraman, et al.
Published: (2023)

V-SenseDrive: A Privacy-Preserving Road Video and In-Vehicle Sensor Fusion Framework for Road Safety & Driver Behaviour Modelling
by: Naveed, Muhammad, et al.
Published: (2025)

Gender Stereotypes in Professional Roles Among Saudis: An Analytical Study of AI-Generated Images Using Language Models
by: AlKhalifah, Khaloud S., et al.
Published: (2025)

AI-generated faces influence gender stereotypes and racial homogenization
by: AlDahoul, Nouar, et al.
Published: (2024)

An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research
by: Almazrouei, Hamad, et al.
Published: (2025)

ClaudesLens: Uncertainty Quantification in Computer Vision Models
by: Shaar, Mohamad Al, et al.
Published: (2024)

An Ensemble Model with Attention Based Mechanism for Image Captioning
by: Badarneh, Israa Al, et al.
Published: (2025)

ResAF-Net: An Anchor-Free Attention-Based Network for Tree Detection and Agricultural Mapping in Palestine
by: Al-Qasem, Rabee
Published: (2026)

Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification
by: Alkhunaizi, Naif, et al.
Published: (2024)

Vision-Based Approach for Food Weight Estimation from 2D Images
by: Wimalasiri, Chathura, et al.
Published: (2024)

XAI-CLIP: ROI-Guided Perturbation Framework for Explainable Medical Image Segmentation in Multimodal Vision-Language Models
by: Alzubaidi, Thuraya, et al.
Published: (2026)

Confidence-Guided Diffusion Augmentation for Enhanced Bangla Compound Character Recognition
by: Rayhan, Md. Sultan Al
Published: (2026)

Enhancing Construction Site Safety: A Lightweight Convolutional Network for Effective Helmet Detection
by: Alif, Mujadded Al Rabbani
Published: (2024)

YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems
by: Alif, Mujadded Al Rabbani
Published: (2024)

Adaptive Image Restoration for Video Surveillance: A Real-Time Approach
by: Amin, Muhammad Awais, et al.
Published: (2025)

IGAN: A New Inception-based Model for Stable and High-Fidelity Image Synthesis Using Generative Adversarial Networks
by: Hashim, Ahmed A., et al.
Published: (2026)

A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles
by: Shahan, Irfan Nafiz, et al.
Published: (2024)

Automated Road Distress Detection Using Vision Transformersand Generative Adversarial Networks
by: Rodriguez, Cesar Portocarrero, et al.
Published: (2025)

Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation
by: Galshetwar, Vijay M., et al.
Published: (2025)

Approach to Designing CV Systems for Medical Applications: Data, Architecture and AI
by: Ryabtsev, Dmitry, et al.
Published: (2025)

Infrastructure-Guided Connectivity-Enhanced Road Crack Detection and Estimation
by: Xiao, Haosong, et al.
Published: (2026)

Spiking Vision Transformer with Saccadic Attention
by: Wang, Shuai, et al.
Published: (2025)

ViTs are Everywhere: A Comprehensive Study Showcasing Vision Transformers in Different Domain
by: Mia, Md Sohag, et al.
Published: (2023)

Provenance-Driven Reliable Semantic Medical Image Vector Reconstruction via Lightweight Blockchain-Verified Latent Fingerprints
by: Rasheed, Mohsin, et al.
Published: (2025)

PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis
by: Al-Dabet, Saja, et al.
Published: (2025)

Bearded Dragon Activity Recognition Pipeline: An AI-Based Approach to Behavioural Monitoring
by: Yermukan, Arsen, et al.
Published: (2025)

TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction
by: Wang, Chao, et al.
Published: (2025)

OpenCarbon: A Contrastive Learning-based Cross-Modality Neural Approach for High-Resolution Carbon Emission Prediction Using Open Data
by: Zeng, Jinwei, et al.
Published: (2025)

CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning
by: Ahmed, Fatmaelzahraa Ali, et al.
Published: (2025)

iPhoneBlur: A Difficulty-Stratified Benchmark for Consumer Device Motion Deblurring
by: Shafi, Abdullah Al, et al.
Published: (2026)

YOLOv12: A Breakdown of the Key Architectural Features
by: Alif, Mujadded Al Rabbani, et al.
Published: (2025)

Real-time Yemeni Currency Detection
by: AL-Edreesi, Edrees, et al.
Published: (2024)

The Effects of Visual Priming on Cooperative Behavior in Vision-Language Models
by: Ong, Kenneth J. K.
Published: (2026)

Real, fake and synthetic faces -- does the coin have three sides?
by: Naeem, Shahzeb, et al.
Published: (2024)

Training a Vision Language Model as Smartphone Assistant
by: Dorka, Nicolai, et al.
Published: (2024)

Multimodal AI for Body Fat Estimation: Computer Vision and Anthropometry with DEXA Benchmarks
by: Aldajani, Rayan
Published: (2025)