Saved in:
| Main Authors: | Mhdawi, Ammar K Al, Nnamoko, Nonso, Raafat, Safanah Mudheher, Al-Mhdawi, M. K. S., Humaidi, Amjad J |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.18924 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Intelligent Spatial Estimation for Fire Hazards in Engineering Sites: An Enhanced YOLOv8-Powered Proximity Analysis Framework
by: AlMhdawi, Ammar K., et al.
Published: (2026)
by: AlMhdawi, Ammar K., et al.
Published: (2026)
Predicting Road Crossing Behaviour using Pose Detection and Sequence Modelling
by: Dasgupta, Subhasis, et al.
Published: (2025)
by: Dasgupta, Subhasis, et al.
Published: (2025)
AI-Powered Deepfake Detection Using CNN and Vision Transformer Architectures
by: Urmi, Sifatullah Sheikh, et al.
Published: (2026)
by: Urmi, Sifatullah Sheikh, et al.
Published: (2026)
When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models
by: Al-Hamadani, Samer
Published: (2025)
by: Al-Hamadani, Samer
Published: (2025)
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
by: Pramanick, Shraman, et al.
Published: (2023)
by: Pramanick, Shraman, et al.
Published: (2023)
V-SenseDrive: A Privacy-Preserving Road Video and In-Vehicle Sensor Fusion Framework for Road Safety & Driver Behaviour Modelling
by: Naveed, Muhammad, et al.
Published: (2025)
by: Naveed, Muhammad, et al.
Published: (2025)
Gender Stereotypes in Professional Roles Among Saudis: An Analytical Study of AI-Generated Images Using Language Models
by: AlKhalifah, Khaloud S., et al.
Published: (2025)
by: AlKhalifah, Khaloud S., et al.
Published: (2025)
AI-generated faces influence gender stereotypes and racial homogenization
by: AlDahoul, Nouar, et al.
Published: (2024)
by: AlDahoul, Nouar, et al.
Published: (2024)
An AI-Powered Autonomous Underwater System for Sea Exploration and Scientific Research
by: Almazrouei, Hamad, et al.
Published: (2025)
by: Almazrouei, Hamad, et al.
Published: (2025)
ClaudesLens: Uncertainty Quantification in Computer Vision Models
by: Shaar, Mohamad Al, et al.
Published: (2024)
by: Shaar, Mohamad Al, et al.
Published: (2024)
An Ensemble Model with Attention Based Mechanism for Image Captioning
by: Badarneh, Israa Al, et al.
Published: (2025)
by: Badarneh, Israa Al, et al.
Published: (2025)
ResAF-Net: An Anchor-Free Attention-Based Network for Tree Detection and Agricultural Mapping in Palestine
by: Al-Qasem, Rabee
Published: (2026)
by: Al-Qasem, Rabee
Published: (2026)
Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification
by: Alkhunaizi, Naif, et al.
Published: (2024)
by: Alkhunaizi, Naif, et al.
Published: (2024)
Vision-Based Approach for Food Weight Estimation from 2D Images
by: Wimalasiri, Chathura, et al.
Published: (2024)
by: Wimalasiri, Chathura, et al.
Published: (2024)
XAI-CLIP: ROI-Guided Perturbation Framework for Explainable Medical Image Segmentation in Multimodal Vision-Language Models
by: Alzubaidi, Thuraya, et al.
Published: (2026)
by: Alzubaidi, Thuraya, et al.
Published: (2026)
Confidence-Guided Diffusion Augmentation for Enhanced Bangla Compound Character Recognition
by: Rayhan, Md. Sultan Al
Published: (2026)
by: Rayhan, Md. Sultan Al
Published: (2026)
Enhancing Construction Site Safety: A Lightweight Convolutional Network for Effective Helmet Detection
by: Alif, Mujadded Al Rabbani
Published: (2024)
by: Alif, Mujadded Al Rabbani
Published: (2024)
YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems
by: Alif, Mujadded Al Rabbani
Published: (2024)
by: Alif, Mujadded Al Rabbani
Published: (2024)
Adaptive Image Restoration for Video Surveillance: A Real-Time Approach
by: Amin, Muhammad Awais, et al.
Published: (2025)
by: Amin, Muhammad Awais, et al.
Published: (2025)
IGAN: A New Inception-based Model for Stable and High-Fidelity Image Synthesis Using Generative Adversarial Networks
by: Hashim, Ahmed A., et al.
Published: (2026)
by: Hashim, Ahmed A., et al.
Published: (2026)
A Real-Time DETR Approach to Bangladesh Road Object Detection for Autonomous Vehicles
by: Shahan, Irfan Nafiz, et al.
Published: (2024)
by: Shahan, Irfan Nafiz, et al.
Published: (2024)
Automated Road Distress Detection Using Vision Transformersand Generative Adversarial Networks
by: Rodriguez, Cesar Portocarrero, et al.
Published: (2025)
by: Rodriguez, Cesar Portocarrero, et al.
Published: (2025)
Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation
by: Galshetwar, Vijay M., et al.
Published: (2025)
by: Galshetwar, Vijay M., et al.
Published: (2025)
Approach to Designing CV Systems for Medical Applications: Data, Architecture and AI
by: Ryabtsev, Dmitry, et al.
Published: (2025)
by: Ryabtsev, Dmitry, et al.
Published: (2025)
Infrastructure-Guided Connectivity-Enhanced Road Crack Detection and Estimation
by: Xiao, Haosong, et al.
Published: (2026)
by: Xiao, Haosong, et al.
Published: (2026)
Spiking Vision Transformer with Saccadic Attention
by: Wang, Shuai, et al.
Published: (2025)
by: Wang, Shuai, et al.
Published: (2025)
ViTs are Everywhere: A Comprehensive Study Showcasing Vision Transformers in Different Domain
by: Mia, Md Sohag, et al.
Published: (2023)
by: Mia, Md Sohag, et al.
Published: (2023)
Provenance-Driven Reliable Semantic Medical Image Vector Reconstruction via Lightweight Blockchain-Verified Latent Fingerprints
by: Rasheed, Mohsin, et al.
Published: (2025)
by: Rasheed, Mohsin, et al.
Published: (2025)
PoseGaze-AHP: A Knowledge-Based 3D Dataset for AI-Driven Ocular and Postural Diagnosis
by: Al-Dabet, Saja, et al.
Published: (2025)
by: Al-Dabet, Saja, et al.
Published: (2025)
Bearded Dragon Activity Recognition Pipeline: An AI-Based Approach to Behavioural Monitoring
by: Yermukan, Arsen, et al.
Published: (2025)
by: Yermukan, Arsen, et al.
Published: (2025)
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction
by: Wang, Chao, et al.
Published: (2025)
by: Wang, Chao, et al.
Published: (2025)
OpenCarbon: A Contrastive Learning-based Cross-Modality Neural Approach for High-Resolution Carbon Emission Prediction Using Open Data
by: Zeng, Jinwei, et al.
Published: (2025)
by: Zeng, Jinwei, et al.
Published: (2025)
CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning
by: Ahmed, Fatmaelzahraa Ali, et al.
Published: (2025)
by: Ahmed, Fatmaelzahraa Ali, et al.
Published: (2025)
iPhoneBlur: A Difficulty-Stratified Benchmark for Consumer Device Motion Deblurring
by: Shafi, Abdullah Al, et al.
Published: (2026)
by: Shafi, Abdullah Al, et al.
Published: (2026)
YOLOv12: A Breakdown of the Key Architectural Features
by: Alif, Mujadded Al Rabbani, et al.
Published: (2025)
by: Alif, Mujadded Al Rabbani, et al.
Published: (2025)
Real-time Yemeni Currency Detection
by: AL-Edreesi, Edrees, et al.
Published: (2024)
by: AL-Edreesi, Edrees, et al.
Published: (2024)
The Effects of Visual Priming on Cooperative Behavior in Vision-Language Models
by: Ong, Kenneth J. K.
Published: (2026)
by: Ong, Kenneth J. K.
Published: (2026)
Real, fake and synthetic faces -- does the coin have three sides?
by: Naeem, Shahzeb, et al.
Published: (2024)
by: Naeem, Shahzeb, et al.
Published: (2024)
Training a Vision Language Model as Smartphone Assistant
by: Dorka, Nicolai, et al.
Published: (2024)
by: Dorka, Nicolai, et al.
Published: (2024)
Multimodal AI for Body Fat Estimation: Computer Vision and Anthropometry with DEXA Benchmarks
by: Aldajani, Rayan
Published: (2025)
by: Aldajani, Rayan
Published: (2025)
Similar Items
-
Intelligent Spatial Estimation for Fire Hazards in Engineering Sites: An Enhanced YOLOv8-Powered Proximity Analysis Framework
by: AlMhdawi, Ammar K., et al.
Published: (2026) -
Predicting Road Crossing Behaviour using Pose Detection and Sequence Modelling
by: Dasgupta, Subhasis, et al.
Published: (2025) -
AI-Powered Deepfake Detection Using CNN and Vision Transformer Architectures
by: Urmi, Sifatullah Sheikh, et al.
Published: (2026) -
When Does Supervised Training Pay Off? The Hidden Economics of Object Detection in the Era of Vision-Language Models
by: Al-Hamadani, Samer
Published: (2025) -
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
by: Pramanick, Shraman, et al.
Published: (2023)