:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yusuf, Md Abu, Khan, Md Rezaul Karim, Saha, Partha Pratim, Rahaman, Mohammed Mahbubur
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.03490
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Comprehensive Review on the Advancement of Home Automation System
by: Habib, Md. Rawshan, et al.
Published: (2024)

ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles
by: Kamal, Mahedi, et al.
Published: (2024)

A Heterogeneous Two-Stream Framework for Video Action Recognition with Comparative Fusion Analysis
by: Rahaman, Md. Afzalur, et al.
Published: (2026)

Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
by: Tofik, Ali, et al.
Published: (2024)

Discrete Wavelet Transform as a Facilitator for Expressive Latent Space Representation in Variational Autoencoders in Satellite Imagery
by: Mahara, Arpan, et al.
Published: (2025)

Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+
by: Mahara, Arpan, et al.
Published: (2024)

ChatGPT in Research and Education: Exploring Benefits and Threats
by: Miah, Abu Saleh Musa, et al.
Published: (2024)

Extreme Model Compression for Edge Vision-Language Models: Sparse Temporal Token Fusion and Adaptive Neural Compression
by: Tanvir, Md Tasnin, et al.
Published: (2025)

GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection
by: Mia, Md Sohag, et al.
Published: (2025)

Cross Spatial Temporal Fusion Attention for Remote Sensing Object Detection via Image Feature Matching
by: Amit, Abu Sadat Mohammad Salehin, et al.
Published: (2025)

AquaFuse: Waterbody Fusion for Physics Guided View Synthesis of Underwater Scenes
by: Siddique, Md Abu Bakr, et al.
Published: (2024)

DANet: Enhancing Small Object Detection through an Efficient Deformable Attention Network
by: Mia, Md Sohag, et al.
Published: (2023)

Object Detection and Tracking
by: Pranto, Md, et al.
Published: (2025)

C^2DA: Contrastive and Context-aware Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2024)

Context-Aware Zero-Shot Anomaly Detection in Surveillance Using Contrastive and Predictive Spatiotemporal Modeling
by: Khan, Md. Rashid Shahriar, et al.
Published: (2025)

Context Aware Grounded Teacher for Source Free Object Detection
by: Ashraf, Tajamul, et al.
Published: (2025)

A Comprehensive Review of Sign Language Recognition: Different Types, Modalities, and Datasets
by: Madhiarasan, M., et al.
Published: (2022)

Handwritten Text Recognition for Low Resource Languages
by: Dey, Sayantan, et al.
Published: (2025)

Vision Models for Medical Imaging: A Hybrid Approach for PCOS Detection from Ultrasound Scans
by: Hoque, Md Mahmudul, et al.
Published: (2026)

A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model
by: Hasan, Murad, et al.
Published: (2024)

Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion
by: Liu, Jiangyuan, et al.
Published: (2025)

DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
by: Ji, Mingqian, et al.
Published: (2025)

Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications
by: Islam, Md. Adnanul, et al.
Published: (2026)

ReFrame: Rectification Framework for Image Explaining Architectures
by: Adhikary, Debjyoti Das, et al.
Published: (2025)

FUSE: Unifying Spectral and Semantic Cues for Robust AI-Generated Image Detection
by: Hossain, Md. Zahid, et al.
Published: (2025)

UStyle: Waterbody Style Transfer of Underwater Scenes by Depth-Guided Feature Synthesis
by: Siddique, Md Abu Bakr, et al.
Published: (2025)

Position and Rotation Invariant Sign Language Recognition from 3D Kinect Data with Recurrent Neural Networks
by: Roy, Prasun, et al.
Published: (2020)

MED-VT++: Unifying Multimodal Learning with a Multiscale Encoder-Decoder Video Transformer
by: Karim, Rezaul, et al.
Published: (2023)

Attention Based Feature Fusion Network for Monkeypox Skin Lesion Detection
by: Kundu, Niloy Kumar, et al.
Published: (2024)

AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2025)

Multimodal Retrieval-Augmented Generation with Large Language Models for Medical VQA
by: Karim, A H M Rezaul, et al.
Published: (2025)

Machine Learning Based Object Tracking
by: Akanda, Md Rakibul Karim, et al.
Published: (2024)

Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation
by: Meraz, Md, et al.
Published: (2024)

FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026)

Detection and Recovery of Adversarial Slow-Pose Drift in Offloaded Visual-Inertial Odometry
by: Saha, Soruya, et al.
Published: (2025)

SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues
by: Zishan, Md Abu Obaida, et al.
Published: (2025)

A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO
by: Preanto, Sabit Ahamed, et al.
Published: (2024)

Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
by: Rahaman, Md Mamunur, et al.
Published: (2025)

A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
by: Shin, Jungpil, et al.
Published: (2024)

DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
by: Zaman, Sayeem Been, et al.
Published: (2025)