Saved in:
| Main Authors: | Yusuf, Md Abu, Khan, Md Rezaul Karim, Saha, Partha Pratim, Rahaman, Mohammed Mahbubur |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.03490 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Comprehensive Review on the Advancement of Home Automation System
by: Habib, Md. Rawshan, et al.
Published: (2024)
by: Habib, Md. Rawshan, et al.
Published: (2024)
ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles
by: Kamal, Mahedi, et al.
Published: (2024)
by: Kamal, Mahedi, et al.
Published: (2024)
A Heterogeneous Two-Stream Framework for Video Action Recognition with Comparative Fusion Analysis
by: Rahaman, Md. Afzalur, et al.
Published: (2026)
by: Rahaman, Md. Afzalur, et al.
Published: (2026)
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
by: Tofik, Ali, et al.
Published: (2024)
by: Tofik, Ali, et al.
Published: (2024)
Discrete Wavelet Transform as a Facilitator for Expressive Latent Space Representation in Variational Autoencoders in Satellite Imagery
by: Mahara, Arpan, et al.
Published: (2025)
by: Mahara, Arpan, et al.
Published: (2025)
Automated Road Extraction from Satellite Imagery Integrating Dense Depthwise Dilated Separable Spatial Pyramid Pooling with DeepLabV3+
by: Mahara, Arpan, et al.
Published: (2024)
by: Mahara, Arpan, et al.
Published: (2024)
ChatGPT in Research and Education: Exploring Benefits and Threats
by: Miah, Abu Saleh Musa, et al.
Published: (2024)
by: Miah, Abu Saleh Musa, et al.
Published: (2024)
Extreme Model Compression for Edge Vision-Language Models: Sparse Temporal Token Fusion and Adaptive Neural Compression
by: Tanvir, Md Tasnin, et al.
Published: (2025)
by: Tanvir, Md Tasnin, et al.
Published: (2025)
GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection
by: Mia, Md Sohag, et al.
Published: (2025)
by: Mia, Md Sohag, et al.
Published: (2025)
Cross Spatial Temporal Fusion Attention for Remote Sensing Object Detection via Image Feature Matching
by: Amit, Abu Sadat Mohammad Salehin, et al.
Published: (2025)
by: Amit, Abu Sadat Mohammad Salehin, et al.
Published: (2025)
AquaFuse: Waterbody Fusion for Physics Guided View Synthesis of Underwater Scenes
by: Siddique, Md Abu Bakr, et al.
Published: (2024)
by: Siddique, Md Abu Bakr, et al.
Published: (2024)
DANet: Enhancing Small Object Detection through an Efficient Deformable Attention Network
by: Mia, Md Sohag, et al.
Published: (2023)
by: Mia, Md Sohag, et al.
Published: (2023)
Object Detection and Tracking
by: Pranto, Md, et al.
Published: (2025)
by: Pranto, Md, et al.
Published: (2025)
C^2DA: Contrastive and Context-aware Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2024)
by: Khan, Md. Al-Masrur, et al.
Published: (2024)
Context-Aware Zero-Shot Anomaly Detection in Surveillance Using Contrastive and Predictive Spatiotemporal Modeling
by: Khan, Md. Rashid Shahriar, et al.
Published: (2025)
by: Khan, Md. Rashid Shahriar, et al.
Published: (2025)
Context Aware Grounded Teacher for Source Free Object Detection
by: Ashraf, Tajamul, et al.
Published: (2025)
by: Ashraf, Tajamul, et al.
Published: (2025)
A Comprehensive Review of Sign Language Recognition: Different Types, Modalities, and Datasets
by: Madhiarasan, M., et al.
Published: (2022)
by: Madhiarasan, M., et al.
Published: (2022)
Handwritten Text Recognition for Low Resource Languages
by: Dey, Sayantan, et al.
Published: (2025)
by: Dey, Sayantan, et al.
Published: (2025)
Vision Models for Medical Imaging: A Hybrid Approach for PCOS Detection from Ultrasound Scans
by: Hoque, Md Mahmudul, et al.
Published: (2026)
by: Hoque, Md Mahmudul, et al.
Published: (2026)
A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model
by: Hasan, Murad, et al.
Published: (2024)
by: Hasan, Murad, et al.
Published: (2024)
Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion
by: Liu, Jiangyuan, et al.
Published: (2025)
by: Liu, Jiangyuan, et al.
Published: (2025)
DepthFusion: Depth-Aware Hybrid Feature Fusion for LiDAR-Camera 3D Object Detection
by: Ji, Mingqian, et al.
Published: (2025)
by: Ji, Mingqian, et al.
Published: (2025)
Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications
by: Islam, Md. Adnanul, et al.
Published: (2026)
by: Islam, Md. Adnanul, et al.
Published: (2026)
ReFrame: Rectification Framework for Image Explaining Architectures
by: Adhikary, Debjyoti Das, et al.
Published: (2025)
by: Adhikary, Debjyoti Das, et al.
Published: (2025)
FUSE: Unifying Spectral and Semantic Cues for Robust AI-Generated Image Detection
by: Hossain, Md. Zahid, et al.
Published: (2025)
by: Hossain, Md. Zahid, et al.
Published: (2025)
UStyle: Waterbody Style Transfer of Underwater Scenes by Depth-Guided Feature Synthesis
by: Siddique, Md Abu Bakr, et al.
Published: (2025)
by: Siddique, Md Abu Bakr, et al.
Published: (2025)
Position and Rotation Invariant Sign Language Recognition from 3D Kinect Data with Recurrent Neural Networks
by: Roy, Prasun, et al.
Published: (2020)
by: Roy, Prasun, et al.
Published: (2020)
MED-VT++: Unifying Multimodal Learning with a Multiscale Encoder-Decoder Video Transformer
by: Karim, Rezaul, et al.
Published: (2023)
by: Karim, Rezaul, et al.
Published: (2023)
Attention Based Feature Fusion Network for Monkeypox Skin Lesion Detection
by: Kundu, Niloy Kumar, et al.
Published: (2024)
by: Kundu, Niloy Kumar, et al.
Published: (2024)
AFRDA: Attentive Feature Refinement for Domain Adaptive Semantic Segmentation
by: Khan, Md. Al-Masrur, et al.
Published: (2025)
by: Khan, Md. Al-Masrur, et al.
Published: (2025)
Multimodal Retrieval-Augmented Generation with Large Language Models for Medical VQA
by: Karim, A H M Rezaul, et al.
Published: (2025)
by: Karim, A H M Rezaul, et al.
Published: (2025)
Machine Learning Based Object Tracking
by: Akanda, Md Rakibul Karim, et al.
Published: (2024)
by: Akanda, Md Rakibul Karim, et al.
Published: (2024)
Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation
by: Meraz, Md, et al.
Published: (2024)
by: Meraz, Md, et al.
Published: (2024)
FOCUS: Forcing In-Context Object Localization through Visual Support Constraints and Policy Optimization
by: Karim, Mohammed Asad, et al.
Published: (2026)
by: Karim, Mohammed Asad, et al.
Published: (2026)
Detection and Recovery of Adversarial Slow-Pose Drift in Offloaded Visual-Inertial Odometry
by: Saha, Soruya, et al.
Published: (2025)
by: Saha, Soruya, et al.
Published: (2025)
SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues
by: Zishan, Md Abu Obaida, et al.
Published: (2025)
by: Zishan, Md Abu Obaida, et al.
Published: (2025)
A Semantic Segmentation Approach on Sweet Orange Leaf Diseases Detection Utilizing YOLO
by: Preanto, Sabit Ahamed, et al.
Published: (2024)
by: Preanto, Sabit Ahamed, et al.
Published: (2024)
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images
by: Rahaman, Md Mamunur, et al.
Published: (2025)
by: Rahaman, Md Mamunur, et al.
Published: (2025)
A Methodological and Structural Review of Hand Gesture Recognition Across Diverse Data Modalities
by: Shin, Jungpil, et al.
Published: (2024)
by: Shin, Jungpil, et al.
Published: (2024)
DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
by: Zaman, Sayeem Been, et al.
Published: (2025)
by: Zaman, Sayeem Been, et al.
Published: (2025)
Similar Items
-
A Comprehensive Review on the Advancement of Home Automation System
by: Habib, Md. Rawshan, et al.
Published: (2024) -
ANNA: A Deep Learning Based Dataset in Heterogeneous Traffic for Autonomous Vehicles
by: Kamal, Mahedi, et al.
Published: (2024) -
A Heterogeneous Two-Stream Framework for Video Action Recognition with Comparative Fusion Analysis
by: Rahaman, Md. Afzalur, et al.
Published: (2026) -
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
by: Tofik, Ali, et al.
Published: (2024) -
Discrete Wavelet Transform as a Facilitator for Expressive Latent Space Representation in Variational Autoencoders in Satellite Imagery
by: Mahara, Arpan, et al.
Published: (2025)