
Bibliographic Details
Main Authors:	Raiyan, Syed Rifat; Amio, Zibran Zarif; Ahmed, Sabbir
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2408.10360

Similar Items

Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models
by: Siddique, Md. Abu Bakor, et al.
Published: (2026)

Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks
by: Hossain, Md Zarif, et al.
Published: (2024)

Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
by: Hossain, Md Zarif, et al.
Published: (2024)

Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models
by: Raiyan, Syed Rifat
Published: (2026)

Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation
by: Bhuiyan, Hasnat Jamil, et al.
Published: (2024)

Unmasking Puppeteers: Leveraging Biometric Leakage to Expose Impersonation in AI-Based Videoconferencing
by: Vahdati, Danial Samadi, et al.
Published: (2025)

Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
by: Li, Ruining, et al.
Published: (2024)

Survey on Hand Gesture Recognition from Visual Input
by: Linardakis, Manousos, et al.
Published: (2025)

Hands-on Evaluation of Visual Transformers for Object Recognition and Detection
by: Vlachogiannis, Dimitrios N., et al.
Published: (2025)

Hand3R: Online 4D Hand-Scene Reconstruction in the Wild
by: Hu, Wendi, et al.
Published: (2026)

AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild
by: Park, Junho, et al.
Published: (2024)

An Efficient Deep Learning Framework for Brain Stroke Diagnosis Using Computed Tomography Images
by: Hossen, Md. Sabbir, et al.
Published: (2025)

Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation
by: Jillani, Hunain Ahmed, et al.
Published: (2026)

Online Hand Gesture Recognition Using 3D Convolutional Neural Networks
by: Qin, Yinghao, et al.
Published: (2026)

ForCM: Forest Cover Mapping from Multispectral Sentinel-2 Image by Integrating Deep Learning with Object-Based Image Analysis
by: Haque, Maisha, et al.
Published: (2025)

HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
by: Narasimhaswamy, Supreeth, et al.
Published: (2024)

HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition
by: Nuzhdin, Anton, et al.
Published: (2024)

MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
by: Kim, Chaewon, et al.
Published: (2025)

DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal
by: Liu, Wenjie, et al.
Published: (2025)

Object Detection Approaches to Identifying Hand Images with High Forensic Values
by: Nguyen, Thanh Thi, et al.
Published: (2024)

Advancing Histopathology-Based Breast Cancer Diagnosis: Insights into Multi-Modality and Explainability
by: Abdullakutty, Faseela, et al.
Published: (2024)

FUSED-Net: Detecting Traffic Signs with Limited Data
by: Rahman, Md. Atiqur, et al.
Published: (2024)

ShadowWolf -- Automatic Labelling, Evaluation and Model Training Optimised for Camera Trap Wildlife Images
by: Dede, Jens, et al.
Published: (2025)

VM-BHINet: Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image
by: Bi, Han, et al.
Published: (2025)

MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
by: Saleem, Muhammad Usama, et al.
Published: (2024)

ShadowDraw: From Any Object to Shadow-Drawing Compositional Art
by: Luo, Rundong, et al.
Published: (2025)

CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation
by: Ahmed, Masud, et al.
Published: (2025)

SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation
by: Hao, Yeh Keng, et al.
Published: (2025)

Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics
by: Tse, Tze Ho Elden, et al.
Published: (2025)

Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers
by: Tariq, Syed Ali, et al.
Published: (2025)

DARDA: Domain-Aware Real-Time Dynamic Neural Network Adaptation
by: Rifat, Shahriar, et al.
Published: (2024)

LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers
by: Chowdhury, Md Abtahi Majeed, et al.
Published: (2025)

Deep Tree Tensor Networks for Image Recognition
by: Nie, Chang, et al.
Published: (2025)

Latent Feature-Guided Diffusion Models for Shadow Removal
by: Mei, Kangfu, et al.
Published: (2023)

An Evolutionary Network Architecture Search Framework with Adaptive Multimodal Fusion for Hand Gesture Recognition
by: Xia, Yizhang, et al.
Published: (2024)

EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition
by: Abdelkawy, Ahmed, et al.
Published: (2024)

Timeline and Boundary Guided Diffusion Network for Video Shadow Detection
by: Zhou, Haipeng, et al.
Published: (2024)

Contrast-Prior Enhanced Duality for Mask-Free Shadow Removal
by: Wu, Jiyu, et al.
Published: (2025)

FindingEmo: An Image Dataset for Emotion Recognition in the Wild
by: Mertens, Laurent, et al.
Published: (2024)

Flatten: Video Action Recognition is an Image Classification task
by: Chen, Junlin, et al.
Published: (2024)