Saved in:
| Main Authors: | Raiyan, Syed Rifat, Amio, Zibran Zarif, Ahmed, Sabbir |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.10360 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models
by: Siddique, Md. Abu Bakor, et al.
Published: (2026)
by: Siddique, Md. Abu Bakor, et al.
Published: (2026)
Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks
by: Hossain, Md Zarif, et al.
Published: (2024)
by: Hossain, Md Zarif, et al.
Published: (2024)
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
by: Hossain, Md Zarif, et al.
Published: (2024)
by: Hossain, Md Zarif, et al.
Published: (2024)
Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models
by: Raiyan, Syed Rifat
Published: (2026)
by: Raiyan, Syed Rifat
Published: (2026)
Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation
by: Bhuiyan, Hasnat Jamil, et al.
Published: (2024)
by: Bhuiyan, Hasnat Jamil, et al.
Published: (2024)
Unmasking Puppeteers: Leveraging Biometric Leakage to Expose Impersonation in AI-Based Videoconferencing
by: Vahdati, Danial Samadi, et al.
Published: (2025)
by: Vahdati, Danial Samadi, et al.
Published: (2025)
Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics
by: Li, Ruining, et al.
Published: (2024)
by: Li, Ruining, et al.
Published: (2024)
Survey on Hand Gesture Recognition from Visual Input
by: Linardakis, Manousos, et al.
Published: (2025)
by: Linardakis, Manousos, et al.
Published: (2025)
Hands-on Evaluation of Visual Transformers for Object Recognition and Detection
by: Vlachogiannis, Dimitrios N., et al.
Published: (2025)
by: Vlachogiannis, Dimitrios N., et al.
Published: (2025)
Hand3R: Online 4D Hand-Scene Reconstruction in the Wild
by: Hu, Wendi, et al.
Published: (2026)
by: Hu, Wendi, et al.
Published: (2026)
AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild
by: Park, Junho, et al.
Published: (2024)
by: Park, Junho, et al.
Published: (2024)
An Efficient Deep Learning Framework for Brain Stroke Diagnosis Using Computed Tomography Images
by: Hossen, Md. Sabbir, et al.
Published: (2025)
by: Hossen, Md. Sabbir, et al.
Published: (2025)
Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge Distillation
by: Jillani, Hunain Ahmed, et al.
Published: (2026)
by: Jillani, Hunain Ahmed, et al.
Published: (2026)
Online Hand Gesture Recognition Using 3D Convolutional Neural Networks
by: Qin, Yinghao, et al.
Published: (2026)
by: Qin, Yinghao, et al.
Published: (2026)
ForCM: Forest Cover Mapping from Multispectral Sentinel-2 Image by Integrating Deep Learning with Object-Based Image Analysis
by: Haque, Maisha, et al.
Published: (2025)
by: Haque, Maisha, et al.
Published: (2025)
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
by: Narasimhaswamy, Supreeth, et al.
Published: (2024)
by: Narasimhaswamy, Supreeth, et al.
Published: (2024)
HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition
by: Nuzhdin, Anton, et al.
Published: (2024)
by: Nuzhdin, Anton, et al.
Published: (2024)
MatteViT: High-Frequency-Aware Document Shadow Removal with Shadow Matte Guidance
by: Kim, Chaewon, et al.
Published: (2025)
by: Kim, Chaewon, et al.
Published: (2025)
DocShaDiffusion: Diffusion Model in Latent Space for Document Image Shadow Removal
by: Liu, Wenjie, et al.
Published: (2025)
by: Liu, Wenjie, et al.
Published: (2025)
Object Detection Approaches to Identifying Hand Images with High Forensic Values
by: Nguyen, Thanh Thi, et al.
Published: (2024)
by: Nguyen, Thanh Thi, et al.
Published: (2024)
Advancing Histopathology-Based Breast Cancer Diagnosis: Insights into Multi-Modality and Explainability
by: Abdullakutty, Faseela, et al.
Published: (2024)
by: Abdullakutty, Faseela, et al.
Published: (2024)
FUSED-Net: Detecting Traffic Signs with Limited Data
by: Rahman, Md. Atiqur, et al.
Published: (2024)
by: Rahman, Md. Atiqur, et al.
Published: (2024)
ShadowWolf -- Automatic Labelling, Evaluation and Model Training Optimised for Camera Trap Wildlife Images
by: Dede, Jens, et al.
Published: (2025)
by: Dede, Jens, et al.
Published: (2025)
VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image
by: Bi, Han, et al.
Published: (2025)
by: Bi, Han, et al.
Published: (2025)
MaskHand: Generative Masked Modeling for Robust Hand Mesh Reconstruction in the Wild
by: Saleem, Muhammad Usama, et al.
Published: (2024)
by: Saleem, Muhammad Usama, et al.
Published: (2024)
ShadowDraw: From Any Object to Shadow-Drawing Compositional Art
by: Luo, Rundong, et al.
Published: (2025)
by: Luo, Rundong, et al.
Published: (2025)
CAM-Seg: A Continuous-valued Embedding Approach for Semantic Image Generation
by: Ahmed, Masud, et al.
Published: (2025)
by: Ahmed, Masud, et al.
Published: (2025)
SPLite Hand: Sparsity-Aware Lightweight 3D Hand Pose Estimation
by: Hao, Yeh Keng, et al.
Published: (2025)
by: Hao, Yeh Keng, et al.
Published: (2025)
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics
by: Tse, Tze Ho Elden, et al.
Published: (2025)
by: Tse, Tze Ho Elden, et al.
Published: (2025)
Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers
by: Tariq, Syed Ali, et al.
Published: (2025)
by: Tariq, Syed Ali, et al.
Published: (2025)
DARDA: Domain-Aware Real-Time Dynamic Neural Network Adaptation
by: Rifat, Shahriar, et al.
Published: (2024)
by: Rifat, Shahriar, et al.
Published: (2024)
LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers
by: Chowdhury, Md Abtahi Majeed, et al.
Published: (2025)
by: Chowdhury, Md Abtahi Majeed, et al.
Published: (2025)
Deep Tree Tensor Networks for Image Recognition
by: Nie, Chang, et al.
Published: (2025)
by: Nie, Chang, et al.
Published: (2025)
Latent Feature-Guided Diffusion Models for Shadow Removal
by: Mei, Kangfu, et al.
Published: (2023)
by: Mei, Kangfu, et al.
Published: (2023)
An Evolutionary Network Architecture Search Framework with Adaptive Multimodal Fusion for Hand Gesture Recognition
by: Xia, Yizhang, et al.
Published: (2024)
by: Xia, Yizhang, et al.
Published: (2024)
EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition
by: Abdelkawy, Ahmed, et al.
Published: (2024)
by: Abdelkawy, Ahmed, et al.
Published: (2024)
Timeline and Boundary Guided Diffusion Network for Video Shadow Detection
by: Zhou, Haipeng, et al.
Published: (2024)
by: Zhou, Haipeng, et al.
Published: (2024)
Contrast-Prior Enhanced Duality for Mask-Free Shadow Removal
by: Wu, Jiyu, et al.
Published: (2025)
by: Wu, Jiyu, et al.
Published: (2025)
FindingEmo: An Image Dataset for Emotion Recognition in the Wild
by: Mertens, Laurent, et al.
Published: (2024)
by: Mertens, Laurent, et al.
Published: (2024)
Flatten: Video Action Recognition is an Image Classification task
by: Chen, Junlin, et al.
Published: (2024)
by: Chen, Junlin, et al.
Published: (2024)
Similar Items
-
Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models
by: Siddique, Md. Abu Bakor, et al.
Published: (2026) -
Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks
by: Hossain, Md Zarif, et al.
Published: (2024) -
Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
by: Hossain, Md Zarif, et al.
Published: (2024) -
Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models
by: Raiyan, Syed Rifat
Published: (2026) -
Enhancing Bidirectional Sign Language Communication: Integrating YOLOv8 and NLP for Real-Time Gesture Recognition & Translation
by: Bhuiyan, Hasnat Jamil, et al.
Published: (2024)