:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shahi, Soroush, Shahabi, Farzad, Nabulsi, Rama, Fernandes, Glenn, Katsaggelos, Aggelos, Alshurafa, Nabil
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.06442
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement
by: Li, Yijie, et al.
Published: (2025)

Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer
by: Lin, Hui, et al.
Published: (2024)

Physics-Informed Image Restoration via Progressive PDE Integration
by: Likhite, Shamika, et al.
Published: (2025)

MetaSR: Content-Adaptive Metadata Orchestration for Generative Super-Resolution
by: Guo, Jiaqi, et al.
Published: (2026)

Real-World Atmospheric Turbulence Correction via Domain Adaptation
by: Wang, Xijun, et al.
Published: (2024)

VDPI: Video Deblurring with Pseudo-inverse Modeling
by: Huang, Zhihao, et al.
Published: (2024)

Advancing Limited-Angle CT Reconstruction Through Diffusion-Based Sinogram Completion
by: Guo, Jiaqi, et al.
Published: (2025)

Vision-Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation
by: Guo, Jiaqi, et al.
Published: (2025)

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
by: Wu, Qianyang, et al.
Published: (2024)

Cross-Temporal Spectrogram Autoencoder (CTSAE): Unsupervised Dimensionality Reduction for Clustering Gravitational Wave Glitches
by: Li, Yi, et al.
Published: (2024)

Probabilistic smooth attention for deep multiple instance learning in medical imaging
by: Castro-Macías, Francisco M., et al.
Published: (2025)

Explainable Transformer Prototypes for Medical Diagnoses
by: Demir, Ugur, et al.
Published: (2024)

Sm: enhanced localization in Multiple Instance Learning for medical imaging classification
by: Castro-Macías, Francisco M., et al.
Published: (2024)

DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning
by: Lin, Hui, et al.
Published: (2024)

Gaze-guided Hand-Object Interaction Synthesis: Dataset and Method
by: Tian, Jie, et al.
Published: (2024)

A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models
by: Wang, Xijun, et al.
Published: (2024)

HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions
by: Xu, Hao, et al.
Published: (2024)

TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions
by: Han, Guangyi, et al.
Published: (2025)

Caption-Driven Explainability: Probing CNNs for Bias via CLIP
by: Koller, Patrick, et al.
Published: (2025)

HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
by: Bao, Chen, et al.
Published: (2024)

THOR2: Topological Analysis for 3D Shape and Color-Based Human-Inspired Object Recognition in Unseen Environments
by: Samani, Ekta U., et al.
Published: (2024)

HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
by: Bansal, Siddhant, et al.
Published: (2024)

Interpretable Image Classification with Adaptive Prototype-based Vision Transformers
by: Ma, Chiyu, et al.
Published: (2024)

Zero-Shot Personalization of Objects via Textual Inversion
by: Roy, Aniket, et al.
Published: (2026)

Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction
by: Cha, Junuk, et al.
Published: (2024)

Focused Active Learning for Histopathological Image Classification
by: Schmidt, Arne, et al.
Published: (2024)

HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models
by: Sayem, MD Khalequzzaman Chowdhury, et al.
Published: (2026)

From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning
by: Zeng, Yuhui, et al.
Published: (2025)

Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor
by: Guo, Jiaqi, et al.
Published: (2025)

SimA: Simple Softmax-free Attention for Vision Transformers
by: Koohpayegani, Soroush Abbasi, et al.
Published: (2022)

A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone
by: Ahammed, Abu Shad, et al.
Published: (2024)

Addressing Overthinking in Large Vision-Language Models via Gated Perception-Reasoning Optimization
by: Diao, Xingjian, et al.
Published: (2026)

Geo2Vec: Shape- and Distance-Aware Neural Representation of Geospatial Entities
by: Chu, Chen, et al.
Published: (2025)

MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation
by: Zhou, Bohan, et al.
Published: (2025)

Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
by: Hao, Yuze, et al.
Published: (2024)

Enhanced Drift-Aware Computer Vision Architecture for Autonomous Driving
by: Hossain, Md Shahi Amran, et al.
Published: (2025)

AVI-HT: Adaptive Vision-IMU Fusion for 3D Hand Tracking
by: Kou, Ziyi, et al.
Published: (2026)

OSMGen: Highly Controllable Satellite Image Synthesis using OpenStreetMap Data
by: Ziashahabi, Amir, et al.
Published: (2025)

Annotation Free Semantic Segmentation with Vision Foundation Models
by: Seifi, Soroush, et al.
Published: (2024)

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
by: Wang, Kejie, et al.
Published: (2024)