| Main Authors: | Shahi, Soroush, Shahabi, Farzad, Nabulsi, Rama, Fernandes, Glenn, Katsaggelos, Aggelos, Alshurafa, Nabil |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.06442 |
Similar Items
CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement
by: Li, Yijie, et al.
Published: (2025)
Brighteye: Glaucoma Screening with Color Fundus Photographs based on Vision Transformer
by: Lin, Hui, et al.
Published: (2024)
Physics-Informed Image Restoration via Progressive PDE Integration
by: Likhite, Shamika, et al.
Published: (2025)
MetaSR: Content-Adaptive Metadata Orchestration for Generative Super-Resolution
by: Guo, Jiaqi, et al.
Published: (2026)
Real-World Atmospheric Turbulence Correction via Domain Adaptation
by: Wang, Xijun, et al.
Published: (2024)
VDPI: Video Deblurring with Pseudo-inverse Modeling
by: Huang, Zhihao, et al.
Published: (2024)
Advancing Limited-Angle CT Reconstruction Through Diffusion-Based Sinogram Completion
by: Guo, Jiaqi, et al.
Published: (2025)
Vision-Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation
by: Guo, Jiaqi, et al.
Published: (2025)
THOR: Text to Human-Object Interaction Diffusion via Relation Intervention
by: Wu, Qianyang, et al.
Published: (2024)
Cross-Temporal Spectrogram Autoencoder (CTSAE): Unsupervised Dimensionality Reduction for Clustering Gravitational Wave Glitches
by: Li, Yi, et al.
Published: (2024)
Probabilistic smooth attention for deep multiple instance learning in medical imaging
by: Castro-Macías, Francisco M., et al.
Published: (2025)
Explainable Transformer Prototypes for Medical Diagnoses
by: Demir, Ugur, et al.
Published: (2024)
Sm: enhanced localization in Multiple Instance Learning for medical imaging classification
by: Castro-Macías, Francisco M., et al.
Published: (2024)
DRL-STNet: Unsupervised Domain Adaptation for Cross-modality Medical Image Segmentation via Disentangled Representation Learning
by: Lin, Hui, et al.
Published: (2024)
Gaze-guided Hand-Object Interaction Synthesis: Dataset and Method
by: Tian, Jie, et al.
Published: (2024)
A General Method to Incorporate Spatial Information into Loss Functions for GAN-based Super-resolution Models
by: Wang, Xijun, et al.
Published: (2024)
HandBooster: Boosting 3D Hand-Mesh Reconstruction by Conditional Synthesis and Sampling of Hand-Object Interactions
by: Xu, Hao, et al.
Published: (2024)
TOUCH: Text-guided Controllable Generation of Free-Form Hand-Object Interactions
by: Han, Guangyi, et al.
Published: (2025)
Caption-Driven Explainability: Probing CNNs for Bias via CLIP
by: Koller, Patrick, et al.
Published: (2025)
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
by: Bao, Chen, et al.
Published: (2024)
THOR2: Topological Analysis for 3D Shape and Color-Based Human-Inspired Object Recognition in Unseen Environments
by: Samani, Ekta U., et al.
Published: (2024)
HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision
by: Bansal, Siddhant, et al.
Published: (2024)
Interpretable Image Classification with Adaptive Prototype-based Vision Transformers
by: Ma, Chiyu, et al.
Published: (2024)
Zero-Shot Personalization of Objects via Textual Inversion
by: Roy, Aniket, et al.
Published: (2026)
Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction
by: Cha, Junuk, et al.
Published: (2024)
Focused Active Learning for Histopathological Image Classification
by: Schmidt, Arne, et al.
Published: (2024)
HandVQA: Diagnosing and Improving Fine-Grained Spatial Reasoning about Hands in Vision-Language Models
by: Sayem, MD Khalequzzaman Chowdhury, et al.
Published: (2026)
From Objects to Events: Unlocking Complex Visual Understanding in Object Detectors via LLM-guided Symbolic Reasoning
by: Zeng, Yuhui, et al.
Published: (2025)
Efficient Lung Ultrasound Severity Scoring Using Dedicated Feature Extractor
by: Guo, Jiaqi, et al.
Published: (2025)
SimA: Simple Softmax-free Attention for Vision Transformers
by: Koohpayegani, Soroush Abbasi, et al.
Published: (2022)
A Computer Vision Approach for Autonomous Cars to Drive Safe at Construction Zone
by: Ahammed, Abu Shad, et al.
Published: (2024)
Addressing Overthinking in Large Vision-Language Models via Gated Perception-Reasoning Optimization
by: Diao, Xingjian, et al.
Published: (2026)
Geo2Vec: Shape- and Distance-Aware Neural Representation of Geospatial Entities
by: Chu, Chen, et al.
Published: (2025)
MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation
by: Zhou, Bohan, et al.
Published: (2025)
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling
by: Hao, Yuze, et al.
Published: (2024)
Enhanced Drift-Aware Computer Vision Architecture for Autonomous Driving
by: Hossain, Md Shahi Amran, et al.
Published: (2025)
AVI-HT: Adaptive Vision-IMU Fusion for 3D Hand Tracking
by: Kou, Ziyi, et al.
Published: (2026)
OSMGen: Highly Controllable Satellite Image Synthesis using OpenStreetMap Data
by: Ziashahabi, Amir, et al.
Published: (2025)
Annotation Free Semantic Segmentation with Vision Foundation Models
by: Seifi, Soroush, et al.
Published: (2024)
Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
by: Wang, Kejie, et al.
Published: (2024)