:: Library Catalog

Bibliographic Details
Main Author:	Hashempoor, Hamidreza
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.10115

Similar Items

GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection
by: Zhang, Huaxin, et al.
Published: (2024)

FastTracker: Real-Time and Accurate Visual Tracking
by: Hashempoor, Hamidreza, et al.
Published: (2025)

Glance: Accelerating Diffusion Models with 1 Sample
by: Dong, Zhuobai, et al.
Published: (2025)

Glance and Focus Reinforcement for Pan-cancer Screening
by: Wu, Linshan, et al.
Published: (2026)

FeatureSORT: Essential Features for Effective Tracking
by: Hashempoor, Hamidreza, et al.
Published: (2024)

Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
by: Bai, Ziyi, et al.
Published: (2024)

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
by: Bai, Tianyi, et al.
Published: (2025)

Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning
by: Bai, Hongbo, et al.
Published: (2026)

Insights from Visual Cognition: Understanding Human Action Dynamics with Overall Glance and Refined Gaze Transformer
by: Xing, Bohao, et al.
Published: (2026)

Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling
by: Yuan, Bo, et al.
Published: (2024)

Every Angle Is Worth A Second Glance: Mining Kinematic Skeletal Structures from Multi-view Joint Cloud
by: Jiang, Junkun, et al.
Published: (2025)

Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
by: Zhang, Jinrong, et al.
Published: (2024)

MS-Glance: Bio-Insipred Non-semantic Context Vectors and their Applications in Supervising Image Reconstruction
by: Gao, Ziqi, et al.
Published: (2024)

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
by: Wang, Letian, et al.
Published: (2024)

Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS
by: Zhou, Feng, et al.
Published: (2025)

A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness
by: Jiang, Lutao, et al.
Published: (2024)

LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition
by: Xiong, Songsong, et al.
Published: (2025)

Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning
by: Leijenaar, Remco F., et al.
Published: (2025)

ETTA: Efficient Test-Time Adaptation for Vision-Language Models through Dynamic Embedding Updates
by: Dastmalchi, Hamidreza, et al.
Published: (2025)

Towards Open-World Grasping with Large Vision-Language Models
by: Tziafas, Georgios, et al.
Published: (2024)

Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments
by: Yousefzadeh, Saeideh, et al.
Published: (2025)

X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
by: Ma, Yiwei, et al.
Published: (2024)

Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression
by: Dastmalchi, Hamidreza, et al.
Published: (2026)

U-Turn Diffusion
by: Behjoo, Hamidreza, et al.
Published: (2023)

A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
by: Zhang, Wenbo, et al.
Published: (2024)

Progressive Checkerboards for Autoregressive Multiscale Image Generation
by: Eigen, David
Published: (2026)

GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition
by: Wang, Tianyue, et al.
Published: (2025)

DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
by: Wang, Zhechao, et al.
Published: (2026)

EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework
by: Wang, Junjue, et al.
Published: (2026)

Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models
by: Dastmalchi, Hamidreza, et al.
Published: (2024)

Structured Initialization for Vision Transformers
by: Zheng, Jianqiao, et al.
Published: (2025)

Twin Co-Adaptive Dialogue for Progressive Image Generation
by: Wang, Jianhui, et al.
Published: (2025)

Enhancing Image Generation Fidelity via Progressive Prompts
by: Xiong, Zhen, et al.
Published: (2025)

Spectral Progressive Diffusion for Efficient Image and Video Generation
by: Xiao, Howard, et al.
Published: (2026)

VITAL: Interactive Few-Shot Imitation Learning via Visual Human-in-the-Loop Corrections
by: Kasaei, Hamidreza, et al.
Published: (2024)

PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
by: Iqbal, Hasan, et al.
Published: (2025)

Hypothesis Testing for Progressive Kernel Estimation and VCM Framework
by: Lin, Zehui, et al.
Published: (2025)

HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval
by: Li, Zixu, et al.
Published: (2026)

Reliable Detection of Minute Targets in High-Resolution Aerial Imagery across Temporal Shifts
by: Gholizadeh, Mohammad Sadegh, et al.
Published: (2025)

Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion
by: Tan, Shiqi, et al.
Published: (2024)