Saved in:
| Main Author: | Hashempoor, Hamidreza |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.10115 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection
by: Zhang, Huaxin, et al.
Published: (2024)
by: Zhang, Huaxin, et al.
Published: (2024)
FastTracker: Real-Time and Accurate Visual Tracking
by: Hashempoor, Hamidreza, et al.
Published: (2025)
by: Hashempoor, Hamidreza, et al.
Published: (2025)
Glance: Accelerating Diffusion Models with 1 Sample
by: Dong, Zhuobai, et al.
Published: (2025)
by: Dong, Zhuobai, et al.
Published: (2025)
Glance and Focus Reinforcement for Pan-cancer Screening
by: Wu, Linshan, et al.
Published: (2026)
by: Wu, Linshan, et al.
Published: (2026)
FeatureSORT: Essential Features for Effective Tracking
by: Hashempoor, Hamidreza, et al.
Published: (2024)
by: Hashempoor, Hamidreza, et al.
Published: (2024)
Glance and Focus: Memory Prompting for Multi-Event Video Question Answering
by: Bai, Ziyi, et al.
Published: (2024)
by: Bai, Ziyi, et al.
Published: (2024)
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning
by: Bai, Tianyi, et al.
Published: (2025)
by: Bai, Tianyi, et al.
Published: (2025)
Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning
by: Bai, Hongbo, et al.
Published: (2026)
by: Bai, Hongbo, et al.
Published: (2026)
Insights from Visual Cognition: Understanding Human Action Dynamics with Overall Glance and Refined Gaze Transformer
by: Xing, Bohao, et al.
Published: (2026)
by: Xing, Bohao, et al.
Published: (2026)
Learning at a Glance: Towards Interpretable Data-limited Continual Semantic Segmentation via Semantic-Invariance Modelling
by: Yuan, Bo, et al.
Published: (2024)
by: Yuan, Bo, et al.
Published: (2024)
Every Angle Is Worth A Second Glance: Mining Kinematic Skeletal Structures from Multi-view Joint Cloud
by: Jiang, Junkun, et al.
Published: (2025)
by: Jiang, Junkun, et al.
Published: (2025)
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
by: Zhang, Jinrong, et al.
Published: (2024)
by: Zhang, Jinrong, et al.
Published: (2024)
MS-Glance: Bio-Insipred Non-semantic Context Vectors and their Applications in Supervising Image Reconstruction
by: Gao, Ziqi, et al.
Published: (2024)
by: Gao, Ziqi, et al.
Published: (2024)
DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features
by: Wang, Letian, et al.
Published: (2024)
by: Wang, Letian, et al.
Published: (2024)
Initialize to Generalize: A Stronger Initialization Pipeline for Sparse-View 3DGS
by: Zhou, Feng, et al.
Published: (2025)
by: Zhou, Feng, et al.
Published: (2025)
A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness
by: Jiang, Lutao, et al.
Published: (2024)
by: Jiang, Lutao, et al.
Published: (2024)
LM-MCVT: A Lightweight Multi-modal Multi-view Convolutional-Vision Transformer Approach for 3D Object Recognition
by: Xiong, Songsong, et al.
Published: (2025)
by: Xiong, Songsong, et al.
Published: (2025)
Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning
by: Leijenaar, Remco F., et al.
Published: (2025)
by: Leijenaar, Remco F., et al.
Published: (2025)
ETTA: Efficient Test-Time Adaptation for Vision-Language Models through Dynamic Embedding Updates
by: Dastmalchi, Hamidreza, et al.
Published: (2025)
by: Dastmalchi, Hamidreza, et al.
Published: (2025)
Towards Open-World Grasping with Large Vision-Language Models
by: Tziafas, Georgios, et al.
Published: (2024)
by: Tziafas, Georgios, et al.
Published: (2024)
Text2Graph VPR: A Text-to-Graph Expert System for Explainable Place Recognition in Changing Environments
by: Yousefzadeh, Saeideh, et al.
Published: (2025)
by: Yousefzadeh, Saeideh, et al.
Published: (2025)
X-Oscar: A Progressive Framework for High-quality Text-guided 3D Animatable Avatar Generation
by: Ma, Yiwei, et al.
Published: (2024)
by: Ma, Yiwei, et al.
Published: (2024)
Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression
by: Dastmalchi, Hamidreza, et al.
Published: (2026)
by: Dastmalchi, Hamidreza, et al.
Published: (2026)
U-Turn Diffusion
by: Behjoo, Hamidreza, et al.
Published: (2023)
by: Behjoo, Hamidreza, et al.
Published: (2023)
A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
by: Zhang, Wenbo, et al.
Published: (2024)
by: Zhang, Wenbo, et al.
Published: (2024)
Progressive Checkerboards for Autoregressive Multiscale Image Generation
by: Eigen, David
Published: (2026)
by: Eigen, David
Published: (2026)
GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition
by: Wang, Tianyue, et al.
Published: (2025)
by: Wang, Tianyue, et al.
Published: (2025)
DrivePTS: A Progressive Learning Framework with Textual and Structural Enhancement for Driving Scene Generation
by: Wang, Zhechao, et al.
Published: (2026)
by: Wang, Zhechao, et al.
Published: (2026)
EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework
by: Wang, Junjue, et al.
Published: (2026)
by: Wang, Junjue, et al.
Published: (2026)
Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models
by: Dastmalchi, Hamidreza, et al.
Published: (2024)
by: Dastmalchi, Hamidreza, et al.
Published: (2024)
Structured Initialization for Vision Transformers
by: Zheng, Jianqiao, et al.
Published: (2025)
by: Zheng, Jianqiao, et al.
Published: (2025)
Twin Co-Adaptive Dialogue for Progressive Image Generation
by: Wang, Jianhui, et al.
Published: (2025)
by: Wang, Jianhui, et al.
Published: (2025)
Enhancing Image Generation Fidelity via Progressive Prompts
by: Xiong, Zhen, et al.
Published: (2025)
by: Xiong, Zhen, et al.
Published: (2025)
Spectral Progressive Diffusion for Efficient Image and Video Generation
by: Xiao, Howard, et al.
Published: (2026)
by: Xiao, Howard, et al.
Published: (2026)
VITAL: Interactive Few-Shot Imitation Learning via Visual Human-in-the-Loop Corrections
by: Kasaei, Hamidreza, et al.
Published: (2024)
by: Kasaei, Hamidreza, et al.
Published: (2024)
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing
by: Iqbal, Hasan, et al.
Published: (2025)
by: Iqbal, Hasan, et al.
Published: (2025)
Hypothesis Testing for Progressive Kernel Estimation and VCM Framework
by: Lin, Zehui, et al.
Published: (2025)
by: Lin, Zehui, et al.
Published: (2025)
HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval
by: Li, Zixu, et al.
Published: (2026)
by: Li, Zixu, et al.
Published: (2026)
Reliable Detection of Minute Targets in High-Resolution Aerial Imagery across Temporal Shifts
by: Gholizadeh, Mohammad Sadegh, et al.
Published: (2025)
by: Gholizadeh, Mohammad Sadegh, et al.
Published: (2025)
Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion
by: Tan, Shiqi, et al.
Published: (2024)
by: Tan, Shiqi, et al.
Published: (2024)
Similar Items
-
GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection
by: Zhang, Huaxin, et al.
Published: (2024) -
FastTracker: Real-Time and Accurate Visual Tracking
by: Hashempoor, Hamidreza, et al.
Published: (2025) -
Glance: Accelerating Diffusion Models with 1 Sample
by: Dong, Zhuobai, et al.
Published: (2025) -
Glance and Focus Reinforcement for Pan-cancer Screening
by: Wu, Linshan, et al.
Published: (2026) -
FeatureSORT: Essential Features for Effective Tracking
by: Hashempoor, Hamidreza, et al.
Published: (2024)