:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lall, Vishakha, Pinto, Capt. Stanley S, Peng, Capt. Chu Xing, Kaiwen, Wu
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.24193
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Eyes on Target: Gaze-Aware Object Detection in Egocentric Video
by: Lall, Vishakha, et al.
Published: (2025)

IV.—Fossil Shells discovered by Capt. Hay, 1st European Regiment, in the neighbourhood of Bajgah, Afghanistan
by: Hay, Capt
Published: (1840)

A Comprehensive Review of Knowledge Distillation in Computer Vision
by: Habib, Gousia, et al.
Published: (2024)

Garbage Vulnerable Point Monitoring using IoT and Computer Vision
by: Kumar, R., et al.
Published: (2025)

Knowledge Distillation in Vision Transformers: A Critical Review
by: Habib, Gousia, et al.
Published: (2023)

Optimizing Vision Transformers with Data-Free Knowledge Transfer
by: Habib, Gousia, et al.
Published: (2024)

Early and Prediagnostic Detection of Pancreatic Cancer from Computed Tomography
by: Li, Wenxuan, et al.
Published: (2026)

Halt the Hallucination: Decoupling Signal and Semantic OOD Detection Based on Cascaded Early Rejection
by: Peng, Ningkang, et al.
Published: (2026)

GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon
by: Pathak, Sanhita, et al.
Published: (2024)

Single Stage Warped Cloth Learning and Semantic-Contextual Attention Feature Fusion for Virtual TryOn
by: Pathak, Sanhita, et al.
Published: (2023)

DiffSTR: Controlled Diffusion Models for Scene Text Removal
by: Pathak, Sanhita, et al.
Published: (2024)

LIB-KD: Teaching Inductive Bias for Efficient Vision Transformer Distillation and Compression
by: Habib, Gousia, et al.
Published: (2023)

VPTracker: Global Vision-Language Tracking via Visual Prompt
by: Wang, Jingchao, et al.
Published: (2025)

Mitigating Information Loss under High Pruning Rates for Efficient Large Vision Language Models
by: Fu, Mingyu, et al.
Published: (2025)

A Unified Perspective on Adversarial Membership Manipulation in Vision Models
by: Gao, Ruize, et al.
Published: (2026)

Tutorial on Diffusion Models for Imaging and Vision
by: Chan, Stanley H.
Published: (2024)

Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection
by: Wang, Kaiwen, et al.
Published: (2024)

Deep Learning-Based Computer Vision Models for Early Cancer Detection Using Multimodal Medical Imaging and Radiogenomic Integration Frameworks
by: Oghenekaro, Emmanuella Avwerosuoghene
Published: (2025)

HiMix: Hierarchical Artifact-aware Mixup for Generalized Synthetic Image Detection
by: Zhou, Shuchang, et al.
Published: (2026)

Leveraging band diversity for feature selection in EO data
by: Hussain, Sadia, et al.
Published: (2025)

Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing
by: Lou, Meng, et al.
Published: (2026)

HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance
by: Rosh, Green, et al.
Published: (2026)

Stride-Net: Fairness-Aware Disentangled Representation Learning for Chest X-Ray Diagnosis
by: Rashid, Darakshan, et al.
Published: (2026)

Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions
by: Wang, Kaiwen, et al.
Published: (2024)

Detecting Omissions in Geographic Maps through Computer Vision
by: Nguyen, Phuc D. A., et al.
Published: (2024)

Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
by: Cambrin, Daniele Rege, et al.
Published: (2024)

Task-based Loss Functions in Computer Vision: A Comprehensive Review
by: Elharrouss, Omar, et al.
Published: (2025)

Federated Vision Transformer with Adaptive Focal Loss for Medical Image Classification
by: Zhao, Xinyuan, et al.
Published: (2026)

A Computer Vision Approach to Estimate the Localized Sea State
by: Vorkapic, Aleksandar, et al.
Published: (2024)

VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection
by: Jin, Zhaohui, et al.
Published: (2024)

Unified Multi-Dataset Training for TBPS
by: Chatterjee, Nilanjana, et al.
Published: (2026)

Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ?
by: Dayanandan, Kailas, et al.
Published: (2024)

An Automatic Detection Method for Hematoma Features in Placental Abruption Ultrasound Images Based on Few-Shot Learning
by: Liu, Xiaoqing, et al.
Published: (2025)

ElderFallGuard: Real-Time IoT and Computer Vision-Based Fall Detection System for Elderly Safety
by: Riahi, Tasrifur, et al.
Published: (2025)

REG: Refined Generalized Focal Loss for Road Asset Detection on Thai Highways Using Vision-Based Detection and Segmentation Models
by: Panboonyuen, Teerapong
Published: (2024)

Detection of Customer Interested Garments in Surveillance Video using Computer Vision
by: Ijjina, Earnest Paul, et al.
Published: (2025)

Continual Segmentation under Joint Nonstationarity
by: Pandey, Prashant, et al.
Published: (2026)

LearnPruner: Rethinking Attention-based Token Pruning in Vision Language Models
by: Takezoe, Rinyoichi, et al.
Published: (2026)

Revisiting the Generalization Problem of Low-level Vision Models Through the Lens of Image Deraining
by: Hu, Jinfan, et al.
Published: (2025)

SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model
by: Ding, Zongcan, et al.
Published: (2025)