| Main Authors: | Lall, Vishakha; Pinto, Capt. Stanley S; Peng, Capt. Chu Xing; Kaiwen, Wu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.24193 |
Similar Items
Eyes on Target: Gaze-Aware Object Detection in Egocentric Video
by: Lall, Vishakha, et al.
Published: (2025)
IV.—Fossil Shells discovered by Capt. Hay, 1st European Regiment, in the neighbourhood of Bajgah, Afghanistan
by: Hay, Capt
Published: (1840)
A Comprehensive Review of Knowledge Distillation in Computer Vision
by: Habib, Gousia, et al.
Published: (2024)
Garbage Vulnerable Point Monitoring using IoT and Computer Vision
by: Kumar, R., et al.
Published: (2025)
Knowledge Distillation in Vision Transformers: A Critical Review
by: Habib, Gousia, et al.
Published: (2023)
Optimizing Vision Transformers with Data-Free Knowledge Transfer
by: Habib, Gousia, et al.
Published: (2024)
Early and Prediagnostic Detection of Pancreatic Cancer from Computed Tomography
by: Li, Wenxuan, et al.
Published: (2026)
Halt the Hallucination: Decoupling Signal and Semantic OOD Detection Based on Cascaded Early Rejection
by: Peng, Ningkang, et al.
Published: (2026)
GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon
by: Pathak, Sanhita, et al.
Published: (2024)
Single Stage Warped Cloth Learning and Semantic-Contextual Attention Feature Fusion for Virtual TryOn
by: Pathak, Sanhita, et al.
Published: (2023)
DiffSTR: Controlled Diffusion Models for Scene Text Removal
by: Pathak, Sanhita, et al.
Published: (2024)
LIB-KD: Teaching Inductive Bias for Efficient Vision Transformer Distillation and Compression
by: Habib, Gousia, et al.
Published: (2023)
VPTracker: Global Vision-Language Tracking via Visual Prompt
by: Wang, Jingchao, et al.
Published: (2025)
Mitigating Information Loss under High Pruning Rates for Efficient Large Vision Language Models
by: Fu, Mingyu, et al.
Published: (2025)
A Unified Perspective on Adversarial Membership Manipulation in Vision Models
by: Gao, Ruize, et al.
Published: (2026)
Tutorial on Diffusion Models for Imaging and Vision
by: Chan, Stanley H.
Published: (2024)
Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection
by: Wang, Kaiwen, et al.
Published: (2024)
Deep Learning-Based Computer Vision Models for Early Cancer Detection Using Multimodal Medical Imaging and Radiogenomic Integration Frameworks
by: Oghenekaro, Emmanuella Avwerosuoghene
Published: (2025)
HiMix: Hierarchical Artifact-aware Mixup for Generalized Synthetic Image Detection
by: Zhou, Shuchang, et al.
Published: (2026)
Leveraging band diversity for feature selection in EO data
by: Hussain, Sadia, et al.
Published: (2025)
Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing
by: Lou, Meng, et al.
Published: (2026)
HandDreamer: Zero-Shot Text to 3D Hand Model Generation using Corrective Hand Shape Guidance
by: Rosh, Green, et al.
Published: (2026)
Stride-Net: Fairness-Aware Disentangled Representation Learning for Chest X-Ray Diagnosis
by: Rashid, Darakshan, et al.
Published: (2026)
Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions
by: Wang, Kaiwen, et al.
Published: (2024)
Detecting Omissions in Geographic Maps through Computer Vision
by: Nguyen, Phuc D. A., et al.
Published: (2024)
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning
by: Cambrin, Daniele Rege, et al.
Published: (2024)
Task-based Loss Functions in Computer Vision: A Comprehensive Review
by: Elharrouss, Omar, et al.
Published: (2025)
Federated Vision Transformer with Adaptive Focal Loss for Medical Image Classification
by: Zhao, Xinyuan, et al.
Published: (2026)
A Computer Vision Approach to Estimate the Localized Sea State
by: Vorkapic, Aleksandar, et al.
Published: (2024)
VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection
by: Jin, Zhaohui, et al.
Published: (2024)
Unified Multi-Dataset Training for TBPS
by: Chatterjee, Nilanjana, et al.
Published: (2026)
Dual Thinking and Logical Processing -- Are Multi-modal Large Language Models Closing the Gap with Human Vision ?
by: Dayanandan, Kailas, et al.
Published: (2024)
An Automatic Detection Method for Hematoma Features in Placental Abruption Ultrasound Images Based on Few-Shot Learning
by: Liu, Xiaoqing, et al.
Published: (2025)
ElderFallGuard: Real-Time IoT and Computer Vision-Based Fall Detection System for Elderly Safety
by: Riahi, Tasrifur, et al.
Published: (2025)
REG: Refined Generalized Focal Loss for Road Asset Detection on Thai Highways Using Vision-Based Detection and Segmentation Models
by: Panboonyuen, Teerapong
Published: (2024)
Detection of Customer Interested Garments in Surveillance Video using Computer Vision
by: Ijjina, Earnest Paul, et al.
Published: (2025)
Continual Segmentation under Joint Nonstationarity
by: Pandey, Prashant, et al.
Published: (2026)
LearnPruner: Rethinking Attention-based Token Pruning in Vision Language Models
by: Takezoe, Rinyoichi, et al.
Published: (2026)
Revisiting the Generalization Problem of Low-level Vision Models Through the Lens of Image Deraining
by: Hu, Jinfan, et al.
Published: (2025)
SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model
by: Ding, Zongcan, et al.
Published: (2025)