:: Library Catalog

Bibliographic Details
Main Authors:	Song, Peipei; Zhang, Jing; Koniusz, Piotr; Barnes, Nick
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.14821

Similar Items

CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization
by: Ni, Yao, et al.
Published: (2024)

OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection
by: Lu, Changsheng, et al.
Published: (2024)

Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026)

PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
by: Ni, Yao, et al.
Published: (2024)

Feature Hallucination for Self-supervised Action Recognition
by: Wang, Lei, et al.
Published: (2025)

Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting
by: Bo, Yuntian, et al.
Published: (2026)

Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs
by: Zhang, Shan, et al.
Published: (2025)

Learning Time in Static Classifiers
by: Ding, Xi, et al.
Published: (2025)

Subspace Kernel Learning on Tensor Sequences
by: Wang, Lei, et al.
Published: (2026)

Adaptive Multi-head Contrastive Learning
by: Wang, Lei, et al.
Published: (2023)

Video Understanding by Design: How Datasets Shape Architectures and Insights
by: Wang, Lei, et al.
Published: (2025)

ICED: Concept-level Machine Unlearning via Interpretable Concept Decomposition
by: Lin, Shen, et al.
Published: (2026)

Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
by: Ni, Yao, et al.
Published: (2025)

Motion meets Attention: Video Motion Prompts
by: Chen, Qixiang, et al.
Published: (2024)

Graph Your Own Prompt
by: Ding, Xi, et al.
Published: (2025)

Pre-training with Random Orthogonal Projection Image Modeling
by: Haghighat, Maryam, et al.
Published: (2023)

When Spatial meets Temporal in Action Recognition
by: Chen, Huilin, et al.
Published: (2024)

Artemis: Structured Visual Reasoning for Perception Policy Learning
by: Tang, Wei, et al.
Published: (2025)

Through Their Eyes: Fixation-aligned Tuning for Personalized User Emulation
by: Huang, Lingfeng, et al.
Published: (2026)

Uncertainty-DTW for Sequences and Visual Tokens
by: Wang, Lei, et al.
Published: (2026)

Fixation-based Self-calibration for Eye Tracking in VR Headsets
by: Uramune, Ryusei, et al.
Published: (2023)

Hierarchically Robust Zero-shot Vision-language Models
by: Dong, Junhao, et al.
Published: (2026)

Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images
by: Zhang, Shengjun, et al.
Published: (2025)

Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment
by: Wang, Lei, et al.
Published: (2024)

Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
by: Ding, Dexuan, et al.
Published: (2024)

Point-PNG: Conditional Pseudo-Negatives Generation for Point Cloud Pre-Training
by: Mahendren, Sutharsan, et al.
Published: (2024)

SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection
by: Qi, Tianye, et al.
Published: (2025)

Math Blind: Failures in Diagram Understanding Undermine Reasoning in MLLMs
by: Sun, Yanpeng, et al.
Published: (2025)

NumGrad-Pull: Numerical Gradient Guided Tri-plane Representation for Surface Reconstruction from Point Clouds
by: Cui, Ruikai, et al.
Published: (2024)

MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models
by: Li, Yuqi, et al.
Published: (2025)

Deep Learning-Based Fixation Type Prediction for Quality Assurance in Digital Pathology
by: Thaeter, Oskar, et al.
Published: (2026)

Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency
by: Liu, Junming, et al.
Published: (2026)

NoiseSDF2NoiseSDF: Learning Clean Neural Fields from Noisy Supervision
by: Wang, Tengkai, et al.
Published: (2025)

Distinguishing Target and Non-Target Fixations with EEG and Eye Tracking in Realistic Visual Scenes
by: Sharma, Mansi, et al.
Published: (2025)

Enhancing Partially Relevant Video Retrieval with Robust Alignment Learning
by: Zhang, Long, et al.
Published: (2025)

SDI-Paste: Synthetic Dynamic Instance Copy-Paste for Video Instance Segmentation
by: Shrestha, Sahir, et al.
Published: (2024)

Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck
by: Jiang, Yangzhou, et al.
Published: (2024)

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
by: Liu, Zhijian, et al.
Published: (2022)

Rethinking Polyp Segmentation from an Out-of-Distribution Perspective
by: Ji, Ge-Peng, et al.
Published: (2023)

P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
by: Cui, Ruikai, et al.
Published: (2023)