Saved in:
| Main Authors: | Song, Peipei, Zhang, Jing, Koniusz, Piotr, Barnes, Nick |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.14821 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization
by: Ni, Yao, et al.
Published: (2024)
by: Ni, Yao, et al.
Published: (2024)
OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection
by: Lu, Changsheng, et al.
Published: (2024)
by: Lu, Changsheng, et al.
Published: (2024)
Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026)
by: Ni, Yao, et al.
Published: (2026)
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
by: Ni, Yao, et al.
Published: (2024)
by: Ni, Yao, et al.
Published: (2024)
Feature Hallucination for Self-supervised Action Recognition
by: Wang, Lei, et al.
Published: (2025)
by: Wang, Lei, et al.
Published: (2025)
Focus on Background: Exploring SAM's Potential in Few-shot Medical Image Segmentation with Background-centric Prompting
by: Bo, Yuntian, et al.
Published: (2026)
by: Bo, Yuntian, et al.
Published: (2026)
Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs
by: Zhang, Shan, et al.
Published: (2025)
by: Zhang, Shan, et al.
Published: (2025)
Learning Time in Static Classifiers
by: Ding, Xi, et al.
Published: (2025)
by: Ding, Xi, et al.
Published: (2025)
Subspace Kernel Learning on Tensor Sequences
by: Wang, Lei, et al.
Published: (2026)
by: Wang, Lei, et al.
Published: (2026)
Adaptive Multi-head Contrastive Learning
by: Wang, Lei, et al.
Published: (2023)
by: Wang, Lei, et al.
Published: (2023)
Video Understanding by Design: How Datasets Shape Architectures and Insights
by: Wang, Lei, et al.
Published: (2025)
by: Wang, Lei, et al.
Published: (2025)
ICED: Concept-level Machine Unlearning via Interpretable Concept Decomposition
by: Lin, Shen, et al.
Published: (2026)
by: Lin, Shen, et al.
Published: (2026)
Noise Consistency Regularization for Improved Subject-Driven Image Synthesis
by: Ni, Yao, et al.
Published: (2025)
by: Ni, Yao, et al.
Published: (2025)
Motion meets Attention: Video Motion Prompts
by: Chen, Qixiang, et al.
Published: (2024)
by: Chen, Qixiang, et al.
Published: (2024)
Graph Your Own Prompt
by: Ding, Xi, et al.
Published: (2025)
by: Ding, Xi, et al.
Published: (2025)
Pre-training with Random Orthogonal Projection Image Modeling
by: Haghighat, Maryam, et al.
Published: (2023)
by: Haghighat, Maryam, et al.
Published: (2023)
When Spatial meets Temporal in Action Recognition
by: Chen, Huilin, et al.
Published: (2024)
by: Chen, Huilin, et al.
Published: (2024)
Artemis: Structured Visual Reasoning for Perception Policy Learning
by: Tang, Wei, et al.
Published: (2025)
by: Tang, Wei, et al.
Published: (2025)
Through Their Eyes: Fixation-aligned Tuning for Personalized User Emulation
by: Huang, Lingfeng, et al.
Published: (2026)
by: Huang, Lingfeng, et al.
Published: (2026)
Uncertainty-DTW for Sequences and Visual Tokens
by: Wang, Lei, et al.
Published: (2026)
by: Wang, Lei, et al.
Published: (2026)
Fixation-based Self-calibration for Eye Tracking in VR Headsets
by: Uramune, Ryusei, et al.
Published: (2023)
by: Uramune, Ryusei, et al.
Published: (2023)
Hierarchically Robust Zero-shot Vision-language Models
by: Dong, Junhao, et al.
Published: (2026)
by: Dong, Junhao, et al.
Published: (2026)
Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images
by: Zhang, Shengjun, et al.
Published: (2025)
by: Zhang, Shengjun, et al.
Published: (2025)
Meet JEANIE: a Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment
by: Wang, Lei, et al.
Published: (2024)
by: Wang, Lei, et al.
Published: (2024)
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
by: Ding, Dexuan, et al.
Published: (2024)
by: Ding, Dexuan, et al.
Published: (2024)
Point-PNG: Conditional Pseudo-Negatives Generation for Point Cloud Pre-Training
by: Mahendren, Sutharsan, et al.
Published: (2024)
by: Mahendren, Sutharsan, et al.
Published: (2024)
SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection
by: Qi, Tianye, et al.
Published: (2025)
by: Qi, Tianye, et al.
Published: (2025)
Math Blind: Failures in Diagram Understanding Undermine Reasoning in MLLMs
by: Sun, Yanpeng, et al.
Published: (2025)
by: Sun, Yanpeng, et al.
Published: (2025)
NumGrad-Pull: Numerical Gradient Guided Tri-plane Representation for Surface Reconstruction from Point Clouds
by: Cui, Ruikai, et al.
Published: (2024)
by: Cui, Ruikai, et al.
Published: (2024)
MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models
by: Li, Yuqi, et al.
Published: (2025)
by: Li, Yuqi, et al.
Published: (2025)
Deep Learning-Based Fixation Type Prediction for Quality Assurance in Digital Pathology
by: Thaeter, Oskar, et al.
Published: (2026)
by: Thaeter, Oskar, et al.
Published: (2026)
Self-Evolving Spatial Reasoning in Vision Language Models via Geometric Logic Consistency
by: Liu, Junming, et al.
Published: (2026)
by: Liu, Junming, et al.
Published: (2026)
NoiseSDF2NoiseSDF: Learning Clean Neural Fields from Noisy Supervision
by: Wang, Tengkai, et al.
Published: (2025)
by: Wang, Tengkai, et al.
Published: (2025)
Distinguishing Target and Non-Target Fixations with EEG and Eye Tracking in Realistic Visual Scenes
by: Sharma, Mansi, et al.
Published: (2025)
by: Sharma, Mansi, et al.
Published: (2025)
Enhancing Partially Relevant Video Retrieval with Robust Alignment Learning
by: Zhang, Long, et al.
Published: (2025)
by: Zhang, Long, et al.
Published: (2025)
SDI-Paste: Synthetic Dynamic Instance Copy-Paste for Video Instance Segmentation
by: Shrestha, Sahir, et al.
Published: (2024)
by: Shrestha, Sahir, et al.
Published: (2024)
Learning Unsupervised Gaze Representation via Eye Mask Driven Information Bottleneck
by: Jiang, Yangzhou, et al.
Published: (2024)
by: Jiang, Yangzhou, et al.
Published: (2024)
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
by: Liu, Zhijian, et al.
Published: (2022)
by: Liu, Zhijian, et al.
Published: (2022)
Rethinking Polyp Segmentation from an Out-of-Distribution Perspective
by: Ji, Ge-Peng, et al.
Published: (2023)
by: Ji, Ge-Peng, et al.
Published: (2023)
P2C: Self-Supervised Point Cloud Completion from Single Partial Clouds
by: Cui, Ruikai, et al.
Published: (2023)
by: Cui, Ruikai, et al.
Published: (2023)
Similar Items
-
CHAIN: Enhancing Generalization in Data-Efficient GANs via lipsCHitz continuity constrAIned Normalization
by: Ni, Yao, et al.
Published: (2024) -
OpenKD: Opening Prompt Diversity for Zero- and Few-shot Keypoint Detection
by: Lu, Changsheng, et al.
Published: (2024) -
Possibilistic Predictive Uncertainty for Deep Learning
by: Ni, Yao, et al.
Published: (2026) -
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization
by: Ni, Yao, et al.
Published: (2024) -
Feature Hallucination for Self-supervised Action Recognition
by: Wang, Lei, et al.
Published: (2025)