:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wei, Tong, Tolias, Giorgos, Matas, Jiri, Barath, Daniel
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.21963
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Three Things to Know about Deep Metric Learning
by: Patel, Yash, et al.
Published: (2024)

Breaking the Frame: Visual Place Recognition by Overlap Prediction
by: Wei, Tong, et al.
Published: (2024)

LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
by: Stojnić, Vladan, et al.
Published: (2025)

ProbPose: A Probabilistic Approach to 2D Human Pose Estimation
by: Purkrabek, Miroslav, et al.
Published: (2024)

Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle
by: Purkrabek, Miroslav, et al.
Published: (2024)

Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
by: Purkrabek, Miroslav, et al.
Published: (2023)

Human Pose-Constrained UV Map Estimation
by: Suchanek, Matej, et al.
Published: (2025)

ILIAS: Instance-Level Image retrieval At Scale
by: Kordopatis-Zilos, Giorgos, et al.
Published: (2025)

SAM-pose2seg: Pose-Guided Human Instance Segmentation in Crowds
by: Kolomiiets, Constantin, et al.
Published: (2026)

BBoxMaskPose v2: Expanding Mutual Conditioning to 3D
by: Purkrabek, Miroslav, et al.
Published: (2026)

A Dataset for Semantic Segmentation in the Presence of Unknowns
by: Laskar, Zakaria, et al.
Published: (2025)

Robust Context-Aware Object Recognition
by: Janouskova, Klara, et al.
Published: (2025)

ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains
by: Suma, Pavel, et al.
Published: (2026)

AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
by: Suma, Pavel, et al.
Published: (2024)

Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization
by: Efthymiadis, Nikos, et al.
Published: (2024)

Accurate Planar Tracking With Robust Re-Detection
by: Serych, Jonas, et al.
Published: (2026)

Video shutter angle estimation using optical flow and linear blur
by: Korcak, David, et al.
Published: (2023)

Instance-Level Generation for Representation Learning
by: Wu, Yankun, et al.
Published: (2025)

Label Propagation for Zero-shot Classification with Vision-Language Models
by: Stojnić, Vladan, et al.
Published: (2024)

A New Dataset and a Distractor-Aware Architecture for Transparent Object Tracking
by: Lukezic, Alan, et al.
Published: (2024)

SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation
by: Kombol, Naomi, et al.
Published: (2026)

SupeRANSAC: One RANSAC to Rule Them All
by: Barath, Daniel
Published: (2025)

P1AC: Revisiting Absolute Pose From a Single Affine Correspondence
by: Ventura, Jonathan, et al.
Published: (2020)

Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
by: Ramos, Ryan, et al.
Published: (2025)

Indexing Multimodal Language Models for Large-scale Image Retrieval
by: Tharwat, Bahey, et al.
Published: (2026)

Dense Matchers for Dense Tracking
by: Jelínek, Tomáš, et al.
Published: (2024)

MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation
by: Serych, Jonas, et al.
Published: (2024)

Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection
by: Yermakov, Andrii, et al.
Published: (2025)

Animal Identification with Independent Foreground and Background Modeling
by: Picek, Lukas, et al.
Published: (2024)

PixOOD: Pixel-Level Out-of-Distribution Detection
by: Vojíř, Tomáš, et al.
Published: (2024)

Bringing the Context Back into Object Recognition, Robustly
by: Janouskova, Klara, et al.
Published: (2024)

Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
by: Aravanis, Tilemachos, et al.
Published: (2026)

Multiway Point Cloud Mosaicking with Diffusion and Global Optimization
by: Jin, Shengze, et al.
Published: (2024)

Global Structure-from-Motion Revisited
by: Pan, Linfei, et al.
Published: (2024)

LOCORE: Image Re-ranking with Long-Context Sequence Modeling
by: Xiao, Zilin, et al.
Published: (2025)

Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025)

HelixTrack: Event-Based Tracking and RPM Estimation of Propeller-like Objects
by: Spetlik, Radim, et al.
Published: (2026)

The Alpha Blending Hypothesis: Compositing Shortcut in Deepfake Detection
by: Yermakov, Andrii, et al.
Published: (2026)

Koo-Fu CLIP: Closed-Form Adaptation of Vision-Language Models via Fukunaga-Koontz Linear Discriminant Analysis
by: Suchanek, Matej, et al.
Published: (2026)

Multimodal Large Language Models as Image Classifiers
by: Kisel, Nikita, et al.
Published: (2026)