| Main Authors: | Wei, Tong; Tolias, Giorgos; Matas, Jiri; Barath, Daniel |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.21963 |
Similar Items
Three Things to Know about Deep Metric Learning
by: Patel, Yash, et al.
Published: (2024)
Breaking the Frame: Visual Place Recognition by Overlap Prediction
by: Wei, Tong, et al.
Published: (2024)
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
by: Stojnić, Vladan, et al.
Published: (2025)
ProbPose: A Probabilistic Approach to 2D Human Pose Estimation
by: Purkrabek, Miroslav, et al.
Published: (2024)
Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle
by: Purkrabek, Miroslav, et al.
Published: (2024)
Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
by: Purkrabek, Miroslav, et al.
Published: (2023)
Human Pose-Constrained UV Map Estimation
by: Suchanek, Matej, et al.
Published: (2025)
ILIAS: Instance-Level Image retrieval At Scale
by: Kordopatis-Zilos, Giorgos, et al.
Published: (2025)
SAM-pose2seg: Pose-Guided Human Instance Segmentation in Crowds
by: Kolomiiets, Constantin, et al.
Published: (2026)
BBoxMaskPose v2: Expanding Mutual Conditioning to 3D
by: Purkrabek, Miroslav, et al.
Published: (2026)
A Dataset for Semantic Segmentation in the Presence of Unknowns
by: Laskar, Zakaria, et al.
Published: (2025)
Robust Context-Aware Object Recognition
by: Janouskova, Klara, et al.
Published: (2025)
ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains
by: Suma, Pavel, et al.
Published: (2026)
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval
by: Suma, Pavel, et al.
Published: (2024)
Crafting Distribution Shifts for Validation and Training in Single Source Domain Generalization
by: Efthymiadis, Nikos, et al.
Published: (2024)
Accurate Planar Tracking With Robust Re-Detection
by: Serych, Jonas, et al.
Published: (2026)
Video shutter angle estimation using optical flow and linear blur
by: Korcak, David, et al.
Published: (2023)
Instance-Level Generation for Representation Learning
by: Wu, Yankun, et al.
Published: (2025)
Label Propagation for Zero-shot Classification with Vision-Language Models
by: Stojnić, Vladan, et al.
Published: (2024)
A New Dataset and a Distractor-Aware Architecture for Transparent Object Tracking
by: Lukezic, Alan, et al.
Published: (2024)
SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation
by: Kombol, Naomi, et al.
Published: (2026)
SupeRANSAC: One RANSAC to Rule Them All
by: Barath, Daniel
Published: (2025)
P1AC: Revisiting Absolute Pose From a Single Affine Correspondence
by: Ventura, Jonathan, et al.
Published: (2020)
Processing and acquisition traces in visual encoders: What does CLIP know about your camera?
by: Ramos, Ryan, et al.
Published: (2025)
Indexing Multimodal Language Models for Large-scale Image Retrieval
by: Tharwat, Bahey, et al.
Published: (2026)
Dense Matchers for Dense Tracking
by: Jelínek, Tomáš, et al.
Published: (2024)
MFTIQ: Multi-Flow Tracker with Independent Matching Quality Estimation
by: Serych, Jonas, et al.
Published: (2024)
Unlocking the Hidden Potential of CLIP in Generalizable Deepfake Detection
by: Yermakov, Andrii, et al.
Published: (2025)
Animal Identification with Independent Foreground and Background Modeling
by: Picek, Lukas, et al.
Published: (2024)
PixOOD: Pixel-Level Out-of-Distribution Detection
by: Vojíř, Tomáš, et al.
Published: (2024)
Bringing the Context Back into Object Recognition, Robustly
by: Janouskova, Klara, et al.
Published: (2024)
Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?
by: Aravanis, Tilemachos, et al.
Published: (2026)
Multiway Point Cloud Mosaicking with Diffusion and Global Optimization
by: Jin, Shengze, et al.
Published: (2024)
Global Structure-from-Motion Revisited
by: Pan, Linfei, et al.
Published: (2024)
LOCORE: Image Re-ranking with Long-Context Sequence Modeling
by: Xiao, Zilin, et al.
Published: (2025)
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025)
HelixTrack: Event-Based Tracking and RPM Estimation of Propeller-like Objects
by: Spetlik, Radim, et al.
Published: (2026)
The Alpha Blending Hypothesis: Compositing Shortcut in Deepfake Detection
by: Yermakov, Andrii, et al.
Published: (2026)
Koo-Fu CLIP: Closed-Form Adaptation of Vision-Language Models via Fukunaga-Koontz Linear Discriminant Analysis
by: Suchanek, Matej, et al.
Published: (2026)
Multimodal Large Language Models as Image Classifiers
by: Kisel, Nikita, et al.
Published: (2026)