| Main Authors: | Khalil, Daniel, Liu, Christina, Perona, Pietro, Sun, Jennifer J., Marks, Markus |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.09455 |
Similar Items
SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance
by: Stathatos, Suzanne, et al.
Published: (2025)
A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification
by: Marks, Markus, et al.
Published: (2024)
Diffusion-Based Action Recognition Generalizes to Untrained Domains
by: Guimaraes, Rogerio, et al.
Published: (2025)
Less is More: Discovering Concise Network Explanations
by: Kondapaneni, Neehar, et al.
Published: (2024)
Text-image Alignment for Diffusion-based Perception
by: Kondapaneni, Neehar, et al.
Published: (2023)
Probing the Mid-level Vision Capabilities of Self-Supervised Learning
by: Chen, Xuweiyi, et al.
Published: (2024)
Confidence Intervals for Error Rates in 1:1 Matching Tasks: Critical Statistical Analysis and Recommendations
by: Fogliato, Riccardo, et al.
Published: (2023)
Unsupervised Representation Learning from Sparse Transformation Analysis
by: Song, Yue, et al.
Published: (2024)
A Number Sense as an Emergent Property of the Manipulating Brain
by: Kondapaneni, Neehar, et al.
Published: (2020)
Single View Seafloor Recovery from Imaging Sonar via Differentiable Rendering
by: Brodjian, Sevan, et al.
Published: (2026)
Event-based Facial Keypoint Alignment via Cross-Modal Fusion Attention and Self-Supervised Multi-Event Representation Learning
by: Kang, Donghwa, et al.
Published: (2025)
Linear Mechanisms for Spatiotemporal Reasoning in Vision Language Models
by: Kang, Raphi, et al.
Published: (2026)
SHIC: Shape-Image Correspondences with no Keypoint Supervision
by: Shtedritski, Aleksandar, et al.
Published: (2024)
A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
by: Fogliato, Riccardo, et al.
Published: (2024)
TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning
by: Weber, Ron Shapira, et al.
Published: (2025)
Learning to Make Keypoints Sub-Pixel Accurate
by: Kim, Shinjeong, et al.
Published: (2024)
On the Effect of Image Resolution on Semantic Segmentation
by: Singh, Ritambhara, et al.
Published: (2024)
Multi-Stream Keypoint Attention Network for Sign Language Recognition and Translation
by: Guan, Mo, et al.
Published: (2024)
Is CLIP ideal? No. Can we fix it? Yes!
by: Kang, Raphi, et al.
Published: (2025)
A Rapid Test for Accuracy and Bias of Face Recognition Technology
by: Knott, Manuel, et al.
Published: (2025)
Representational Similarity via Interpretable Visual Concepts
by: Kondapaneni, Neehar, et al.
Published: (2025)
A Self-Supervised Method for Body Part Segmentation and Keypoint Detection of Rat Images
by: Kopácsi, László, et al.
Published: (2024)
Depth-Guided Self-Supervised Human Keypoint Detection via Cross-Modal Distillation
by: Anand, Aman, et al.
Published: (2024)
Learning Keypoints for Robotic Cloth Manipulation using Synthetic Data
by: Lips, Thomas, et al.
Published: (2024)
Non-Contact Physiological Monitoring in Pediatric Intensive Care Units via Adaptive Masking and Self-Supervised Learning
by: Salah, Mohamed Khalil Ben, et al.
Published: (2026)
Multi-modal NeRF Self-Supervision for LiDAR Semantic Segmentation
by: Timoneda, Xavier, et al.
Published: (2024)
ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction
by: Islam, Adeela, et al.
Published: (2025)
Contrastive Learning under Noisy Temporal Self-Supervision for Colonoscopy Videos
by: Parolari, Luca, et al.
Published: (2026)
SelfGeo: Self-supervised and Geodesic-consistent Estimation of Keypoints on Deformable Shapes
by: Zohaib, Mohammad, et al.
Published: (2024)
VoxelKeypointFusion: Generalizable Multi-View Multi-Person Pose Estimation
by: Bermuth, Daniel, et al.
Published: (2024)
LatentKeypointGAN: Controlling Images via Latent Keypoints
by: He, Xingzhe, et al.
Published: (2021)
Representational Difference Explanations
by: Kondapaneni, Neehar, et al.
Published: (2025)
On the Discriminability of Self-Supervised Representation Learning
by: Song, Zeen, et al.
Published: (2024)
Self-Supervised Learning for Endoscopic Video Analysis
by: Hirsch, Roy, et al.
Published: (2023)
Diminishing Returns in Self-Supervised Learning
by: Bridge, Oli, et al.
Published: (2025)
Video-based Surgical Tool-tip and Keypoint Tracking using Multi-frame Context-driven Deep Learning Models
by: Ghanekar, Bhargav, et al.
Published: (2025)
Counting Fish with Temporal Representations of Sonar Video
by: Van Brunt, Kai, et al.
Published: (2025)
Multi-View Crowd Counting With Self-Supervised Learning
by: Mo, Hong, et al.
Published: (2025)
Self-Supervised Contrastive Learning for Multi-Label Images
by: Chen, Jiale
Published: (2025)
Incremental Object Keypoint Learning
by: Liang, Mingfu, et al.
Published: (2025)