Library Catalog

Bibliographic Details
Main Authors:	Perez, Gustavo; Yu, Stella X.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.04401

Similar Items

Vision Harnessing Agent for Open Ad-hoc Segmentation
by: Wang, Zilin, et al.
Published: (2026)

Co-domain Symmetry for Complex-Valued Deep Learning
by: Singhal, Utkarsh, et al.
Published: (2021)

Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering
by: Li, Qing, et al.
Published: (2025)

Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025)

Next-Embedding Prediction Makes Strong Vision Learners
by: Xu, Sihan, et al.
Published: (2025)

Context Matters: Vision-Based Depression Detection Comparing Classical and Deep Approaches
by: Bilalpur, Maneesh, et al.
Published: (2026)

Design description of Wisdom Computing Persperctive
by: Yu, TianYi
Published: (2025)

SHED Light on Segmentation for Dense Prediction
by: Lee, Seung Hyun, et al.
Published: (2026)

Novel View Synthesis from A Few Glimpses via Test-Time Natural Video Completion
by: Xu, Yan, et al.
Published: (2025)

GeoSANE: Learning Geospatial Representations from Models, Not Data
by: Hanna, Joelle, et al.
Published: (2026)

Silicon Minds versus Human Hearts: The Wisdom of Crowds Beats the Wisdom of AI in Emotion Recognition
by: Akben, Mustafa, et al.
Published: (2025)

The Wisdom of a Crowd of Brains: A Universal Brain Encoder
by: Beliy, Roman, et al.
Published: (2024)

Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization
by: Wang, Jiayun, et al.
Published: (2024)

Let Humanoids Hike! Integrative Skill Development on Complex Trails
by: Lin, Kwan-Yee, et al.
Published: (2025)

Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation
by: Banerjee, Sweta, et al.
Published: (2025)

SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
by: Cuttano, Claudia, et al.
Published: (2024)

Visually Consistent Hierarchical Image Classification
by: Park, Seulki, et al.
Published: (2024)

Beyond Kalman Filters: Deep Learning-Based Filters for Improved Object Tracking
by: Adžemović, Momir, et al.
Published: (2024)

FViT: A Focal Vision Transformer with Gabor Filter
by: Shi, Yulong, et al.
Published: (2024)

Quantum-enhanced Computer Vision: Going Beyond Classical Algorithms
by: Meli, Natacha Kuete, et al.
Published: (2025)

Vision Transformer-Based Deep Learning for Histologic Classification of Endometrial Cancer
by: Goyal, Manu, et al.
Published: (2023)

Test-Time Canonicalization by Foundation Models for Robust Perception
by: Singhal, Utkarsh, et al.
Published: (2025)

Learning to Transform for Generalizable Instance-wise Invariance
by: Singhal, Utkarsh, et al.
Published: (2023)

There is More to Attention: Statistical Filtering Enhances Explanations in Vision Transformers
by: Ayyar, Meghna P, et al.
Published: (2025)

Low-Pass Filtering Improves Behavioral Alignment of Vision Models
by: Wolff, Max, et al.
Published: (2026)

The Master Key Filters Hypothesis: Deep Filters Are General
by: Babaiee, Zahra, et al.
Published: (2024)

MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
by: Bae, Jongseong, et al.
Published: (2024)

Are you In or Out (of gallery)? Wisdom from the Same-Identity Crowd
by: Bhatta, Aman, et al.
Published: (2025)

Sampling Strategies based on Wisdom of Crowds for Amazon Deforestation Detection
by: Resende, Hugo, et al.
Published: (2024)

Speed-up of Vision Transformer Models by Attention-aware Token Filtering
by: Naruko, Takahiro, et al.
Published: (2025)

IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering
by: Al-Ghadi, Musab, et al.
Published: (2024)

ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
by: Feng, X., et al.
Published: (2025)

ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
by: Zhou, Jingqi, et al.
Published: (2024)

Hallucination Filtering in Radiology Vision-Language Models Using Discrete Semantic Entropy
by: Wienholt, Patrick, et al.
Published: (2025)

Taxonomy-Aware Evaluation of Vision-Language Models
by: Snæbjarnarson, Vésteinn, et al.
Published: (2025)

Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation
by: Voulgaris, Georgios
Published: (2025)

FGFP: A Fractional Gaussian Filter and Pruning for Deep Neural Networks Compression
by: Tu, Kuan-Ting, et al.
Published: (2025)

Open Ad-hoc Categorization with Contextualized Feature Learning
by: Wang, Zilin, et al.
Published: (2025)

Exploring the Efficacy of Group-Normalization in Deep Learning Models for Alzheimer's Disease Classification
by: Habib, Gousia, et al.
Published: (2024)

Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
by: Song, Jinsol, et al.
Published: (2025)