Saved in:
| Main Authors: | Perez, Gustavo, Yu, Stella X. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.04401 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Vision Harnessing Agent for Open Ad-hoc Segmentation
by: Wang, Zilin, et al.
Published: (2026)
by: Wang, Zilin, et al.
Published: (2026)
Co-domain Symmetry for Complex-Valued Deep Learning
by: Singhal, Utkarsh, et al.
Published: (2021)
by: Singhal, Utkarsh, et al.
Published: (2021)
Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering
by: Li, Qing, et al.
Published: (2025)
by: Li, Qing, et al.
Published: (2025)
Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025)
by: Park, Seulki, et al.
Published: (2025)
Next-Embedding Prediction Makes Strong Vision Learners
by: Xu, Sihan, et al.
Published: (2025)
by: Xu, Sihan, et al.
Published: (2025)
Context Matters: Vision-Based Depression Detection Comparing Classical and Deep Approaches
by: Bilalpur, Maneesh, et al.
Published: (2026)
by: Bilalpur, Maneesh, et al.
Published: (2026)
Design description of Wisdom Computing Persperctive
by: Yu, TianYi
Published: (2025)
by: Yu, TianYi
Published: (2025)
SHED Light on Segmentation for Dense Prediction
by: Lee, Seung Hyun, et al.
Published: (2026)
by: Lee, Seung Hyun, et al.
Published: (2026)
Novel View Synthesis from A Few Glimpses via Test-Time Natural Video Completion
by: Xu, Yan, et al.
Published: (2025)
by: Xu, Yan, et al.
Published: (2025)
GeoSANE: Learning Geospatial Representations from Models, Not Data
by: Hanna, Joelle, et al.
Published: (2026)
by: Hanna, Joelle, et al.
Published: (2026)
Silicon Minds versus Human Hearts: The Wisdom of Crowds Beats the Wisdom of AI in Emotion Recognition
by: Akben, Mustafa, et al.
Published: (2025)
by: Akben, Mustafa, et al.
Published: (2025)
The Wisdom of a Crowd of Brains: A Universal Brain Encoder
by: Beliy, Roman, et al.
Published: (2024)
by: Beliy, Roman, et al.
Published: (2024)
Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization
by: Wang, Jiayun, et al.
Published: (2024)
by: Wang, Jiayun, et al.
Published: (2024)
Let Humanoids Hike! Integrative Skill Development on Complex Trails
by: Lin, Kwan-Yee, et al.
Published: (2025)
by: Lin, Kwan-Yee, et al.
Published: (2025)
Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation
by: Banerjee, Sweta, et al.
Published: (2025)
by: Banerjee, Sweta, et al.
Published: (2025)
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation
by: Cuttano, Claudia, et al.
Published: (2024)
by: Cuttano, Claudia, et al.
Published: (2024)
Visually Consistent Hierarchical Image Classification
by: Park, Seulki, et al.
Published: (2024)
by: Park, Seulki, et al.
Published: (2024)
Beyond Kalman Filters: Deep Learning-Based Filters for Improved Object Tracking
by: Adžemović, Momir, et al.
Published: (2024)
by: Adžemović, Momir, et al.
Published: (2024)
FViT: A Focal Vision Transformer with Gabor Filter
by: Shi, Yulong, et al.
Published: (2024)
by: Shi, Yulong, et al.
Published: (2024)
Quantum-enhanced Computer Vision: Going Beyond Classical Algorithms
by: Meli, Natacha Kuete, et al.
Published: (2025)
by: Meli, Natacha Kuete, et al.
Published: (2025)
Vision Transformer-Based Deep Learning for Histologic Classification of Endometrial Cancer
by: Goyal, Manu, et al.
Published: (2023)
by: Goyal, Manu, et al.
Published: (2023)
Test-Time Canonicalization by Foundation Models for Robust Perception
by: Singhal, Utkarsh, et al.
Published: (2025)
by: Singhal, Utkarsh, et al.
Published: (2025)
Learning to Transform for Generalizable Instance-wise Invariance
by: Singhal, Utkarsh, et al.
Published: (2023)
by: Singhal, Utkarsh, et al.
Published: (2023)
There is More to Attention: Statistical Filtering Enhances Explanations in Vision Transformers
by: Ayyar, Meghna P, et al.
Published: (2025)
by: Ayyar, Meghna P, et al.
Published: (2025)
Low-Pass Filtering Improves Behavioral Alignment of Vision Models
by: Wolff, Max, et al.
Published: (2026)
by: Wolff, Max, et al.
Published: (2026)
The Master Key Filters Hypothesis: Deep Filters Are General
by: Babaiee, Zahra, et al.
Published: (2024)
by: Babaiee, Zahra, et al.
Published: (2024)
MVFormer: Diversifying Feature Normalization and Token Mixing for Efficient Vision Transformers
by: Bae, Jongseong, et al.
Published: (2024)
by: Bae, Jongseong, et al.
Published: (2024)
Are you In or Out (of gallery)? Wisdom from the Same-Identity Crowd
by: Bhatta, Aman, et al.
Published: (2025)
by: Bhatta, Aman, et al.
Published: (2025)
Sampling Strategies based on Wisdom of Crowds for Amazon Deforestation Detection
by: Resende, Hugo, et al.
Published: (2024)
by: Resende, Hugo, et al.
Published: (2024)
Speed-up of Vision Transformer Models by Attention-aware Token Filtering
by: Naruko, Takahiro, et al.
Published: (2025)
by: Naruko, Takahiro, et al.
Published: (2025)
IDTrust: Deep Identity Document Quality Detection with Bandpass Filtering
by: Al-Ghadi, Musab, et al.
Published: (2024)
by: Al-Ghadi, Musab, et al.
Published: (2024)
ATCTrack: Aligning Target-Context Cues with Dynamic Target States for Robust Vision-Language Tracking
by: Feng, X., et al.
Published: (2025)
by: Feng, X., et al.
Published: (2025)
ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
by: Zhou, Jingqi, et al.
Published: (2024)
by: Zhou, Jingqi, et al.
Published: (2024)
Hallucination Filtering in Radiology Vision-Language Models Using Discrete Semantic Entropy
by: Wienholt, Patrick, et al.
Published: (2025)
by: Wienholt, Patrick, et al.
Published: (2025)
Taxonomy-Aware Evaluation of Vision-Language Models
by: Snæbjarnarson, Vésteinn, et al.
Published: (2025)
by: Snæbjarnarson, Vésteinn, et al.
Published: (2025)
Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation
by: Voulgaris, Georgios
Published: (2025)
by: Voulgaris, Georgios
Published: (2025)
FGFP: A Fractional Gaussian Filter and Pruning for Deep Neural Networks Compression
by: Tu, Kuan-Ting, et al.
Published: (2025)
by: Tu, Kuan-Ting, et al.
Published: (2025)
Open Ad-hoc Categorization with Contextualized Feature Learning
by: Wang, Zilin, et al.
Published: (2025)
by: Wang, Zilin, et al.
Published: (2025)
Exploring the Efficacy of Group-Normalization in Deep Learning Models for Alzheimer's Disease Classification
by: Habib, Gousia, et al.
Published: (2024)
by: Habib, Gousia, et al.
Published: (2024)
Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
by: Song, Jinsol, et al.
Published: (2025)
by: Song, Jinsol, et al.
Published: (2025)
Similar Items
-
Vision Harnessing Agent for Open Ad-hoc Segmentation
by: Wang, Zilin, et al.
Published: (2026) -
Co-domain Symmetry for Complex-Valued Deep Learning
by: Singhal, Utkarsh, et al.
Published: (2021) -
Learning Normals of Noisy Points by Local Gradient-Aware Surface Filtering
by: Li, Qing, et al.
Published: (2025) -
Free-Grained Hierarchical Visual Recognition
by: Park, Seulki, et al.
Published: (2025) -
Next-Embedding Prediction Makes Strong Vision Learners
by: Xu, Sihan, et al.
Published: (2025)