:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Chenhao, Ono, Taishi, Uemori, Takeshi, Moriuchi, Yusuke
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.04817
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

NeISF++: Neural Incident Stokes Field for Polarized Inverse Rendering of Conductors and Dielectrics
by: Li, Chenhao, et al.
Published: (2024)

Revisiting Active Learning in the Era of Vision Foundation Models
by: Gupte, Sanket Rajan, et al.
Published: (2024)

Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation
by: Kurita, Teppei, et al.
Published: (2024)

Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation
by: Li, Chenhao, et al.
Published: (2024)

Revisiting Model Stitching In the Foundation Model Era
by: Mai, Zheda, et al.
Published: (2026)

Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
by: Zhang, Yue, et al.
Published: (2024)

Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
by: Xie, Shenghao, et al.
Published: (2024)

Revisiting Automatic Data Curation for Vision Foundation Models in Digital Pathology
by: Chen, Boqi, et al.
Published: (2025)

OmniCD: A Foundational Framework for Remote Sensing Image Change Detection Guided by Multimodal Semantics
by: Sun, Chenhao
Published: (2026)

Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter
by: Sakuma, Yuiko, et al.
Published: (2024)

Revisiting Vision Language Foundations for No-Reference Image Quality Assessment
by: Yadav, Ankit, et al.
Published: (2025)

HitoMi-Cam: A Shape-Agnostic Person Detection Method Using the Spectral Characteristics of Clothing
by: Ono, Shuji
Published: (2025)

Image Segmentation in Foundation Model Era: A Survey
by: Zhou, Tianfei, et al.
Published: (2024)

Zero-shot Shape Classification of Nanoparticles in SEM Images using Vision Foundation Models
by: Barnatan, Freida, et al.
Published: (2025)

Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models
by: Chen, Jierun, et al.
Published: (2024)

In the Era of Prompt Learning with Vision-Language Models
by: Jha, Ankit
Published: (2024)

Revisiting Prompt Pretraining of Vision-Language Models
by: Chen, Zhenyuan, et al.
Published: (2024)

Iterated Learning Improves Compositionality in Large Vision-Language Models
by: Zheng, Chenhao, et al.
Published: (2024)

Online Long-term Point Tracking in the Foundation Model Era
by: Aydemir, Görkay
Published: (2025)

ViTamin: Designing Scalable Vision Models in the Vision-Language Era
by: Chen, Jieneng, et al.
Published: (2024)

Reflection Removal Using Recurrent Polarization-to-Polarization Network
by: Bian, Wenjiao, et al.
Published: (2024)

HSViT: Horizontally Scalable Vision Transformer
by: Xu, Chenhao, et al.
Published: (2024)

Revisiting Tampered Scene Text Detection in the Era of Generative AI
by: Qu, Chenfan, et al.
Published: (2024)

PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus
by: Liu, Zhaochen, et al.
Published: (2024)

A Novel Benchmark for Few-Shot Semantic Segmentation in the Era of Foundation Models
by: Bensaid, Reda, et al.
Published: (2024)

Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review
by: Khan, Ufaq, et al.
Published: (2025)

Segmentation-Driven Monocular Shape from Polarization based on Physical Model
by: Zhang, Jinyu, et al.
Published: (2026)

SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models
by: Liu, Bingxi, et al.
Published: (2025)

ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?
by: Bui, Doanh C., et al.
Published: (2025)

Diversity Covariance-Aware Prompt Learning for Vision-Language Models
by: Dong, Songlin, et al.
Published: (2025)

Revisiting Continual Semantic Segmentation with Pre-trained Vision Models
by: Zhang, Duzhen, et al.
Published: (2025)

Are Vision Foundation Models Foundational for Electron Microscopy Image Segmentation?
by: Fuster-Barceló, Caterina, et al.
Published: (2026)

Shape from Polarization of Thermal Emission and Reflection
by: Kitazawa, Kazuma, et al.
Published: (2025)

Sapiens: Foundation for Human Vision Models
by: Khirodkar, Rawal, et al.
Published: (2024)

Revisiting Multimodal Positional Encoding in Vision-Language Models
by: Huang, Jie, et al.
Published: (2025)

Bootstrapping SparseFormers from Vision Foundation Models
by: Gao, Ziteng, et al.
Published: (2023)

Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
by: Guo, Jianyuan, et al.
Published: (2024)

Revisiting Shadow Detection from a Vision-Language Perspective
by: Wang, Yonghui, et al.
Published: (2026)

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
by: Zheng, Xu, et al.
Published: (2025)

Fooling Polarization-based Vision using Locally Controllable Polarizing Projection
by: Li, Zhuoxiao, et al.
Published: (2023)