| Main Authors: | Li, Chenhao, Ono, Taishi, Uemori, Takeshi, Moriuchi, Yusuke |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04817 |
Similar Items
NeISF++: Neural Incident Stokes Field for Polarized Inverse Rendering of Conductors and Dielectrics
by: Li, Chenhao, et al.
Published: (2024)
Revisiting Active Learning in the Era of Vision Foundation Models
by: Gupte, Sanket Rajan, et al.
Published: (2024)
Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation
by: Kurita, Teppei, et al.
Published: (2024)
Deep Polarization Cues for Single-shot Shape and Subsurface Scattering Estimation
by: Li, Chenhao, et al.
Published: (2024)
Revisiting Model Stitching In the Foundation Model Era
by: Mai, Zheda, et al.
Published: (2026)
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models
by: Zhang, Yue, et al.
Published: (2024)
Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective
by: Xie, Shenghao, et al.
Published: (2024)
Revisiting Automatic Data Curation for Vision Foundation Models in Digital Pathology
by: Chen, Boqi, et al.
Published: (2025)
OmniCD: A Foundational Framework for Remote Sensing Image Change Detection Guided by Multimodal Semantics
by: Sun, Chenhao
Published: (2026)
Mixed-precision Supernet Training from Vision Foundation Models using Low Rank Adapter
by: Sakuma, Yuiko, et al.
Published: (2024)
Revisiting Vision Language Foundations for No-Reference Image Quality Assessment
by: Yadav, Ankit, et al.
Published: (2025)
HitoMi-Cam: A Shape-Agnostic Person Detection Method Using the Spectral Characteristics of Clothing
by: Ono, Shuji
Published: (2025)
Image Segmentation in Foundation Model Era: A Survey
by: Zhou, Tianfei, et al.
Published: (2024)
Zero-shot Shape Classification of Nanoparticles in SEM Images using Vision Foundation Models
by: Barnatan, Freida, et al.
Published: (2025)
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models
by: Chen, Jierun, et al.
Published: (2024)
In the Era of Prompt Learning with Vision-Language Models
by: Jha, Ankit
Published: (2024)
Revisiting Prompt Pretraining of Vision-Language Models
by: Chen, Zhenyuan, et al.
Published: (2024)
Iterated Learning Improves Compositionality in Large Vision-Language Models
by: Zheng, Chenhao, et al.
Published: (2024)
Online Long-term Point Tracking in the Foundation Model Era
by: Aydemir, Görkay
Published: (2025)
ViTamin: Designing Scalable Vision Models in the Vision-Language Era
by: Chen, Jieneng, et al.
Published: (2024)
Reflection Removal Using Recurrent Polarization-to-Polarization Network
by: Bian, Wenjiao, et al.
Published: (2024)
HSViT: Horizontally Scalable Vision Transformer
by: Xu, Chenhao, et al.
Published: (2024)
Revisiting Tampered Scene Text Detection in the Era of Generative AI
by: Qu, Chenfan, et al.
Published: (2024)
PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus
by: Liu, Zhaochen, et al.
Published: (2024)
A Novel Benchmark for Few-Shot Semantic Segmentation in the Era of Foundation Models
by: Bensaid, Reda, et al.
Published: (2024)
Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review
by: Khan, Ufaq, et al.
Published: (2025)
Segmentation-Driven Monocular Shape from Polarization based on Physical Model
by: Zhang, Jinyu, et al.
Published: (2026)
SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models
by: Liu, Bingxi, et al.
Published: (2025)
ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?
by: Bui, Doanh C., et al.
Published: (2025)
Diversity Covariance-Aware Prompt Learning for Vision-Language Models
by: Dong, Songlin, et al.
Published: (2025)
Revisiting Continual Semantic Segmentation with Pre-trained Vision Models
by: Zhang, Duzhen, et al.
Published: (2025)
Are Vision Foundation Models Foundational for Electron Microscopy Image Segmentation?
by: Fuster-Barceló, Caterina, et al.
Published: (2026)
Shape from Polarization of Thermal Emission and Reflection
by: Kitazawa, Kazuma, et al.
Published: (2025)
Sapiens: Foundation for Human Vision Models
by: Khirodkar, Rawal, et al.
Published: (2024)
Revisiting Multimodal Positional Encoding in Vision-Language Models
by: Huang, Jie, et al.
Published: (2025)
Bootstrapping SparseFormers from Vision Foundation Models
by: Gao, Ziteng, et al.
Published: (2023)
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
by: Guo, Jianyuan, et al.
Published: (2024)
Revisiting Shadow Detection from a Vision-Language Perspective
by: Wang, Yonghui, et al.
Published: (2026)
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era
by: Zheng, Xu, et al.
Published: (2025)
Fooling Polarization-based Vision using Locally Controllable Polarizing Projection
by: Li, Zhuoxiao, et al.
Published: (2023)