Guardado en:
| Autores principales: | Xiang, Shuai, Guo, Wei, Burridge, James, Liu, Shouyang, Lu, Hao, Fukatsu, Tokihiro |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2603.27519 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
DODA: Adapting Object Detectors to Dynamic Agricultural Environments in Real-Time with Diffusion
por: Xiang, Shuai, et al.
Publicado: (2024)
por: Xiang, Shuai, et al.
Publicado: (2024)
High-throughput 3D shape completion of potato tubers on a harvester
por: Blok, Pieter M., et al.
Publicado: (2024)
por: Blok, Pieter M., et al.
Publicado: (2024)
Survey of Video Diffusion Models: Foundations, Implementations, and Applications
por: Wang, Yimu, et al.
Publicado: (2025)
por: Wang, Yimu, et al.
Publicado: (2025)
Scalable Object Detection in the Car Interior With Vision Foundation Models
por: Schmidt, Sebastian, et al.
Publicado: (2025)
por: Schmidt, Sebastian, et al.
Publicado: (2025)
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models
por: Zhang, Tiezheng, et al.
Publicado: (2025)
por: Zhang, Tiezheng, et al.
Publicado: (2025)
Harnessing Large Vision and Language Models in Agriculture: A Review
por: Zhu, Hongyan, et al.
Publicado: (2024)
por: Zhu, Hongyan, et al.
Publicado: (2024)
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
por: Huang, Ziyue, et al.
Publicado: (2025)
por: Huang, Ziyue, et al.
Publicado: (2025)
PTDiffusion: Free Lunch for Generating Optical Illusion Hidden Pictures with Phase-Transferred Diffusion Model
por: Gao, Xiang, et al.
Publicado: (2025)
por: Gao, Xiang, et al.
Publicado: (2025)
Vision Foundation Models in Agriculture: Toward Domain-Specific Adaptation for Weed Herbicide Trials Assessment
por: Benito-Del-Valle, Leire, et al.
Publicado: (2025)
por: Benito-Del-Valle, Leire, et al.
Publicado: (2025)
Vision Foundation Models in Remote Sensing: A Survey
por: Lu, Siqi, et al.
Publicado: (2024)
por: Lu, Siqi, et al.
Publicado: (2024)
Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models
por: Liu, Tongkun, et al.
Publicado: (2025)
por: Liu, Tongkun, et al.
Publicado: (2025)
CoD: A Diffusion Foundation Model for Image Compression
por: Jia, Zhaoyang, et al.
Publicado: (2025)
por: Jia, Zhaoyang, et al.
Publicado: (2025)
TasselNetV4: A vision foundation model for cross-scene, cross-scale, and cross-species plant counting
por: Hu, Xiaonan, et al.
Publicado: (2025)
por: Hu, Xiaonan, et al.
Publicado: (2025)
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
por: Guo, Jianyuan, et al.
Publicado: (2024)
por: Guo, Jianyuan, et al.
Publicado: (2024)
VFM-VAE: Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
por: Bi, Tianci, et al.
Publicado: (2025)
por: Bi, Tianci, et al.
Publicado: (2025)
PointRAFT: 3D deep learning for high-throughput prediction of potato tuber weight from partial point clouds
por: Blok, Pieter M., et al.
Publicado: (2025)
por: Blok, Pieter M., et al.
Publicado: (2025)
Sapiens: Foundation for Human Vision Models
por: Khirodkar, Rawal, et al.
Publicado: (2024)
por: Khirodkar, Rawal, et al.
Publicado: (2024)
One for All: Toward Unified Foundation Models for Earth Vision
por: Xiong, Zhitong, et al.
Publicado: (2024)
por: Xiong, Zhitong, et al.
Publicado: (2024)
FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation
por: Xu, Xiang, et al.
Publicado: (2023)
por: Xu, Xiang, et al.
Publicado: (2023)
GuiDINO: Rethinking Vision Foundation Model in Medical Image Segmentation
por: Liang, Zhuonan, et al.
Publicado: (2026)
por: Liang, Zhuonan, et al.
Publicado: (2026)
RePack then Refine: Efficient Diffusion Transformer with Vision Foundation Model
por: Dong, Guanfang, et al.
Publicado: (2025)
por: Dong, Guanfang, et al.
Publicado: (2025)
FSOD-VFM: Few-Shot Object Detection with Vision Foundation Models and Graph Diffusion
por: Feng, Chen-Bin, et al.
Publicado: (2026)
por: Feng, Chen-Bin, et al.
Publicado: (2026)
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing
por: Guo, Sicen, et al.
Publicado: (2025)
por: Guo, Sicen, et al.
Publicado: (2025)
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
por: Li, Jiajia, et al.
Publicado: (2023)
por: Li, Jiajia, et al.
Publicado: (2023)
GEM: Boost Simple Network for Glass Surface Segmentation via Vision Foundation Models
por: Hao, Jing, et al.
Publicado: (2023)
por: Hao, Jing, et al.
Publicado: (2023)
Look Before Acting: Enhancing Vision Foundation Representations for Vision-Language-Action Models
por: Luo, Yulin, et al.
Publicado: (2026)
por: Luo, Yulin, et al.
Publicado: (2026)
FiT: Flexible Vision Transformer for Diffusion Model
por: Lu, Zeyu, et al.
Publicado: (2024)
por: Lu, Zeyu, et al.
Publicado: (2024)
Towards a Unified Copernicus Foundation Model for Earth Vision
por: Wang, Yi, et al.
Publicado: (2025)
por: Wang, Yi, et al.
Publicado: (2025)
Efficient Generation of Targeted and Transferable Adversarial Examples for Vision-Language Models Via Diffusion Models
por: Guo, Qi, et al.
Publicado: (2024)
por: Guo, Qi, et al.
Publicado: (2024)
SARATR-X: Toward Building A Foundation Model for SAR Target Recognition
por: Li, Weijie, et al.
Publicado: (2024)
por: Li, Weijie, et al.
Publicado: (2024)
PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching
por: Nie, Han, et al.
Publicado: (2025)
por: Nie, Han, et al.
Publicado: (2025)
Agriculture-Vision Challenge 2024 -- The Runner-Up Solution for Agricultural Pattern Recognition via Class Balancing and Model Ensemble
por: Liu, Wang, et al.
Publicado: (2024)
por: Liu, Wang, et al.
Publicado: (2024)
Robust SAM: On the Adversarial Robustness of Vision Foundation Models
por: Long, Jiahuan, et al.
Publicado: (2025)
por: Long, Jiahuan, et al.
Publicado: (2025)
MetaFruit Meets Foundation Models: Leveraging a Comprehensive Multi-Fruit Dataset for Advancing Agricultural Foundation Models
por: Li, Jiajia, et al.
Publicado: (2024)
por: Li, Jiajia, et al.
Publicado: (2024)
REOBench: Benchmarking Robustness of Earth Observation Foundation Models
por: Li, Xiang, et al.
Publicado: (2025)
por: Li, Xiang, et al.
Publicado: (2025)
Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities
por: Yan, Xu, et al.
Publicado: (2024)
por: Yan, Xu, et al.
Publicado: (2024)
Neural Residual Diffusion Models for Deep Scalable Vision Generation
por: Ma, Zhiyuan, et al.
Publicado: (2024)
por: Ma, Zhiyuan, et al.
Publicado: (2024)
AgroBench: Vision-Language Model Benchmark in Agriculture
por: Shinoda, Risa, et al.
Publicado: (2025)
por: Shinoda, Risa, et al.
Publicado: (2025)
LandSegmenter: Towards a Flexible Foundation Model for Land Use and Land Cover Mapping
por: Liu, Chenying, et al.
Publicado: (2025)
por: Liu, Chenying, et al.
Publicado: (2025)
CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes
por: Liu, Zhenhuan, et al.
Publicado: (2024)
por: Liu, Zhenhuan, et al.
Publicado: (2024)
Ejemplares similares
-
DODA: Adapting Object Detectors to Dynamic Agricultural Environments in Real-Time with Diffusion
por: Xiang, Shuai, et al.
Publicado: (2024) -
High-throughput 3D shape completion of potato tubers on a harvester
por: Blok, Pieter M., et al.
Publicado: (2024) -
Survey of Video Diffusion Models: Foundations, Implementations, and Applications
por: Wang, Yimu, et al.
Publicado: (2025) -
Scalable Object Detection in the Car Interior With Vision Foundation Models
por: Schmidt, Sebastian, et al.
Publicado: (2025) -
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models
por: Zhang, Tiezheng, et al.
Publicado: (2025)