| Main Authors: | Yuan, Ruicheng, Zhang, Zhenxuan, Wang, Anbang, Hu, Liwei, Hua, Xiangqian, Peng, Yaya, Luo, Jiawei, Yang, Guang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.19957 |
Similar Items
Towards Trustworthy Selective Generation: Reliability-Guided Diffusion for Ultra-Low-Field to High-Field MRI Synthesis
by: Zhang, Zhenxuan, et al.
Published: (2026)
HiLa: Hierarchical Vision-Language Collaboration for Cancer Survival Prediction
by: Cui, Jiaqi, et al.
Published: (2025)
HiPP-Prune: Hierarchical Preference-Conditioned Structured Pruning for Vision-Language Models
by: Bai, Lincen, et al.
Published: (2026)
PathFL: Multi-Alignment Federated Learning for Pathology Image Segmentation
by: Zhang, Yuan, et al.
Published: (2025)
Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control
by: Wu, Zhe, et al.
Published: (2025)
ScaleKD: Strong Vision Transformers Could Be Excellent Teachers
by: Fan, Jiawei, et al.
Published: (2024)
LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models
by: Hu, Qingqiao, et al.
Published: (2025)
PathoHR: Hierarchical Reasoning for Vision-Language Models in Pathology
by: Huang, Yating, et al.
Published: (2025)
Unleashing Video Language Models for Fine-grained HRCT Report Generation
by: Fang, Yingying, et al.
Published: (2026)
HiCrowd: Hierarchical Crowd Flow Alignment for Dense Human Environments
by: Zhu, Yufei, et al.
Published: (2026)
HiSpatial: Taming Hierarchical 3D Spatial Understanding in Vision-Language Models
by: Liang, Huizhi, et al.
Published: (2026)
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models
by: Zhou, Ziqin, et al.
Published: (2025)
Seeing Through Experts' Eyes: A Foundational Vision Language Model Trained on Radiologists' Gaze and Reasoning
by: Lee, Kinhei, et al.
Published: (2026)
HiMo-CLIP: Modeling Semantic Hierarchy and Monotonicity in Vision-Language Alignment
by: Wu, Ruijia, et al.
Published: (2025)
HiPrune: Hierarchical Attention for Efficient Token Pruning in Vision-Language Models
by: Liu, Jizhihui, et al.
Published: (2025)
HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
by: Lyu, Kailin, et al.
Published: (2026)
Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models
by: Fan, Jiawei, et al.
Published: (2026)
Accurate and Scalable Multimodal Pathology Retrieval via Attentive Vision-Language Alignment
by: Wang, Hongyi, et al.
Published: (2025)
HiBench: Benchmarking LLMs Capability on Hierarchical Structure Reasoning
by: Jiang, Zhuohang, et al.
Published: (2025)
HiMoE-VLA: Hierarchical Mixture-of-Experts for Generalist Vision-Language-Action Policies
by: Du, Zhiying, et al.
Published: (2025)
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis
by: Wang, Jiawei, et al.
Published: (2025)
Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models
by: Li, Chao, et al.
Published: (2025)
HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation
by: Dong, Wenqi, et al.
Published: (2025)
Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models
by: Shi, Lucy Xiaoyang, et al.
Published: (2025)
HiF-DTA: Hierarchical Feature Learning Network for Drug-Target Affinity Prediction
by: Li, Minghui, et al.
Published: (2025)
PathAR: Structure-First Autoregressive Synthesis of Multimodal Pathology Images
by: Zhang, Yuan, et al.
Published: (2026)
On Ihara's lemma for definite unitary groups
by: Yang, Xiangqian
Published: (2025)
Modest-Align: Data-Efficient Alignment for Vision-Language Models
by: Liu, Jiaxiang, et al.
Published: (2025)
Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments
by: Hou, Jiawei, et al.
Published: (2025)
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model
by: Hu, Mengkang, et al.
Published: (2024)
HiMix: Reducing Computational Complexity in Large Vision-Language Models
by: Zhang, Xuange, et al.
Published: (2025)
HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding
by: Xiao, Linhui, et al.
Published: (2024)
Hierarchical Reconstruction of Time-arrow from Multi-time Correlations
by: Cheng, Yijia, et al.
Published: (2026)
Hi-GMAE: Hierarchical Graph Masked Autoencoders
by: Liu, Chuang, et al.
Published: (2024)
HiFusion: Hierarchical Intra-Spot Alignment and Regional Context Fusion for Spatial Gene Expression Prediction from Histopathology
by: Weng, Ziqiao, et al.
Published: (2025)
HiGR: Efficient Generative Slate Recommendation via Hierarchical Planning and Multi-Objective Preference Alignment
by: Pang, Yunsheng, et al.
Published: (2025)
PathFound: An Agentic Multimodal Model Activating Evidence-seeking Pathological Diagnosis
by: Hua, Shengyi, et al.
Published: (2025)
HiMix: Hierarchical Artifact-aware Mixup for Generalized Synthetic Image Detection
by: Zhou, Shuchang, et al.
Published: (2026)
Hi-SAM: A Hierarchical Structure-Aware Multi-modal Framework for Large-Scale Recommendation
by: Pan, Pingjun, et al.
Published: (2026)
MIND-V: Hierarchical World Model for Long-Horizon Robotic Manipulation with RL-based Physical Alignment
by: Zhang, Ruicheng, et al.
Published: (2025)