| Main Authors: | Wei, Ran; Lan, ZhiXiong; Yan, Qing; Song, Ning; Lv, Ming; Ye, LongQing |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.21836 |
Similar Items
Ovis-Image Technical Report
by: Wang, Guo-Hua, et al.
Published: (2025)
MedGemma Technical Report
by: Sellergren, Andrew, et al.
Published: (2025)
LongCat-Image Technical Report
by: Meituan LongCat Team, et al.
Published: (2025)
Qwen-Image-2.0 Technical Report
by: Zhao, Bing, et al.
Published: (2026)
Qwen-Image Technical Report
by: Wu, Chenfei, et al.
Published: (2025)
ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
by: Lan, Qizhen, et al.
Published: (2025)
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
by: Lan, Qizhen, et al.
Published: (2025)
SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
by: Yan, Ke, et al.
Published: (2024)
MedFILIP: Medical Fine-grained Language-Image Pre-training
by: Liang, Xinjie, et al.
Published: (2025)
I-MedSAM: Implicit Medical Image Segmentation with Segment Anything
by: Wei, Xiaobao, et al.
Published: (2023)
ERNIE-Image Technical Report
by: Liu, Jiaxiang, et al.
Published: (2026)
Edge-preserving Image Denoising via Multi-scale Adaptive Statistical Independence Testing
by: Yan, Ruyu, et al.
Published: (2025)
HunyuanImage 3.0 Technical Report
by: Cao, Siyu, et al.
Published: (2025)
Omni-Fusion of Spatial and Spectral for Hyperspectral Image Segmentation
by: Zhang, Qing, et al.
Published: (2025)
Qwen-Image-VAE-2.0 Technical Report
by: Zhang, Zekai, et al.
Published: (2026)
MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
by: Liu, Jiyao, et al.
Published: (2025)
Pseudo-Label Enhanced Cascaded Framework: 2nd Technical Report for LSVOS 2025 VOS Track
by: Yan, An, et al.
Published: (2025)
MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
by: Asiimwe, Arnold Caleb, et al.
Published: (2024)
iFlyBot-VLA Technical Report
by: Zhang, Yuan, et al.
Published: (2025)
LongCat-Video Technical Report
by: Meituan LongCat Team, et al.
Published: (2025)
MultiDiffSense: Diffusion-Based Multi-Modal Visuo-Tactile Image Generation Conditioned on Object Shape and Contact Pose
by: Bhouri, Sirine, et al.
Published: (2026)
Multiple Code Hashing for Efficient Image Retrieval
by: Li, Ming-Wei, et al.
Published: (2020)
SAIL-VL2 Technical Report
by: Yin, Weijie, et al.
Published: (2025)
MedFlowSeg: Flow Matching for Medical Image Segmentation with Frequency-Aware Attention
by: Chen, Zhi, et al.
Published: (2026)
Privacy-Aware Camera 2.0 Technical Report
by: Song, Huan, et al.
Published: (2026)
Seedream 3.0 Technical Report
by: Gao, Yu, et al.
Published: (2025)
Kelix Technical Report
by: Ding, Boyang, et al.
Published: (2026)
Ovis-U1 Technical Report
by: Wang, Guo-Hua, et al.
Published: (2025)
MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
by: Song, Kunpeng, et al.
Published: (2024)
Kling-Omni Technical Report
by: Kling Team, et al.
Published: (2025)
Kimi-VL Technical Report
by: Kimi Team, et al.
Published: (2025)
TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching
by: Li, Yuan-Ming, et al.
Published: (2024)
Controllable Generation with Text-to-Image Diffusion Models: A Survey
by: Cao, Pu, et al.
Published: (2024)
BMIP: Bi-directional Modality Interaction Prompt Learning for VLM
by: Lv, Song-Lin, et al.
Published: (2025)
Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis
by: Zhu, Yitao, et al.
Published: (2025)
Step-GUI Technical Report
by: Yan, Haolong, et al.
Published: (2025)
Logics-Parsing Technical Report
by: Chen, Xiangyang, et al.
Published: (2025)
Visual Detector Compression via Location-Aware Discriminant Analysis
by: Lan, Qizhen, et al.
Published: (2025)
SEMC: Structure-Enhanced Mixture-of-Experts Contrastive Learning for Ultrasound Standard Plane Recognition
by: Cai, Qing, et al.
Published: (2025)
StreamingClaw Technical Report
by: Chen, Jiawei, et al.
Published: (2026)