:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wei, Ran, Lan, ZhiXiong, Yan, Qing, Song, Ning, Lv, Ming, Ye, LongQing
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.21836
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Ovis-Image Technical Report
by: Wang, Guo-Hua, et al.
Published: (2025)

MedGemma Technical Report
by: Sellergren, Andrew, et al.
Published: (2025)

LongCat-Image Technical Report
by: Meituan LongCat Team, et al.
Published: (2025)

Qwen-Image-2.0 Technical Report
by: Zhao, Bing, et al.
Published: (2026)

Qwen-Image Technical Report
by: Wu, Chenfei, et al.
Published: (2025)

ACAM-KD: Adaptive and Cooperative Attention Masking for Knowledge Distillation
by: Lan, Qizhen, et al.
Published: (2025)

CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
by: Lan, Qizhen, et al.
Published: (2025)

SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation
by: Yan, Ke, et al.
Published: (2024)

MedFILIP: Medical Fine-grained Language-Image Pre-training
by: Liang, Xinjie, et al.
Published: (2025)

I-MedSAM: Implicit Medical Image Segmentation with Segment Anything
by: Wei, Xiaobao, et al.
Published: (2023)

ERNIE-Image Technical Report
by: Liu, Jiaxiang, et al.
Published: (2026)

Edge-preserving Image Denoising via Multi-scale Adaptive Statistical Independence Testing
by: Yan, Ruyu, et al.
Published: (2025)

HunyuanImage 3.0 Technical Report
by: Cao, Siyu, et al.
Published: (2025)

Omni-Fusion of Spatial and Spectral for Hyperspectral Image Segmentation
by: Zhang, Qing, et al.
Published: (2025)

Qwen-Image-VAE-2.0 Technical Report
by: Zhang, Zekai, et al.
Published: (2026)

MedQ-Bench: Evaluating and Exploring Medical Image Quality Assessment Abilities in MLLMs
by: Liu, Jiyao, et al.
Published: (2025)

Pseudo-Label Enhanced Cascaded Framework: 2nd Technical Report for LSVOS 2025 VOS Track
by: Yan, An, et al.
Published: (2025)

MedAutoCorrect: Image-Conditioned Autocorrection in Medical Reporting
by: Asiimwe, Arnold Caleb, et al.
Published: (2024)

iFlyBot-VLA Technical Report
by: Zhang, Yuan, et al.
Published: (2025)

LongCat-Video Technical Report
by: Meituan LongCat Team, et al.
Published: (2025)

MultiDiffSense: Diffusion-Based Multi-Modal Visuo-Tactile Image Generation Conditioned on Object Shape and Contact Pose
by: Bhouri, Sirine, et al.
Published: (2026)

Multiple Code Hashing for Efficient Image Retrieval
by: Li, Ming-Wei, et al.
Published: (2020)

SAIL-VL2 Technical Report
by: Yin, Weijie, et al.
Published: (2025)

MedFlowSeg: Flow Matching for Medical Image Segmentation with Frequency-Aware Attention
by: Chen, Zhi, et al.
Published: (2026)

Privacy-Aware Camera 2.0 Technical Report
by: Song, Huan, et al.
Published: (2026)

Seedream 3.0 Technical Report
by: Gao, Yu, et al.
Published: (2025)

Kelix Technical Report
by: Ding, Boyang, et al.
Published: (2026)

Ovis-U1 Technical Report
by: Wang, Guo-Hua, et al.
Published: (2025)

MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
by: Song, Kunpeng, et al.
Published: (2024)

Kling-Omni Technical Report
by: Kling Team, et al.
Published: (2025)

Kimi-VL Technical Report
by: Kimi Team, et al.
Published: (2025)

TechCoach: Towards Technical-Point-Aware Descriptive Action Coaching
by: Li, Yuan-Ming, et al.
Published: (2024)

Controllable Generation with Text-to-Image Diffusion Models: A Survey
by: Cao, Pu, et al.
Published: (2024)

BMIP: Bi-directional Modality Interaction Prompt Learning for VLM
by: Lv, Song-Lin, et al.
Published: (2025)

Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis
by: Zhu, Yitao, et al.
Published: (2025)

Step-GUI Technical Report
by: Yan, Haolong, et al.
Published: (2025)

Logics-Parsing Technical Report
by: Chen, Xiangyang, et al.
Published: (2025)

Visual Detector Compression via Location-Aware Discriminant Analysis
by: Lan, Qizhen, et al.
Published: (2025)

SEMC: Structure-Enhanced Mixture-of-Experts Contrastive Learning for Ultrasound Standard Plane Recognition
by: Cai, Qing, et al.
Published: (2025)

StreamingClaw Technical Report
by: Chen, Jiawei, et al.
Published: (2026)